{"id":26477,"date":"2025-06-06T12:20:59","date_gmt":"2025-06-06T06:35:59","guid":{"rendered":"https:\/\/www.revoscience.com\/en\/?p=26477"},"modified":"2025-06-06T12:25:15","modified_gmt":"2025-06-06T06:40:15","slug":"new-system-enables-robots-to-solve-manipulation-problems-in-seconds","status":"publish","type":"post","link":"https:\/\/www.revoscience.com\/en\/new-system-enables-robots-to-solve-manipulation-problems-in-seconds\/","title":{"rendered":"New system enables robots to solve manipulation problems in seconds"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><strong><em>Researchers have developed an algorithm that enables a robot to \u201cthink ahead\u201d and consider thousands of potential motion plans simultaneously.<\/em><\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"900\" height=\"600\" src=\"https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0.webp\" alt=\"\" class=\"wp-image-26478\" title=\"\" srcset=\"https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0.webp 900w, https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-675x450.webp 675w, https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-768x512.webp 768w, https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-150x100.webp 150w\" sizes=\"auto, (max-width: 900px) 100vw, 900px\" \/><\/figure>\n\n\n<div class=\"wp-block-post-author\"><div class=\"wp-block-post-author__content\"><p class=\"wp-block-post-author__name\">Adam Zewe<\/p><\/div><\/div>\n\n\n<p class=\"wp-block-paragraph\">CAMBRIDGE, MA\u2014 Ready for that long-awaited summer vacation? 
First, you\u2019ll need to pack all the items required for your trip into a suitcase, ensuring everything fits securely without crushing any fragile items.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Because humans possess strong visual and geometric reasoning skills, this is usually a straightforward problem, even if it may take a bit of finagling to squeeze everything in.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To a robot, though, it is an extremely complex planning challenge that requires thinking simultaneously about many actions, constraints, and mechanical capabilities. Finding an effective solution could take the robot a very long time, if it can even come up with one.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Researchers from MIT and NVIDIA Research have developed a novel algorithm that dramatically speeds up the robot\u2019s planning process. Their approach enables a robot to \u201cthink ahead\u201d by evaluating thousands of possible solutions in parallel and then refining the best ones to meet the constraints of the robot and its environment.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Instead of testing each potential action one at a time, like many existing approaches, this new method considers thousands of actions simultaneously, solving multistep manipulation problems in a matter of seconds.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The researchers harness the massive computational power of specialized processors called graphics processing units (GPUs) to enable this speedup.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In a factory or warehouse, their technique could enable robots to rapidly determine how to manipulate and tightly pack items that have different shapes and sizes without damaging them, knocking anything over, or colliding with obstacles, even in a narrow space.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cThis would be very helpful in industrial settings where time really does matter and you need to find an effective 
solution as fast as possible. If your algorithm takes minutes to find a plan, as opposed to seconds, that costs the business money,\u201d says MIT graduate student William Shen, SM \u201923, lead author of the\u00a0<a href=\"https:\/\/link.mediaoutreach.meltwater.com\/ls\/click?upn=u001.aGL2w8mpmadAd46sBDLfbJQfXi-2BgjtsRXhSuJl6mKAiDe0FFbczNJtTbDhfoo-2FhNt3mI_Gmh-2FjktplCfWo1o-2BFbkY3J9eYBJUJc-2BSUmMkHo42Dqe4Z0qTEKCmSFnQfWCe8-2B8jgXgQQcW-2Fb1rLKfKZRu-2BLLGScwMYc-2FOCX9RDmpXEBR4BY9i7y-2BNgpMuREG7n76alZMrpgb2Yal13CtFyFK55OF-2Bp03xyDMPU-2FbEy0Y1FyA9bLPAeUYNJNCmQHn-2B-2BcF-2F6xLtrAojf-2B7gkpbqH4kLse-2FuaO1XP-2BaOWPXH3FN8n-2FeLHceAbkyvnQ36l1odMIkiOPFJwFLTk9-2BaWuMfI-2FI0wClJBY0Li9Ydlhw9lxzdk7J5-2FvtiVCVuR-2FaghE4vlIGR84nUITKLtn5FIp9MWLlnX0rRZjMHc7cZXSrnLH37BhfyOIVTztMYkoLEmN5UDxsvsEKXyYGOeRv6gVdgHOZE-2FayA-3D-3D\" target=\"_blank\" rel=\"noreferrer noopener\">paper<\/a>\u00a0on this technique.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">He is joined on the paper by Caelan Garrett \u201915, MEng \u201915, PhD \u201921, a senior research scientist at NVIDIA Research; Nishanth Kumar, an MIT graduate student; Ankit Goyal, an NVIDIA research scientist; Tucker Hermans, an NVIDIA research scientist and associate professor at the University of Utah; Leslie Pack Kaelbling, the Panasonic Professor of Computer Science and Engineering at MIT and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL); Tom\u00e1s Lozano-P\u00e9rez, an MIT professor of computer science and engineering and a member of CSAIL; and Fabio Ramos, principal research scientist at NVIDIA and a professor at the University of Sydney. The research will be presented at the Robotics: Science and Systems Conference.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Planning in parallel<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The researchers\u2019 algorithm is designed for what is called task and motion planning (TAMP). 
The goal of a TAMP algorithm is to come up with a task plan for a robot, which is a high-level sequence of actions, along with a motion plan, which includes low-level action parameters, like joint positions and gripper orientation, that complete that high-level plan.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To create a plan for packing items in a box, a robot needs to reason about many variables, such as the final orientation of packed objects so they fit together, as well as how it is going to pick them up and manipulate them using its arm and gripper.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">It must do this while determining how to avoid collisions and satisfy any user-specified constraints, such as a certain order in which to pack items.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">With so many potential sequences of actions, sampling possible solutions at random and trying one at a time could take an extremely long time.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cIt is a very large search space, and a lot of actions the robot does in that space don\u2019t achieve anything productive,\u201d Garrett adds.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Instead, the researchers\u2019 algorithm, called cuTAMP, which is accelerated using a parallel computing platform called CUDA, simulates and refines thousands of solutions in parallel. It does this by combining two techniques, sampling and optimization.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Sampling involves choosing a solution to try. But rather than sampling solutions randomly, cuTAMP limits the range of potential solutions to those most likely to satisfy the problem\u2019s constraints. This modified sampling procedure allows cuTAMP to broadly explore potential solutions while narrowing down the sampling space.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cOnce we combine the outputs of these samples, we get a much better starting point than if we sampled randomly. 
This ensures we can find solutions more quickly during optimization,\u201d Shen says.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Once cuTAMP has generated that set of samples, it performs a parallelized optimization procedure that computes a cost, which corresponds to how well each sample avoids collisions and satisfies the motion constraints of the robot, as well as any user-defined objectives.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">It updates the samples in parallel, chooses the best candidates, and repeats the process until it narrows them down to a successful solution.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Harnessing accelerated computing<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The researchers leverage GPUs, specialized processors that are far better suited to parallel workloads than general-purpose CPUs, to scale up the number of solutions they can sample and optimize simultaneously. This maximizes the performance of their algorithm.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cUsing GPUs, the computational cost of optimizing one solution is the same as optimizing hundreds or thousands of solutions,\u201d Shen explains.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">When they tested their approach on Tetris-like packing challenges in simulation, cuTAMP took only a few seconds to find successful, collision-free plans that sequential planning approaches might take much longer to find.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">And when deployed on a real robotic arm, the algorithm always found a solution in under 30 seconds.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The system works across robots and has been tested on a robotic arm at MIT and a humanoid robot at NVIDIA. 
Since cuTAMP is not a machine-learning algorithm, it requires no training data, which could enable it to be readily deployed in many situations.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cYou can give it a brand-new problem, and it will provably solve it,\u201d Garrett says.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The algorithm is generalizable to situations beyond packing, like a robot using tools. A user could&nbsp;incorporate different skill types into the system to expand a robot\u2019s capabilities automatically.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In the future, the researchers want to&nbsp;<a href=\"https:\/\/link.mediaoutreach.meltwater.com\/ls\/click?upn=u001.aGL2w8mpmadAd46sBDLfbNvVHjDa-2BGvCfsXyNRHCngTDUK-2BmVJ1JprgSvuUt3H7q3hiX_Gmh-2FjktplCfWo1o-2BFbkY3J9eYBJUJc-2BSUmMkHo42Dqe4Z0qTEKCmSFnQfWCe8-2B8jgXgQQcW-2Fb1rLKfKZRu-2BLLGScwMYc-2FOCX9RDmpXEBR4BY9i7y-2BNgpMuREG7n76alZMrpgb2Yal13CtFyFK55OF-2Bp03xyDMPU-2FbEy0Y1FyA9bLPAeUYNJNCmQHn-2B-2BcF-2F6xLtrAojf-2B7gkpbqH4kLse-2FuaO1XP-2BaOWPXH3FN8n-2FeLEYYtw5-2FWcNwY0j0opisSe-2B-2FgNeNFReyOA7c9u1T2CYC9JOEc36I-2F9M8Adgf8e9GA0YoTmbHbOzLk6ECbZLEiUMR06JpuvTbKAD2Vb4UBpe-2Bl9JaJpzTvCh2UVjQf-2F9d0OnxRD79mLjS4o4qgeWdAAA03Ja-2BcnmphFnT-2BlEwHHOrg-3D-3D\" rel=\"noreferrer noopener\" target=\"_blank\">leverage large language models and vision language models<\/a>&nbsp;within cuTAMP, enabling a robot to formulate and execute a plan that achieves specific objectives based on voice commands from a user.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This work is supported, in part, by the National Science Foundation (NSF), Air Force Office for Scientific Research, Office of Naval Research, MIT Quest for Intelligence, NVIDIA, and the Robotics and Artificial Intelligence Institute.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Ready for that long-awaited summer vacation? 
First, you\u2019ll need to pack all the items required for your trip into a suitcase, ensuring everything fits securely without crushing any fragile items.<\/p>\n","protected":false},"author":2,"featured_media":26478,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[43,17],"tags":[],"class_list":["post-26477","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-computer-science","category-research"],"featured_image_urls":{"full":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0.webp",900,600,false],"thumbnail":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-200x200.webp",200,200,true],"medium":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-675x450.webp",675,450,true],"medium_large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-768x512.webp",750,500,true],"large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0.webp",750,500,false],"1536x1536":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0.webp",900,600,false],"2048x2048":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0.webp",900,600,false],"ultp_layout_landscape_large":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0.webp",900,600,false],"ultp_layout_landscape":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-870x570.webp",870,570,true],"ultp_layout_portrait":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-600x600.webp",600,600,true],"ultp_layout_square":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Pl
anning-01_0-600x600.webp",600,600,true],"newspaper-x-single-post":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-760x490.webp",760,490,true],"newspaper-x-recent-post-big":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-550x360.webp",550,360,true],"newspaper-x-recent-post-list-image":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-95x65.webp",95,65,true],"web-stories-poster-portrait":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-640x600.webp",640,600,true],"web-stories-publisher-logo":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-96x96.webp",96,96,true],"web-stories-thumbnail":["https:\/\/www.revoscience.com\/en\/wp-content\/uploads\/2025\/06\/MIT_Parallel-Planning-01_0-150x100.webp",150,100,true]},"author_info":{"info":["Adam Zewe"]},"category_info":"<a href=\"https:\/\/www.revoscience.com\/en\/category\/computer-science\/\" rel=\"category tag\">Computer Science<\/a> <a href=\"https:\/\/www.revoscience.com\/en\/category\/news\/research\/\" rel=\"category 
tag\">Research<\/a>","tag_info":"Research","comment_count":"0","_links":{"self":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts\/26477","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/comments?post=26477"}],"version-history":[{"count":2,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts\/26477\/revisions"}],"predecessor-version":[{"id":26481,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/posts\/26477\/revisions\/26481"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/media\/26478"}],"wp:attachment":[{"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/media?parent=26477"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/categories?post=26477"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.revoscience.com\/en\/wp-json\/wp\/v2\/tags?post=26477"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}