Leslie Pack Kaelbling

Discovering State and Action Abstractions for Generalized Task and Motion Planning

Sep 23, 2021

Aidan Curtis, Tom Silver, Joshua B. Tenenbaum, Tomas Lozano-Perez, Leslie Pack Kaelbling

Abstract: Generalized planning accelerates classical planning by finding an algorithm-like policy that solves multiple instances of a task. A generalized plan can be learned from a few training examples and applied to an entire domain of problems. Generalized planning approaches perform well in discrete AI planning problems that involve large numbers of objects and extended action sequences to achieve the goal. In this paper, we propose an algorithm for learning features, abstractions, and generalized plans for continuous robotic task and motion planning (TAMP) and examine the unique difficulties that arise when geometric and physical constraints must be considered as part of the generalized plan. Additionally, we show that these simple generalized plans, learned from only a handful of examples, can be used to improve the search efficiency of TAMP solvers.

Long-Horizon Manipulation of Unknown Objects via Task and Motion Planning with Estimated Affordances

Aug 10, 2021

Aidan Curtis, Xiaolin Fang, Leslie Pack Kaelbling, Tomás Lozano-Pérez, Caelan Reed Garrett

Abstract: We present a strategy for designing and building very general robot manipulation systems involving the integration of a general-purpose task-and-motion planner with engineered and learned perception modules that estimate properties and affordances of unknown objects. Such systems are closed-loop policies that map from RGB images, depth images, and robot joint encoder measurements to robot joint position commands. We show that following this strategy a task-and-motion planner can be used to plan intelligent behaviors even in the absence of a priori knowledge regarding the set of manipulable objects, their geometries, and their affordances. We explore several different ways of implementing such perceptual modules for segmentation, property detection, shape estimation, and grasp generation. We show how these modules are integrated within the PDDLStream task and motion planning framework. Finally, we demonstrate that this strategy can enable a single system to perform a wide variety of real-world multi-step manipulation tasks, generalizing over a broad class of objects, object arrangements, and goals, without any prior knowledge of the environment and without re-training.

* The first two authors contributed equally and are listed in alphabetical order

Temporal and Object Quantification Networks

Jun 10, 2021

Jiayuan Mao, Zhezheng Luo, Chuang Gan, Joshua B. Tenenbaum, Jiajun Wu, Leslie Pack Kaelbling, Tomer D. Ullman

Abstract: We present Temporal and Object Quantification Networks (TOQ-Nets), a new class of neuro-symbolic networks with a structural bias that enables them to learn to recognize complex relational-temporal events. This is done by including reasoning layers that implement finite-domain quantification over objects and time. The structure allows them to generalize directly to input instances with varying numbers of objects in temporal sequences of varying lengths. We evaluate TOQ-Nets on input domains that require recognizing event-types in terms of complex temporal relational patterns. We demonstrate that TOQ-Nets can generalize from small amounts of data to scenarios containing more objects than were present during training and to temporal warpings of input sequences.

* IJCAI 2021. First two authors contributed equally. Project page: http://toqnet.csail.mit.edu/
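
The phrase "finite-domain quantification over objects and time" can be illustrated with a small sketch. It is not the TOQ-Net architecture: it assumes soft truth values in [0, 1], models "exists" as a maximum and "for all" as a minimum, and uses hypothetical names (`truth`, `exists_object`, `forall_object`, `eventually`, `always`). The point it shows is that such reductions accept any number of objects and any sequence length.

```python
from typing import List

# truth[t][i] is an assumed soft truth value in [0, 1] for one unary
# predicate evaluated on object i at time step t (hypothetical layout).
Trajectory = List[List[float]]


def exists_object(truth: Trajectory) -> List[float]:
    """Per time step: 'some object satisfies the predicate' (max over objects)."""
    return [max(row) for row in truth]


def forall_object(truth: Trajectory) -> List[float]:
    """Per time step: 'every object satisfies the predicate' (min over objects)."""
    return [min(row) for row in truth]


def eventually(values: List[float]) -> float:
    """'At some time step' (max over time)."""
    return max(values)


def always(values: List[float]) -> float:
    """'At every time step' (min over time)."""
    return min(values)


# The same functions apply to inputs of different sizes.
short = [[0.1, 0.9], [0.2, 0.3]]                               # 2 steps, 2 objects
longer = [[0.1, 0.9, 0.0], [0.2, 0.3, 0.1], [0.8, 0.7, 0.9]]   # 3 steps, 3 objects

print(eventually(forall_object(short)))
print(eventually(forall_object(longer)))
print(always(exists_object(longer)))
```

Because the reductions run over whole axes rather than fixed positions, nothing in the sketch depends on the number of objects or time steps seen earlier.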

Learning Neuro-Symbolic Relational Transition Models for Bilevel Planning

May 28, 2021

Rohan Chitnis, Tom Silver, Joshua B. Tenenbaum, Tomas Lozano-Perez, Leslie Pack Kaelbling

Abstract: Despite recent, independent progress in model-based reinforcement learning and integrated symbolic-geometric robotic planning, synthesizing these techniques remains challenging because of their disparate assumptions and strengths. In this work, we take a step toward bridging this gap with Neuro-Symbolic Relational Transition Models (NSRTs), a novel class of transition models that are data-efficient to learn, compatible with powerful robotic planning methods, and generalizable over objects. NSRTs have both symbolic and neural components, enabling a bilevel planning scheme where symbolic AI planning in an outer loop guides continuous planning with neural models in an inner loop. Experiments in four robotic planning domains show that NSRTs can be learned after only tens or hundreds of training episodes, and then used for fast planning in new tasks that require up to 60 actions to reach the goal and involve many more objects than were seen during training. Video: https://tinyurl.com/chitnis-nsrts

Learning When to Quit: Meta-Reasoning for Motion Planning

Mar 07, 2021

Yoonchang Sung, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Abstract: Anytime motion planners are widely used in robotics. However, the relationship between their solution quality and computation time is not well understood, so it is unclear when to quit planning and start execution. In this paper, we address the problem of deciding when to stop deliberation under bounded computational capacity, so-called meta-reasoning, for anytime motion planning. We propose data-driven learning methods, model-based and model-free meta-reasoning, that are applicable to different environment distributions and agnostic to the choice of anytime motion planners. As a part of the framework, we design a convolutional neural network-based optimal solution predictor that predicts the optimal path length from a given 2D workspace image. We empirically evaluate the performance of the proposed methods in simulation in comparison with baselines.

* 8 pages, 5 figures, Submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
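
The stopping decision described in the abstract can be made concrete with a toy rule. This is a sketch under stated assumptions, not the paper's model-based or model-free method: it assumes a predicted optimal path length is already available, that one more planning step recovers a fixed fraction (`gain_fraction`, hypothetical) of the remaining gap, and that each step has a fixed cost (`time_cost`, hypothetical) in the same units as path length.

```python
from typing import Iterable, Tuple


def expected_gain(current_cost: float, predicted_optimal: float,
                  gain_fraction: float = 0.5) -> float:
    """Assumed improvement from one more planning step: a fixed fraction
    of the gap between the current solution and the predicted optimum."""
    gap = max(0.0, current_cost - predicted_optimal)
    return gain_fraction * gap


def run_until_quit(costs: Iterable[float], predicted_optimal: float,
                   time_cost: float, gain_fraction: float = 0.5
                   ) -> Tuple[int, float]:
    """Consume the solution costs an anytime planner reports, one per step,
    and quit once the expected gain no longer covers the cost of a step.
    Returns (steps used, best cost found)."""
    steps, best = 0, float("inf")
    for cost in costs:
        steps += 1
        best = min(best, cost)
        if expected_gain(best, predicted_optimal, gain_fraction) <= time_cost:
            break
    return steps, best


# Hypothetical trace of path lengths reported by an anytime planner.
trace = [10.0, 7.0, 5.5, 5.1, 5.05]
print(run_until_quit(trace, predicted_optimal=5.0, time_cost=0.2))
```

With this trace the rule quits at the fourth solution, since the remaining gap to the predicted optimum is too small to justify another step; a learned predictor would replace the fixed `gain_fraction` assumption.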

Learning Symbolic Operators for Task and Motion Planning

Feb 28, 2021

Tom Silver, Rohan Chitnis, Joshua Tenenbaum, Leslie Pack Kaelbling, Tomas Lozano-Perez

Abstract: Robotic planning problems in hybrid state and action spaces can be solved by integrated task and motion planners (TAMP) that handle the complex interaction between motion-level decisions and task-level plan feasibility. TAMP approaches rely on domain-specific symbolic operators to guide the task-level search, making planning efficient. In this work, we formalize and study the problem of operator learning for TAMP. Central to this study is the view that operators define a lossy abstraction of the transition model of the underlying domain. We then propose a bottom-up relational learning method for operator learning and show how the learned operators can be used for planning in a TAMP system. Experimentally, we provide results in three domains, including long-horizon robotic planning tasks. We find our approach to substantially outperform several baselines, including three graph neural network-based model-free approaches based on recent work. Video: https://youtu.be/iVfpX9BpBRo

Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time

Oct 14, 2020

Ferran Alet, Kenji Kawaguchi, Maria Bauza, Nurullah Giray Kuru, Tomas Lozano-Perez, Leslie Pack Kaelbling

Abstract: From CNNs to attention mechanisms, encoding inductive biases into neural networks has been a fruitful source of improvement in machine learning. Auxiliary losses are a general way of encoding biases in order to help networks learn better representations by adding extra terms to the loss function. However, since they are minimized on the training data, they suffer from the same generalization gap as regular task losses. Moreover, by changing the loss function, the network is optimizing a different objective than the one we care about. In this work we solve both problems: first, we take inspiration from transductive learning and note that, after receiving an input but before making a prediction, we can fine-tune our models on any unsupervised objective. We call this process tailoring, because we customize the model to each input. Second, we formulate a nested optimization (similar to those in meta-learning) and train our models to perform well on the task loss after adapting to the tailoring loss. The advantages of tailoring and meta-tailoring are discussed theoretically and demonstrated empirically on several diverse examples: encoding inductive conservation laws from physics to improve predictions, improving local smoothness to increase robustness to adversarial examples, and using contrastive losses on the query image to improve generalization.
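
The idea of fine-tuning on an unsupervised objective after receiving an input but before predicting can be sketched on a toy problem. Everything here is an illustrative assumption rather than the paper's setup: a two-weight linear model splits a quantity `x` into two parts, the unsupervised tailoring loss penalizes violating the conservation constraint `a + b = x`, and the step size and step count are hand-picked for this input. The sketch covers only tailoring, not the nested meta-tailoring training.

```python
from typing import List, Tuple


def predict(w: List[float], x: float) -> Tuple[float, float]:
    """Toy model (assumed): split quantity x into parts a = w[0]*x, b = w[1]*x."""
    return w[0] * x, w[1] * x


def tailoring_loss(w: List[float], x: float) -> float:
    """Unsupervised objective: squared violation of conservation a + b = x.
    No label is needed, so it can be evaluated on the query input itself."""
    a, b = predict(w, x)
    return (a + b - x) ** 2


def tailor(w: List[float], x: float, lr: float = 0.05, steps: int = 20) -> List[float]:
    """Return a copy of w adapted to this one input by gradient descent
    on the tailoring loss. The trained weights are left untouched."""
    w = list(w)
    for _ in range(steps):
        a, b = predict(w, x)
        residual = a + b - x
        grad = 2.0 * residual * x  # d(loss)/dw[0] == d(loss)/dw[1]
        w[0] -= lr * grad
        w[1] -= lr * grad
    return w


trained_w = [0.5, 0.4]  # hypothetical weights after ordinary training
x = 2.0
tailored_w = tailor(trained_w, x)

print(tailoring_loss(trained_w, x))   # constraint violated before tailoring
print(tailoring_loss(tailored_w, x))  # close to zero after tailoring
print(predict(tailored_w, x))
```

Each query gets its own adapted copy of the weights, which is what makes the procedure per-input. The fixed learning rate is only stable for inputs of moderate magnitude in this toy model.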

Integrated Task and Motion Planning

Oct 02, 2020

Caelan Reed Garrett, Rohan Chitnis, Rachel Holladay, Beomjoon Kim, Tom Silver, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Abstract: The problem of planning for a robot that operates in environments containing a large number of objects, taking actions to move itself through the world as well as to change the state of the objects, is known as task and motion planning (TAMP). TAMP problems contain elements of discrete task planning, discrete-continuous mathematical programming, and continuous motion planning, and thus cannot be effectively addressed by any of these fields directly. In this paper, we define a class of TAMP problems and survey algorithms for solving them, characterizing the solution methods in terms of their strategies for solving the continuous-space subproblems and their techniques for integrating the discrete and continuous components of the search.

* Accepted to the Annual Review of Control, Robotics, and Autonomous Systems. Vol. 4 (Volume publication date May 2021)

Planning with Learned Object Importance in Large Problem Instances using Graph Neural Networks

Sep 11, 2020

Tom Silver, Rohan Chitnis, Aidan Curtis, Joshua Tenenbaum, Tomas Lozano-Perez, Leslie Pack Kaelbling

Abstract: Real-world planning problems often involve hundreds or even thousands of objects, straining the limits of modern planners. In this work, we address this challenge by learning to predict a small set of objects that, taken together, would be sufficient for finding a plan. We propose a graph neural network architecture for predicting object importance in a single pass, thereby incurring little overhead while substantially reducing the number of objects that must be considered by the planner. Our approach treats the planner and transition model as black boxes, and can be used with any off-the-shelf planner. Empirically, across classical planning, probabilistic planning, and robotic task and motion planning, we find that our method results in planning that is significantly faster than several baselines, including other partial grounding strategies and lifted planners. We conclude that learning to predict a sufficient set of objects for a planning problem is a simple, powerful, and general mechanism for planning in large instances. Video: https://youtu.be/FWsVJc2fvCE
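
Planning over a predicted subset of objects with a black-box planner can be sketched as follows. The sketch makes several assumptions that the abstract does not state: importance scores are given as a plain dictionary (standing in for the graph neural network's output), the threshold is lowered geometrically when planning on the subset fails, and the full object set is used as a last resort. All names (`plan_with_important_objects`, `toy_planner`, `decay`, `min_threshold`) are hypothetical.

```python
from typing import Callable, Dict, List, Optional, Set

Plan = List[str]
Planner = Callable[[Set[str]], Optional[Plan]]  # black box: objects -> plan or None


def plan_with_important_objects(scores: Dict[str, float], planner: Planner,
                                threshold: float = 0.9, decay: float = 0.5,
                                min_threshold: float = 1e-3) -> Optional[Plan]:
    """Plan with only the objects scoring at least `threshold`; if the
    planner fails, lower the threshold (assumed schedule) and retry."""
    while threshold >= min_threshold:
        subset = {obj for obj, score in scores.items() if score >= threshold}
        plan = planner(subset)
        if plan is not None:
            return plan
        threshold *= decay
    # Last resort: hand the planner every object.
    return planner(set(scores))


def toy_planner(objects: Set[str]) -> Optional[Plan]:
    """Stand-in planner: succeeds only if both needed objects are present."""
    if {"key", "door"} <= objects:
        return ["pick key", "open door"]
    return None


# Two relevant objects among 100 low-scoring distractors (made-up scores).
scores = {"key": 0.95, "door": 0.6}
scores.update({f"distractor{i}": 0.01 for i in range(100)})

print(plan_with_important_objects(scores, toy_planner))
```

In this toy run the first attempt considers only `key` and fails, and the second attempt succeeds with two objects, so the planner never sees the distractors.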

CAMPs: Learning Context-Specific Abstractions for Efficient Planning in Factored MDPs

Jul 26, 2020

Rohan Chitnis, Tom Silver, Beomjoon Kim, Leslie Pack Kaelbling, Tomas Lozano-Perez

Abstract: Meta-planning, or learning to guide planning from experience, is a promising approach to reducing the computational cost of planning. A general meta-planning strategy is to learn to impose constraints on the states considered and actions taken by the agent. We observe that (1) imposing a constraint can induce context-specific independences that render some aspects of the domain irrelevant, and (2) an agent can take advantage of this fact by imposing constraints on its own behavior. These observations lead us to propose the context-specific abstract Markov decision process (CAMP), an abstraction of a factored MDP that affords efficient planning. We then describe how to learn constraints to impose so the CAMP optimizes a trade-off between rewards and computational cost. Our experiments consider five planners across four domains, including robotic navigation among movable obstacles (NAMO), robotic task and motion planning for sequential manipulation, and classical planning. We find planning with learned CAMPs to consistently outperform baselines, including Stilman's NAMO-specific algorithm. Video: https://youtu.be/wTXt6djcAd4
