Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Leslie Pack Kaelbling

Learning compositional models of robot skills for task and motion planning

Jun 08, 2020

Zi Wang, Caelan Reed Garrett, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Figure 1 for Learning compositional models of robot skills for task and motion planning

Figure 2 for Learning compositional models of robot skills for task and motion planning

Figure 3 for Learning compositional models of robot skills for task and motion planning

Figure 4 for Learning compositional models of robot skills for task and motion planning

Abstract:The objective of this work is to augment the basic abilities of a robot by learning to use new sensorimotor primitives to solve complex long-horizon manipulation problems. This requires flexible generative planning that can combine primitive abilities in novel combinations and thus generalize across a wide variety of problems. In order to plan with primitive actions, we must have models of the preconditions and effects of those actions: under what circumstances will executing this primitive successfully achieve some particular effect in the world? We use, and develop novel improvements on, state-of-the-art methods for active learning and sampling. We use Gaussian process methods for learning the conditions of operator effectiveness from small numbers of expensive training examples. We develop adaptive sampling methods for generating a comprehensive and diverse sequence of continuous parameter values (such as pouring waypoints for a cup) configurations and during planning for solving a new task, so that a complete robot plan can be found as efficiently as possible. We demonstrate our approach in an integrated system, combining traditional robotics primitives with our newly learned models using an efficient robot task and motion planner. We evaluate our approach both in simulation and in the real world through measuring the quality of the selected pours and scoops. Finally, we apply our integrated system to a variety of long-horizon simulated and real-world manipulation problems.

* First two authors contributed equally. arXiv admin note: text overlap with arXiv:1803.00967

Via

Access Paper or Ask Questions

Visual Prediction of Priors for Articulated Object Interaction

Jun 06, 2020

Caris Moses, Michael Noseworthy, Leslie Pack Kaelbling, Tomás Lozano-Pérez, Nicholas Roy

Figure 1 for Visual Prediction of Priors for Articulated Object Interaction

Figure 2 for Visual Prediction of Priors for Articulated Object Interaction

Figure 3 for Visual Prediction of Priors for Articulated Object Interaction

Figure 4 for Visual Prediction of Priors for Articulated Object Interaction

Abstract:Exploration in novel settings can be challenging without prior experience in similar domains. However, humans are able to build on prior experience quickly and efficiently. Children exhibit this behavior when playing with toys. For example, given a toy with a yellow and blue door, a child will explore with no clear objective, but once they have discovered how to open the yellow door, they will most likely be able to open the blue door much faster. Adults also exhibit this behavior when entering new spaces such as kitchens. We develop a method, Contextual Prior Prediction, which provides a means of transferring knowledge between interactions in similar domains through vision. We develop agents that exhibit exploratory behavior with increasing efficiency, by learning visual features that are shared across environments, and how they correlate to actions. Our problem is formulated as a Contextual Multi-Armed Bandit where the contexts are images, and the robot has access to a parameterized action space. Given a novel object, the objective is to maximize reward with few interactions. A domain which strongly exhibits correlations between visual features and motion is kinemetically constrained mechanisms. We evaluate our method on simulated prismatic and revolute joints.

* IEEE International Conference on Robotics and Automation (ICRA) 2020

Via

Access Paper or Ask Questions

Meta-learning curiosity algorithms

Mar 11, 2020

Ferran Alet, Martin F. Schneider, Tomas Lozano-Perez, Leslie Pack Kaelbling

Figure 1 for Meta-learning curiosity algorithms

Figure 2 for Meta-learning curiosity algorithms

Figure 3 for Meta-learning curiosity algorithms

Figure 4 for Meta-learning curiosity algorithms

Abstract:We hypothesize that curiosity is a mechanism found by evolution that encourages meaningful exploration early in an agent's life in order to expose it to experiences that enable it to obtain high rewards over the course of its lifetime. We formulate the problem of generating curious behavior as one of meta-learning: an outer loop will search over a space of curiosity mechanisms that dynamically adapt the agent's reward signal, and an inner loop will perform standard reinforcement learning using the adapted reward signal. However, current meta-RL methods based on transferring neural network weights have only generalized between very similar tasks. To broaden the generalization, we instead propose to meta-learn algorithms: pieces of code similar to those designed by humans in ML papers. Our rich language of programs combines neural networks with other building blocks such as buffers, nearest-neighbor modules and custom loss functions. We demonstrate the effectiveness of the approach empirically, finding two novel curiosity algorithms that perform on par or better than human-designed published curiosity algorithms in domains as disparate as grid navigation with image inputs, acrobot, lunar lander, ant and hopper.

* Published in ICLR 2020

Via

Access Paper or Ask Questions

GLIB: Exploration via Goal-Literal Babbling for Lifted Operator Learning

Jan 22, 2020

Rohan Chitnis, Tom Silver, Joshua Tenenbaum, Leslie Pack Kaelbling, Tomas Lozano-Perez

Figure 1 for GLIB: Exploration via Goal-Literal Babbling for Lifted Operator Learning

Figure 2 for GLIB: Exploration via Goal-Literal Babbling for Lifted Operator Learning

Figure 3 for GLIB: Exploration via Goal-Literal Babbling for Lifted Operator Learning

Figure 4 for GLIB: Exploration via Goal-Literal Babbling for Lifted Operator Learning

Abstract:We address the problem of efficient exploration for learning lifted operators in sequential decision-making problems without extrinsic goals or rewards. Inspired by human curiosity, we propose goal-literal babbling (GLIB), a simple and general method for exploration in such problems. GLIB samples goals that are conjunctions of literals, which can be understood as specific, targeted effects that the agent would like to achieve in the world, and plans to achieve these goals using the operators being learned. We conduct a case study to elucidate two key benefits of GLIB: robustness to overly general preconditions and efficient exploration in domains with effects at long horizons. We also provide theoretical guarantees and further empirical results, finding GLIB to be effective on a range of benchmark planning tasks.

Via

Access Paper or Ask Questions

Online Replanning in Belief Space for Partially Observable Task and Motion Problems

Nov 11, 2019

Caelan Reed Garrett, Chris Paxton, Tomás Lozano-Pérez, Leslie Pack Kaelbling, Dieter Fox

Figure 1 for Online Replanning in Belief Space for Partially Observable Task and Motion Problems

Figure 2 for Online Replanning in Belief Space for Partially Observable Task and Motion Problems

Figure 3 for Online Replanning in Belief Space for Partially Observable Task and Motion Problems

Figure 4 for Online Replanning in Belief Space for Partially Observable Task and Motion Problems

Abstract:To solve multi-step manipulation tasks in the real world, an autonomous robot must take actions to observe its environment and react to unexpected observations. This may require opening a drawer to observe its contents or moving an object out of the way to examine the space behind it. If the robot fails to detect an important object, it must update its belief about the world and compute a new plan of action. Additionally, a robot that acts noisily will never exactly arrive at a desired state. Still, it is important that the robot adjusts accordingly in order to keep making progress towards achieving the goal. In this work, we present an online planning and execution system for robots faced with these kinds of challenges. Our approach is able to efficiently solve partially observable problems both in simulation and in a real-world kitchen.

Via

Access Paper or Ask Questions

Differentiable Algorithm Networks for Composable Robot Learning

May 28, 2019

Peter Karkus, Xiao Ma, David Hsu, Leslie Pack Kaelbling, Wee Sun Lee, Tomas Lozano-Perez

Figure 1 for Differentiable Algorithm Networks for Composable Robot Learning

Figure 2 for Differentiable Algorithm Networks for Composable Robot Learning

Figure 3 for Differentiable Algorithm Networks for Composable Robot Learning

Figure 4 for Differentiable Algorithm Networks for Composable Robot Learning

Abstract:This paper introduces the Differentiable Algorithm Network (DAN), a composable architecture for robot learning systems. A DAN is composed of neural network modules, each encoding a differentiable robot algorithm and an associated model; and it is trained end-to-end from data. DAN combines the strengths of model-driven modular system design and data-driven end-to-end learning. The algorithms and models act as structural assumptions to reduce the data requirements for learning; end-to-end learning allows the modules to adapt to one another and compensate for imperfect models and algorithms, in order to achieve the best overall system performance. We illustrate the DAN methodology through a case study on a simulated robot system, which learns to navigate in complex 3-D environments with only local visual observations and an image of a partially correct 2-D floor map.

* RSS 2019 camera ready. Video is available at https://youtu.be/4jcYlTSJF4Y

Via

Access Paper or Ask Questions

Graph Element Networks: adaptive, structured computation and memory

May 13, 2019

Ferran Alet, Adarsh K. Jeewajee, Maria Bauza, Alberto Rodriguez, Tomas Lozano-Perez, Leslie Pack Kaelbling

Figure 1 for Graph Element Networks: adaptive, structured computation and memory

Figure 2 for Graph Element Networks: adaptive, structured computation and memory

Figure 3 for Graph Element Networks: adaptive, structured computation and memory

Figure 4 for Graph Element Networks: adaptive, structured computation and memory

Abstract:We explore the use of graph neural networks (GNNs) to model spatial processes in which there is no a priori graphical structure. Similar to finite element analysis, we assign nodes of a GNN to spatial locations and use a computational process defined on the graph to model the relationship between an initial function defined over a space and a resulting function in the same space. We use GNNs as a computational substrate, and show that the locations of the nodes in space as well as their connectivity can be optimized to focus on the most complex parts of the space. Moreover, this representational strategy allows the learned input-output relationship to generalize over the size of the underlying space and run the same model at different levels of precision, trading computation for accuracy. We demonstrate this method on a traditional PDE problem, a physical prediction problem from robotics, and learning to predict scene images from novel viewpoints.

* Accepted to ICML 2019

Via

Access Paper or Ask Questions

Few-Shot Bayesian Imitation Learning with Logic over Programs

Apr 12, 2019

Tom Silver, Kelsey R. Allen, Alex K. Lew, Leslie Pack Kaelbling, Josh Tenenbaum

Figure 1 for Few-Shot Bayesian Imitation Learning with Logic over Programs

Figure 2 for Few-Shot Bayesian Imitation Learning with Logic over Programs

Figure 3 for Few-Shot Bayesian Imitation Learning with Logic over Programs

Figure 4 for Few-Shot Bayesian Imitation Learning with Logic over Programs

Abstract:We describe an expressive class of policies that can be efficiently learned from a few demonstrations. Policies are represented as logical combinations of programs drawn from a small domain-specific language (DSL). We define a prior over policies with a probabilistic grammar and derive an approximate Bayesian inference algorithm to learn policies from demonstrations. In experiments, we study five strategy games played on a 2D grid with one shared DSL. After a few demonstrations of each game, the inferred policies generalize to new game instances that differ substantially from the demonstrations. We argue that the proposed method is an apt choice for policy learning tasks that have scarce training data and feature significant, structured variation between task instances.

Via

Access Paper or Ask Questions

Every Local Minimum is a Global Minimum of an Induced Model

Apr 07, 2019

Kenji Kawaguchi, Leslie Pack Kaelbling

Figure 1 for Every Local Minimum is a Global Minimum of an Induced Model

Figure 2 for Every Local Minimum is a Global Minimum of an Induced Model

Figure 3 for Every Local Minimum is a Global Minimum of an Induced Model

Figure 4 for Every Local Minimum is a Global Minimum of an Induced Model

Abstract:For non-convex optimization in machine learning, this paper proves that every local minimum achieves the global optimality of the perturbable gradient basis model at any differentiable point. As a result, non-convex machine learning is theoretically as supported as convex machine learning with a hand-crafted basis in terms of the loss at differentiable local minima, except in the case when a preference is given to the hand-crafted basis over the perturbable gradient basis. The proofs of these results are derived under mild assumptions. Accordingly, the proven results are directly applicable to many machine learning models, including practical deep neural networks, without any modification of practical methods. Furthermore, as special cases of our general results, this paper improves or complements several state-of-the-art theoretical results in the literature with a simple and unified proof technique.

Via

Access Paper or Ask Questions

STRIPStream: Integrating Symbolic Planners and Blackbox Samplers

Feb 27, 2019

Caelan Reed Garrett, Tomás Lozano-Pérez, Leslie Pack Kaelbling

Figure 1 for STRIPStream: Integrating Symbolic Planners and Blackbox Samplers

Figure 2 for STRIPStream: Integrating Symbolic Planners and Blackbox Samplers

Figure 3 for STRIPStream: Integrating Symbolic Planners and Blackbox Samplers

Figure 4 for STRIPStream: Integrating Symbolic Planners and Blackbox Samplers

Abstract:Many planning applications involve complex relationships defined on high-dimensional, continuous variables. For example, robotic manipulation requires planning with kinematic, collision, visibility, and motion constraints involving robot configurations, object transforms, and robot trajectories. These constraints typically require specialized procedures to sample satisfying values. We extend the STRIPS planning language to support a generic, declarative specification for these procedures while treating their implementation as black boxes. We also describe cost-sensitive planning within this framework. We provide several domain-independent algorithms that reduce STRIPStream problems to a sequence of finite-domain STRIPS planning problems. Finally, we evaluate our algorithms on three robotic planning domains.

Via

Access Paper or Ask Questions