Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Leslie Pack Kaelbling

Visibility-Aware Navigation Among Movable Obstacles

Dec 06, 2022

Jose Muguira-Iturralde, Aidan Curtis, Yilun Du, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Abstract:In this paper, we examine the problem of visibility-aware robot navigation among movable obstacles (VANAMO). A variant of the well-known NAMO robotic planning problem, VANAMO puts additional visibility constraints on robot motion and object movability. This new problem formulation lifts the restrictive assumption that the map is fully visible and the object positions are fully known. We provide a formal definition of the VANAMO problem and propose the Look and Manipulate Backchaining (LaMB) algorithm for solving such problems. LaMB has a simple vision-based API that makes it more easily transferable to real-world robot applications and scales to the large 3D environments. To evaluate LaMB, we construct a set of tasks that illustrate the complex interplay between visibility and object movability that can arise in mobile base manipulation problems in unknown environments. We show that LaMB outperforms NAMO and visibility-aware motion planning approaches as well as simple combinations of them on complex manipulation problems with partial observability.

Via

Access Paper or Ask Questions

SE(3)-Equivariant Relational Rearrangement with Neural Descriptor Fields

Nov 17, 2022

Anthony Simeonov, Yilun Du, Lin Yen-Chen, Alberto Rodriguez, Leslie Pack Kaelbling, Tomas Lozano-Perez, Pulkit Agrawal

Abstract:We present a method for performing tasks involving spatial relations between novel object instances initialized in arbitrary poses directly from point cloud observations. Our framework provides a scalable way for specifying new tasks using only 5-10 demonstrations. Object rearrangement is formalized as the question of finding actions that configure task-relevant parts of the object in a desired alignment. This formalism is implemented in three steps: assigning a consistent local coordinate frame to the task-relevant object parts, determining the location and orientation of this coordinate frame on unseen object instances, and executing an action that brings these frames into the desired alignment. We overcome the key technical challenge of determining task-relevant local coordinate frames from a few demonstrations by developing an optimization method based on Neural Descriptor Fields (NDFs) and a single annotated 3D keypoint. An energy-based learning scheme to model the joint configuration of the objects that satisfies a desired relational task further improves performance. The method is tested on three multi-object rearrangement tasks in simulation and on a real robot. Project website, videos, and code: https://anthonysimeonov.github.io/r-ndf/

* CoRL 2022, first two authors contributed equally, website and code: https://anthonysimeonov.github.io/r-ndf/

Via

Access Paper or Ask Questions

Learning Operators with Ignore Effects for Bilevel Planning in Continuous Domains

Aug 16, 2022

Nishanth Kumar, Willie McClinton, Rohan Chitnis, Tom Silver, Tomás Lozano-Pérez, Leslie Pack Kaelbling

Figure 1 for Learning Operators with Ignore Effects for Bilevel Planning in Continuous Domains

Figure 2 for Learning Operators with Ignore Effects for Bilevel Planning in Continuous Domains

Figure 3 for Learning Operators with Ignore Effects for Bilevel Planning in Continuous Domains

Figure 4 for Learning Operators with Ignore Effects for Bilevel Planning in Continuous Domains

Abstract:Bilevel planning, in which a high-level search over an abstraction of an environment is used to guide low-level decision making, is an effective approach to solving long-horizon tasks in continuous state and action spaces. Recent work has shown that action abstractions that enable such bilevel planning can be learned in the form of symbolic operators and neural samplers given symbolic predicates and demonstrations that achieve known goals. In this work, we show that existing approaches fall short in environments where actions tend to cause a large number of predicates to change. To address this issue, we propose to learn operators with ignore effects. The key idea motivating our approach is that modeling every observed change in the predicates is unnecessary; the only changes that need be modeled are those that are necessary for high-level search to achieve the specified goal. Experimentally, we show that our approach is able to learn operators with ignore effects across six hybrid robotic domains that enable an agent to solve novel variations of a task, with different initial states, goals, and numbers of objects, significantly more efficiently than several baselines.

Via

Access Paper or Ask Questions

Learning Neuro-Symbolic Skills for Bilevel Planning

Jun 21, 2022

Tom Silver, Ashay Athalye, Joshua B. Tenenbaum, Tomas Lozano-Perez, Leslie Pack Kaelbling

Figure 1 for Learning Neuro-Symbolic Skills for Bilevel Planning

Figure 2 for Learning Neuro-Symbolic Skills for Bilevel Planning

Figure 3 for Learning Neuro-Symbolic Skills for Bilevel Planning

Figure 4 for Learning Neuro-Symbolic Skills for Bilevel Planning

Abstract:Decision-making is challenging in robotics environments with continuous object-centric states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches, such as task and motion planning (TAMP), address these challenges by decomposing decision-making into two or more levels of abstraction. In a setting where demonstrations and symbolic predicates are given, prior work has shown how to learn symbolic operators and neural samplers for TAMP with manually designed parameterized policies. Our main contribution is a method for learning parameterized polices in combination with operators and samplers. These components are packaged into modular neuro-symbolic skills and sequenced together with search-then-sample TAMP to solve new tasks. In experiments in four robotics domains, we show that our approach -- bilevel planning with neuro-symbolic skills -- can solve a wide range of tasks with varying initial states, goals, and objects, outperforming six baselines and ablations. Video: https://youtu.be/PbFZP8rPuGg Code: https://tinyurl.com/skill-learning

Via

Access Paper or Ask Questions

Fully Persistent Spatial Data Structures for Efficient Queries in Path-Dependent Motion Planning Applications

Jun 06, 2022

Sathwik Karnik, Tomás Lozano-Pérez, Leslie Pack Kaelbling, Gustavo Nunes Goretkin

Figure 1 for Fully Persistent Spatial Data Structures for Efficient Queries in Path-Dependent Motion Planning Applications

Figure 2 for Fully Persistent Spatial Data Structures for Efficient Queries in Path-Dependent Motion Planning Applications

Figure 3 for Fully Persistent Spatial Data Structures for Efficient Queries in Path-Dependent Motion Planning Applications

Figure 4 for Fully Persistent Spatial Data Structures for Efficient Queries in Path-Dependent Motion Planning Applications

Abstract:Motion planning is a ubiquitous problem that is often a bottleneck in robotic applications. We demonstrate that motion planning problems such as minimum constraint removal, belief-space planning, and visibility-aware motion planning (VAMP) benefit from a path-dependent formulation, in which the state at a search node is represented implicitly by the path to that node. A naive approach to computing the feasibility of a successor node in such a path-dependent formulation takes time linear in the path length to the node, in contrast to a (possibly very large) constant time for a more typical search formulation. For long-horizon plans, performing this linear-time computation, which we call the lookback, for each node becomes prohibitive. To improve upon this, we introduce the use of a fully persistent spatial data structure (FPSDS), which bounds the size of the lookback. We then focus on the application of the FPSDS in VAMP, which involves incremental geometric computations that can be accelerated by filtering configurations with bounding volumes using nearest-neighbor data structures. We demonstrate an asymptotic and practical improvement in the runtime of finding VAMP solutions in several illustrative domains. To the best of our knowledge, this is the first use of a fully persistent data structure for accelerating motion planning.

* Presented at the 2022 IEEE International Conference on Robotics and Automation (ICRA) and will appear in the official proceedings

Via

Access Paper or Ask Questions

PG3: Policy-Guided Planning for Generalized Policy Generation

Apr 21, 2022

Ryan Yang, Tom Silver, Aidan Curtis, Tomas Lozano-Perez, Leslie Pack Kaelbling

Figure 1 for PG3: Policy-Guided Planning for Generalized Policy Generation

Figure 2 for PG3: Policy-Guided Planning for Generalized Policy Generation

Figure 3 for PG3: Policy-Guided Planning for Generalized Policy Generation

Figure 4 for PG3: Policy-Guided Planning for Generalized Policy Generation

Abstract:A longstanding objective in classical planning is to synthesize policies that generalize across multiple problems from the same domain. In this work, we study generalized policy search-based methods with a focus on the score function used to guide the search over policies. We demonstrate limitations of two score functions and propose a new approach that overcomes these limitations. The main idea behind our approach, Policy-Guided Planning for Generalized Policy Generation (PG3), is that a candidate policy should be used to guide planning on training problems as a mechanism for evaluating that candidate. Theoretical results in a simplified setting give conditions under which PG3 is optimal or admissible. We then study a specific instantiation of policy search where planning problems are PDDL-based and policies are lifted decision lists. Empirical results in six domains confirm that PG3 learns generalized policies more efficiently and effectively than several baselines. Code: https://github.com/ryangpeixu/pg3

* IJCAI 2022

Via

Access Paper or Ask Questions

Inventing Relational State and Action Abstractions for Effective and Efficient Bilevel Planning

Mar 17, 2022

Tom Silver, Rohan Chitnis, Nishanth Kumar, Willie McClinton, Tomas Lozano-Perez, Leslie Pack Kaelbling, Joshua Tenenbaum

Figure 1 for Inventing Relational State and Action Abstractions for Effective and Efficient Bilevel Planning

Figure 2 for Inventing Relational State and Action Abstractions for Effective and Efficient Bilevel Planning

Figure 3 for Inventing Relational State and Action Abstractions for Effective and Efficient Bilevel Planning

Figure 4 for Inventing Relational State and Action Abstractions for Effective and Efficient Bilevel Planning

Abstract:Effective and efficient planning in continuous state and action spaces is fundamentally hard, even when the transition model is deterministic and known. One way to alleviate this challenge is to perform bilevel planning with abstractions, where a high-level search for abstract plans is used to guide planning in the original transition space. In this paper, we develop a novel framework for learning state and action abstractions that are explicitly optimized for both effective (successful) and efficient (fast) bilevel planning. Given demonstrations of tasks in an environment, our data-efficient approach learns relational, neuro-symbolic abstractions that generalize over object identities and numbers. The symbolic components resemble the STRIPS predicates and operators found in AI planning, and the neural components refine the abstractions into actions that can be executed in the environment. Experimentally, we show across four robotic planning environments that our learned abstractions are able to quickly solve held-out tasks of longer horizons than were seen in the demonstrations, and can even outperform the efficiency of abstractions that we manually specified. We also find that as the planner configuration varies, the learned abstractions adapt accordingly, indicating that our abstraction learning method is both "task-aware" and "planner-aware." Code: https://tinyurl.com/predicators-release

Via

Access Paper or Ask Questions

Representation, learning, and planning algorithms for geometric task and motion planning

Mar 09, 2022

Beomjoon Kim, Luke Shimanuki, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Figure 1 for Representation, learning, and planning algorithms for geometric task and motion planning

Figure 2 for Representation, learning, and planning algorithms for geometric task and motion planning

Figure 3 for Representation, learning, and planning algorithms for geometric task and motion planning

Figure 4 for Representation, learning, and planning algorithms for geometric task and motion planning

Abstract:We present a framework for learning to guide geometric task and motion planning (GTAMP). GTAMP is a subclass of task and motion planning in which the goal is to move multiple objects to target regions among movable obstacles. A standard graph search algorithm is not directly applicable, because GTAMP problems involve hybrid search spaces and expensive action feasibility checks. To handle this, we introduce a novel planner that extends basic heuristic search with random sampling and a heuristic function that prioritizes feasibility checking on promising state action pairs. The main drawback of such pure planners is that they lack the ability to learn from planning experience to improve their efficiency. We propose two learning algorithms to address this. The first is an algorithm for learning a rank function that guides the discrete task level search, and the second is an algorithm for learning a sampler that guides the continuous motionlevel search. We propose design principles for designing data efficient algorithms for learning from planning experience and representations for effective generalization. We evaluate our framework in challenging GTAMP problems, and show that we can improve both planning and data efficiency

* International Journal of Robotics Research 2021

Via

Access Paper or Ask Questions

Specifying and achieving goals in open uncertain robot-manipulation domains

Dec 21, 2021

Leslie Pack Kaelbling, Alex LaGrassa, Tomás Lozano-Pérez

Figure 1 for Specifying and achieving goals in open uncertain robot-manipulation domains

Figure 2 for Specifying and achieving goals in open uncertain robot-manipulation domains

Figure 3 for Specifying and achieving goals in open uncertain robot-manipulation domains

Figure 4 for Specifying and achieving goals in open uncertain robot-manipulation domains

Abstract:This paper describes an integrated solution to the problem of describing and interpreting goals for robots in open uncertain domains. Given a formal specification of a desired situation, in which objects are described only by their properties, general-purpose planning and reasoning tools are used to derive appropriate actions for a robot. These goals are carried out through an online combination of hierarchical planning, state-estimation, and execution that operates robustly in real robot domains with substantial occlusion and sensing error.

* Paper completed in 2019

Via

Access Paper or Ask Questions

Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators

Sep 30, 2021

Clement Gehring, Masataro Asai, Rohan Chitnis, Tom Silver, Leslie Pack Kaelbling, Shirin Sohrabi, Michael Katz

Figure 1 for Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators

Figure 2 for Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators

Figure 3 for Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators

Figure 4 for Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators

Abstract:Recent advances in reinforcement learning (RL) have led to a growing interest in applying RL to classical planning domains or applying classical planning methods to some complex RL domains. However, the long-horizon goal-based problems found in classical planning lead to sparse rewards for RL, making direct application inefficient. In this paper, we propose to leverage domain-independent heuristic functions commonly used in the classical planning literature to improve the sample efficiency of RL. These classical heuristics act as dense reward generators to alleviate the sparse-rewards issue and enable our RL agent to learn domain-specific value functions as residuals on these heuristics, making learning easier. Correct application of this technique requires consolidating the discounted metric used in RL and the non-discounted metric used in heuristics. We implement the value functions using Neural Logic Machines, a neural network architecture designed for grounded first-order logic inputs. We demonstrate on several classical planning domains that using classical heuristics for RL allows for good sample efficiency compared to sparse-reward RL. We further show that our learned value functions generalize to novel problem instances in the same domain.

* Equal contributions by the first two authors. This manuscript is significantly updated from the ICAPS PRL (Planning and RL) workshop version with the same title with additional experiments comparing existing work (STRIPS-HGN (Shen, Trevizan, and Thiebaux 2020) and GBFS-GNN (Rivlin, Hazan, and Karpas 2019))

Via

Access Paper or Ask Questions