Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Leslie Pack Kaelbling

Integrating Human-Provided Information Into Belief State Representation Using Dynamic Factorization

Jul 30, 2018

Rohan Chitnis, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Figure 1 for Integrating Human-Provided Information Into Belief State Representation Using Dynamic Factorization

Figure 2 for Integrating Human-Provided Information Into Belief State Representation Using Dynamic Factorization

Figure 3 for Integrating Human-Provided Information Into Belief State Representation Using Dynamic Factorization

Figure 4 for Integrating Human-Provided Information Into Belief State Representation Using Dynamic Factorization

Abstract:In partially observed environments, it can be useful for a human to provide the robot with declarative information that represents probabilistic relational constraints on properties of objects in the world, augmenting the robot's sensory observations. For instance, a robot tasked with a search-and-rescue mission may be informed by the human that two victims are probably in the same room. An important question arises: how should we represent the robot's internal knowledge so that this information is correctly processed and combined with raw sensory information? In this paper, we provide an efficient belief state representation that dynamically selects an appropriate factoring, combining aspects of the belief when they are correlated through information and separating them when they are not. This strategy works in open domains, in which the set of possible objects is not known in advance, and provides significant improvements in inference time over a static factoring, leading to more efficient planning for complex partially observed tasks. We validate our approach experimentally in two open-domain planning problems: a 2D discrete gridworld task and a 3D continuous cooking task. A supplementary video can be found at http://tinyurl.com/chitnis-iros-18.

* IROS 2018 final version

Via

Access Paper or Ask Questions

Learning to guide task and motion planning using score-space representation

Jul 26, 2018

Beomjoon Kim, Zi Wang, Leslie Pack Kaelbling, Tomas Lozano-Perez

Figure 1 for Learning to guide task and motion planning using score-space representation

Figure 2 for Learning to guide task and motion planning using score-space representation

Figure 3 for Learning to guide task and motion planning using score-space representation

Figure 4 for Learning to guide task and motion planning using score-space representation

Abstract:In this paper, we propose a learning algorithm that speeds up the search in task and motion planning problems. Our algorithm proposes solutions to three different challenges that arise in learning to improve planning efficiency: what to predict, how to represent a planning problem instance, and how to transfer knowledge from one problem instance to another. We propose a method that predicts constraints on the search space based on a generic representation of a planning problem instance, called score-space, where we represent a problem instance in terms of the performance of a set of solutions attempted so far. Using this representation, we transfer knowledge, in the form of constraints, from previous problems based on the similarity in score space. We design a sequential algorithm that efficiently predicts these constraints, and evaluate it in three different challenging task and motion planning problems. Results indicate that our approach performs orders of magnitudes faster than an unguided planner

Via

Access Paper or Ask Questions

Selecting Representative Examples for Program Synthesis

Jun 07, 2018

Yewen Pu, Zachery Miranda, Armando Solar-Lezama, Leslie Pack Kaelbling

Figure 1 for Selecting Representative Examples for Program Synthesis

Figure 2 for Selecting Representative Examples for Program Synthesis

Figure 3 for Selecting Representative Examples for Program Synthesis

Figure 4 for Selecting Representative Examples for Program Synthesis

Abstract:Program synthesis is a class of regression problems where one seeks a solution, in the form of a source-code program, mapping the inputs to their corresponding outputs exactly. Due to its precise and combinatorial nature, program synthesis is commonly formulated as a constraint satisfaction problem, where input-output examples are encoded as constraints and solved with a constraint solver. A key challenge of this formulation is scalability: while constraint solvers work well with a few well-chosen examples, a large set of examples can incur significant overhead in both time and memory. We describe a method to discover a subset of examples that is both small and representative: the subset is constructed iteratively, using a neural network to predict the probability of unchosen examples conditioned on the chosen examples in the subset, and greedily adding the least probable example. We empirically evaluate the representativeness of the subsets constructed by our method, and demonstrate such subsets can significantly improve synthesis time and stability.

Via

Access Paper or Ask Questions

Generalization in Deep Learning

Feb 22, 2018

Kenji Kawaguchi, Leslie Pack Kaelbling, Yoshua Bengio

Figure 1 for Generalization in Deep Learning

Figure 2 for Generalization in Deep Learning

Figure 3 for Generalization in Deep Learning

Figure 4 for Generalization in Deep Learning

Abstract:With a direct analysis of neural networks, this paper presents a mathematically tight generalization theory to partially address an open problem regarding the generalization of deep learning. Unlike previous bound-based theory, our main theory is quantitatively as tight as possible for every dataset individually, while producing qualitative insights competitively. Our results give insight into why and how deep learning can generalize well, despite its large capacity, complexity, possible algorithmic instability, nonrobustness, and sharp minima, answering to an open question in the literature. We also discuss limitations of our results and propose additional open problems.

* Extended version: all previous results remain unchanged and new theoretical results were added with improved presentation

Via

Access Paper or Ask Questions

FFRob: Leveraging Symbolic Planning for Efficient Task and Motion Planning

Dec 01, 2017

Caelan Reed Garrett, Tomas Lozano-Perez, Leslie Pack Kaelbling

Figure 1 for FFRob: Leveraging Symbolic Planning for Efficient Task and Motion Planning

Figure 2 for FFRob: Leveraging Symbolic Planning for Efficient Task and Motion Planning

Figure 3 for FFRob: Leveraging Symbolic Planning for Efficient Task and Motion Planning

Figure 4 for FFRob: Leveraging Symbolic Planning for Efficient Task and Motion Planning

Abstract:Mobile manipulation problems involving many objects are challenging to solve due to the high dimensionality and multi-modality of their hybrid configuration spaces. Planners that perform a purely geometric search are prohibitively slow for solving these problems because they are unable to factor the configuration space. Symbolic task planners can efficiently construct plans involving many variables but cannot represent the geometric and kinematic constraints required in manipulation. We present the FFRob algorithm for solving task and motion planning problems. First, we introduce Extended Action Specification (EAS) as a general purpose planning representation that supports arbitrary predicates as conditions. We adapt existing heuristic search ideas for solving \proc{strips} planning problems, particularly delete-relaxations, to solve EAS problem instances. We then apply the EAS representation and planners to manipulation problems resulting in FFRob. FFRob iteratively discretizes task and motion planning problems using batch sampling of manipulation primitives and a multi-query roadmap structure that can be conditionalized to evaluate reachability under different placements of movable objects. This structure enables the EAS planner to efficiently compute heuristics that incorporate geometric and kinematic planning constraints to give a tight estimate of the distance to the goal. Additionally, we show FFRob is probabilistically complete and has finite expected runtime. Finally, we empirically demonstrate FFRob's effectiveness on complex and diverse task and motion planning tasks including rearrangement planning and navigation among movable objects.

* The International Journal of Robotics Research (IJRR), 2017

Via

Access Paper or Ask Questions

Guiding the search in continuous state-action spaces by learning an action sampling distribution from off-target samples

Nov 04, 2017

Beomjoon Kim, Leslie Pack Kaelbling, Tomas Lozano-Perez

Figure 1 for Guiding the search in continuous state-action spaces by learning an action sampling distribution from off-target samples

Figure 2 for Guiding the search in continuous state-action spaces by learning an action sampling distribution from off-target samples

Figure 3 for Guiding the search in continuous state-action spaces by learning an action sampling distribution from off-target samples

Figure 4 for Guiding the search in continuous state-action spaces by learning an action sampling distribution from off-target samples

Abstract:In robotics, it is essential to be able to plan efficiently in high-dimensional continuous state-action spaces for long horizons. For such complex planning problems, unguided uniform sampling of actions until a path to a goal is found is hopelessly inefficient, and gradient-based approaches often fall short when the optimization manifold of a given problem is not smooth. In this paper we present an approach that guides the search of a state-space planner, such as A*, by learning an action-sampling distribution that can generalize across different instances of a planning problem. The motivation is that, unlike typical learning approaches for planning for continuous action space that estimate a policy, an estimated action sampler is more robust to error since it has a planner to fall back on. We use a Generative Adversarial Network (GAN), and address an important issue: search experience consists of a relatively large number of actions that are not on a solution path and a relatively small number of actions that actually are on a solution path. We introduce a new technique, based on an importance-ratio estimation method, for using samples from a non-target distribution to make GAN learning more data-efficient. We provide theoretical guarantees and empirical evaluation in three challenging continuous robot planning problems to illustrate the effectiveness of our algorithm.

Via

Access Paper or Ask Questions

Provably Safe Robot Navigation with Obstacle Uncertainty

May 31, 2017

Brian Axelrod, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Figure 1 for Provably Safe Robot Navigation with Obstacle Uncertainty

Figure 2 for Provably Safe Robot Navigation with Obstacle Uncertainty

Figure 3 for Provably Safe Robot Navigation with Obstacle Uncertainty

Figure 4 for Provably Safe Robot Navigation with Obstacle Uncertainty

Abstract:As drones and autonomous cars become more widespread it is becoming increasingly important that robots can operate safely under realistic conditions. The noisy information fed into real systems means that robots must use estimates of the environment to plan navigation. Efficiently guaranteeing that the resulting motion plans are safe under these circumstances has proved difficult. We examine how to guarantee that a trajectory or policy is safe with only imperfect observations of the environment. We examine the implications of various mathematical formalisms of safety and arrive at a mathematical notion of safety of a long-term execution, even when conditioned on observational information. We present efficient algorithms that can prove that trajectories or policies are safe with much tighter bounds than in previous work. Notably, the complexity of the environment does not affect our methods ability to evaluate if a trajectory or policy is safe. We then use these safety checking methods to design a safe variant of the RRT planning algorithm.

* RSS 2017

Via

Access Paper or Ask Questions

STRIPS Planning in Infinite Domains

May 28, 2017

Caelan Reed Garrett, Tomás Lozano-Pérez, Leslie Pack Kaelbling

Figure 1 for STRIPS Planning in Infinite Domains

Figure 2 for STRIPS Planning in Infinite Domains

Figure 3 for STRIPS Planning in Infinite Domains

Figure 4 for STRIPS Planning in Infinite Domains

Abstract:Many robotic planning applications involve continuous actions with highly non-linear constraints, which cannot be modeled using modern planners that construct a propositional representation. We introduce STRIPStream: an extension of the STRIPS language which can model these domains by supporting the specification of blackbox generators to handle complex constraints. The outputs of these generators interact with actions through possibly infinite streams of objects and static predicates. We provide two algorithms which both reduce STRIPStream problems to a sequence of finite-domain planning problems. The representation and algorithms are entirely domain independent. We demonstrate our framework on simple illustrative domains, and then on a high-dimensional, continuous robotic task and motion planning domain.

* 11 pages

Via

Access Paper or Ask Questions

Focused Model-Learning and Planning for Non-Gaussian Continuous State-Action Systems

Oct 23, 2016

Zi Wang, Stefanie Jegelka, Leslie Pack Kaelbling, Tomás Lozano-Pérez

Figure 1 for Focused Model-Learning and Planning for Non-Gaussian Continuous State-Action Systems

Figure 2 for Focused Model-Learning and Planning for Non-Gaussian Continuous State-Action Systems

Figure 3 for Focused Model-Learning and Planning for Non-Gaussian Continuous State-Action Systems

Figure 4 for Focused Model-Learning and Planning for Non-Gaussian Continuous State-Action Systems

Abstract:We introduce a framework for model learning and planning in stochastic domains with continuous state and action spaces and non-Gaussian transition models. It is efficient because (1) local models are estimated only when the planner requires them; (2) the planner focuses on the most relevant states to the current planning problem; and (3) the planner focuses on the most informative and/or high-value actions. Our theoretical analysis shows the validity and asymptotic optimality of the proposed approach. Empirically, we demonstrate the effectiveness of our algorithm on a simulated multi-modal pushing problem.

Via

Access Paper or Ask Questions

Learning to Rank for Synthesizing Planning Heuristics

Aug 03, 2016

Caelan Reed Garrett, Leslie Pack Kaelbling, Tomas Lozano-Perez

Figure 1 for Learning to Rank for Synthesizing Planning Heuristics

Abstract:We investigate learning heuristics for domain-specific planning. Prior work framed learning a heuristic as an ordinary regression problem. However, in a greedy best-first search, the ordering of states induced by a heuristic is more indicative of the resulting planner's performance than mean squared error. Thus, we instead frame learning a heuristic as a learning to rank problem which we solve using a RankSVM formulation. Additionally, we introduce new methods for computing features that capture temporal interactions in an approximate plan. Our experiments on recent International Planning Competition problems show that the RankSVM learned heuristics outperform both the original heuristics and heuristics learned through ordinary regression.

* International Joint Conference on Artificial Intelligence (IJCAI) 2016

Via

Access Paper or Ask Questions