Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gaurav S. Sukhatme

Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation

Nov 11, 2021

I-Chun Arthur Liu, Shagun Uppal, Gaurav S. Sukhatme, Joseph J. Lim, Peter Englert, Youngwoon Lee

Figure 1 for Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation

Figure 2 for Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation

Figure 3 for Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation

Figure 4 for Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation

Abstract:Learning complex manipulation tasks in realistic, obstructed environments is a challenging problem due to hard exploration in the presence of obstacles and high-dimensional visual observations. Prior work tackles the exploration problem by integrating motion planning and reinforcement learning. However, the motion planner augmented policy requires access to state information, which is often not available in the real-world settings. To this end, we propose to distill a state-based motion planner augmented policy to a visual control policy via (1) visual behavioral cloning to remove the motion planner dependency along with its jittery motion, and (2) vision-based reinforcement learning with the guidance of the smoothed trajectories from the behavioral cloning agent. We evaluate our method on three manipulation tasks in obstructed environments and compare it against various reinforcement learning and imitation learning baselines. The results demonstrate that our framework is highly sample-efficient and outperforms the state-of-the-art algorithms. Moreover, coupled with domain randomization, our policy is capable of zero-shot transfer to unseen environment settings with distractors. Code and videos are available at https://clvrai.com/mopa-pd

* Published at the Conference on Robot Learning (CoRL) 2021

Via

Access Paper or Ask Questions

LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

Nov 10, 2021

Yizhou Zhao, Kaixiang Lin, Zhiwei Jia, Qiaozi Gao, Govind Thattai, Jesse Thomason, Gaurav S. Sukhatme

Figure 1 for LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

Figure 2 for LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

Figure 3 for LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

Figure 4 for LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

Abstract:Learning-based methods for training embodied agents typically require a large number of high-quality scenes that contain realistic layouts and support meaningful interactions. However, current simulators for Embodied AI (EAI) challenges only provide simulated indoor scenes with a limited number of layouts. This paper presents Luminous, the first research framework that employs state-of-the-art indoor scene synthesis algorithms to generate large-scale simulated scenes for Embodied AI challenges. Further, we automatically and quantitatively evaluate the quality of generated indoor scenes via their ability to support complex household tasks. Luminous incorporates a novel scene generation algorithm (Constrained Stochastic Scene Generation (CSSG)), which achieves competitive performance with human-designed scenes. Within Luminous, the EAI task executor, task instruction generation module, and video rendering toolkit can collectively generate a massive multimodal dataset of new scenes for the training and evaluation of Embodied AI agents. Extensive experimental results demonstrate the effectiveness of the data generated by Luminous, enabling the comprehensive assessment of embodied agents on generalization and robustness.

* 2021 paper, Amazon

Via

Access Paper or Ask Questions

A Simple Approach to Continual Learning by Transferring Skill Parameters

Oct 19, 2021

K. R. Zentner, Ryan Julian, Ujjwal Puri, Yulun Zhang, Gaurav S. Sukhatme

Figure 1 for A Simple Approach to Continual Learning by Transferring Skill Parameters

Figure 2 for A Simple Approach to Continual Learning by Transferring Skill Parameters

Figure 3 for A Simple Approach to Continual Learning by Transferring Skill Parameters

Figure 4 for A Simple Approach to Continual Learning by Transferring Skill Parameters

Abstract:In order to be effective general purpose machines in real world environments, robots not only will need to adapt their existing manipulation skills to new circumstances, they will need to acquire entirely new skills on-the-fly. A great promise of continual learning is to endow robots with this ability, by using their accumulated knowledge and experience from prior skills. We take a fresh look at this problem, by considering a setting in which the robot is limited to storing that knowledge and experience only in the form of learned skill policies. We show that storing skill policies, careful pre-training, and appropriately choosing when to transfer those skill policies is sufficient to build a continual learner in the context of robotic manipulation. We analyze which conditions are needed to transfer skills in the challenging Meta-World simulation benchmark. Using this analysis, we introduce a pair-wise metric relating skills that allows us to predict the effectiveness of skill transfer between tasks, and use it to reduce the problem of continual learning to curriculum selection. Given an appropriate curriculum, we show how to continually acquire robotic manipulation skills without forgetting, and using far fewer samples than needed to train them from scratch.

* Submitted to ICRA 2022

Via

Access Paper or Ask Questions

Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems

Sep 25, 2021

Amanda Prorok, Matthew Malencia, Luca Carlone, Gaurav S. Sukhatme, Brian M. Sadler, Vijay Kumar

Figure 1 for Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems

Figure 2 for Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems

Figure 3 for Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems

Figure 4 for Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems

Abstract:Robustness is key to engineering, automation, and science as a whole. However, the property of robustness is often underpinned by costly requirements such as over-provisioning, known uncertainty and predictive models, and known adversaries. These conditions are idealistic, and often not satisfiable. Resilience on the other hand is the capability to endure unexpected disruptions, to recover swiftly from negative events, and bounce back to normality. In this survey article, we analyze how resilience is achieved in networks of agents and multi-robot systems that are able to overcome adversity by leveraging system-wide complementarity, diversity, and redundancy - often involving a reconfiguration of robotic capabilities to provide some key ability that was not present in the system a priori. As society increasingly depends on connected automated systems to provide key infrastructure services (e.g., logistics, transport, and precision agriculture), providing the means to achieving resilient multi-robot systems is paramount. By enumerating the consequences of a system that is not resilient (fragile), we argue that resilience must become a central engineering design consideration. Towards this goal, the community needs to gain clarity on how it is defined, measured, and maintained. We address these questions across foundational robotics domains, spanning perception, control, planning, and learning. One of our key contributions is a formal taxonomy of approaches, which also helps us discuss the defining factors and stressors for a resilient system. Finally, this survey article gives insight as to how resilience may be achieved. Importantly, we highlight open problems that remain to be tackled in order to reap the benefits of resilient robotic systems.

Via

Access Paper or Ask Questions

Adaptive Sampling using POMDPs with Domain-Specific Considerations

Sep 23, 2021

Gautam Salhotra, Christopher E. Denniston, David A. Caron, Gaurav S. Sukhatme

Figure 1 for Adaptive Sampling using POMDPs with Domain-Specific Considerations

Figure 2 for Adaptive Sampling using POMDPs with Domain-Specific Considerations

Figure 3 for Adaptive Sampling using POMDPs with Domain-Specific Considerations

Figure 4 for Adaptive Sampling using POMDPs with Domain-Specific Considerations

Abstract:We investigate improving Monte Carlo Tree Search based solvers for Partially Observable Markov Decision Processes (POMDPs), when applied to adaptive sampling problems. We propose improvements in rollout allocation, the action exploration algorithm, and plan commitment. The first allocates a different number of rollouts depending on how many actions the agent has taken in an episode. We find that rollouts are more valuable after some initial information is gained about the environment. Thus, a linear increase in the number of rollouts, i.e. allocating a fixed number at each step, is not appropriate for adaptive sampling tasks. The second alters which actions the agent chooses to explore when building the planning tree. We find that by using knowledge of the number of rollouts allocated, the agent can more effectively choose actions to explore. The third improvement is in determining how many actions the agent should take from one plan. Typically, an agent will plan to take the first action from the planning tree and then call the planner again from the new state. Using statistical techniques, we show that it is possible to greatly reduce the number of rollouts by increasing the number of actions taken from a single planning tree without affecting the agent's final reward. Finally, we demonstrate experimentally, on simulated and real aquatic data from an underwater robot, that these improvements can be combined, leading to better adaptive sampling. The code for this work is available at https://github.com/uscresl/AdaptiveSamplingPOMCP

* Accepted at ICRA 2021 6 pages + 1 page Appendix. The first two authors had an equal contribution

Via

Access Paper or Ask Questions

Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation

Sep 18, 2021

Eric Heiden, Christopher E. Denniston, David Millard, Fabio Ramos, Gaurav S. Sukhatme

Figure 1 for Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation

Figure 2 for Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation

Figure 3 for Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation

Figure 4 for Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation

Abstract:To accurately reproduce measurements from the real world, simulators need to have an adequate model of the physical system and require the parameters of the model be identified. We address the latter problem of estimating parameters through a Bayesian inference approach that approximates a posterior distribution over simulation parameters given real sensor measurements. By extending the commonly used Gaussian likelihood model for trajectories via the multiple-shooting formulation, our chosen particle-based inference algorithm Stein Variational Gradient Descent is able to identify highly nonlinear, underactuated systems. We leverage GPU code generation and differentiable simulation to evaluate the likelihood and its gradient for many particles in parallel. Our algorithm infers non-parametric distributions over simulation parameters more accurately than comparable baselines and handles constraints over parameters efficiently through gradient-based optimization. We evaluate estimation performance on several physical experiments. On an underactuated mechanism where a 7-DOF robot arm excites an object with an unknown mass configuration, we demonstrate how our inference technique can identify symmetries between the parameters and provide highly accurate predictions. Project website: https://uscresl.github.io/prob-diff-sim

* Extended version. Submitted to ICRA 2022

Via

Access Paper or Ask Questions

Adaptive and Risk-Aware Target Tracking with Heterogeneous Robot Teams

May 09, 2021

Siddharth Mayya, Ragesh K. Ramachandran, Lifeng Zhou, Gaurav S. Sukhatme, Vijay Kumar

Figure 1 for Adaptive and Risk-Aware Target Tracking with Heterogeneous Robot Teams

Figure 2 for Adaptive and Risk-Aware Target Tracking with Heterogeneous Robot Teams

Figure 3 for Adaptive and Risk-Aware Target Tracking with Heterogeneous Robot Teams

Figure 4 for Adaptive and Risk-Aware Target Tracking with Heterogeneous Robot Teams

Abstract:We consider a scenario where a team of robots with heterogeneous sensors must track a set of hostile targets which induce sensory failures on the robots. In particular, the likelihood of failures depends on the proximity between the targets and the robots. We propose a control framework that implicitly addresses the competing objectives of performance maximization and sensor preservation (which impacts the future performance of the team). Our framework consists of a predictive component -- which accounts for the risk of being detected by the target, and a reactive component -- which maximizes the performance of the team regardless of the failures that have already occurred. Based on a measure of the abundance of sensors in the team, our framework can generate aggressive and risk-averse robot configurations to track the targets. Crucially, the heterogeneous sensing capabilities of the robots are explicitly considered in each step, allowing for a more expressive risk-performance trade-off. Simulated experiments with induced sensor failures demonstrate the efficacy of the proposed approach.

* Submitted to the International Conference on Intelligent Robots and Systems 2021. 9 pages

Via

Access Paper or Ask Questions

Adaptive Sampling: Algorithmic vs. Human Waypoint Selection

Apr 24, 2021

Stephanie Kemna, Sara Kangaslahti, Oliver Kroemer, Gaurav S. Sukhatme

Figure 1 for Adaptive Sampling: Algorithmic vs. Human Waypoint Selection

Figure 2 for Adaptive Sampling: Algorithmic vs. Human Waypoint Selection

Figure 3 for Adaptive Sampling: Algorithmic vs. Human Waypoint Selection

Figure 4 for Adaptive Sampling: Algorithmic vs. Human Waypoint Selection

Abstract:Robots are used for collecting samples from natural environments to create models of, for example, temperature or algae fields in the ocean. Adaptive informative sampling is a proven technique for this kind of spatial field modeling. This paper compares the performance of humans versus adaptive informative sampling algorithms for selecting informative waypoints. The humans and simulated robot are given the same information for selecting waypoints, and both are evaluated on the accuracy of the resulting model. We developed a graphical user interface for selecting waypoints and visualizing samples. Eleven participants iteratively picked waypoints for twelve scenarios. Our simulated robot used Gaussian Process regression with two entropy-based optimization criteria to iteratively choose waypoints. Our results show that the robot can on average perform better than the average human, and approximately as good as the best human, when the model assumptions correspond to the actual field. However, when the model assumptions do not correspond as well to the characteristics of the field, both human and robot performance are no better than random sampling.

* 12 pages, 7 figures, not yet published

Via

Access Paper or Ask Questions

NeuralSim: Augmenting Differentiable Simulators with Neural Networks

Nov 09, 2020

Eric Heiden, David Millard, Erwin Coumans, Yizhou Sheng, Gaurav S. Sukhatme

Figure 1 for NeuralSim: Augmenting Differentiable Simulators with Neural Networks

Figure 2 for NeuralSim: Augmenting Differentiable Simulators with Neural Networks

Figure 3 for NeuralSim: Augmenting Differentiable Simulators with Neural Networks

Figure 4 for NeuralSim: Augmenting Differentiable Simulators with Neural Networks

Abstract:Differentiable simulators provide an avenue for closing the sim-to-real gap by enabling the use of efficient, gradient-based optimization algorithms to find the simulation parameters that best fit the observed sensor readings. Nonetheless, these analytical models can only predict the dynamical behavior of systems for which they have been designed. In this work, we study the augmentation of a novel differentiable rigid-body physics engine via neural networks that is able to learn nonlinear relationships between dynamic quantities and can thus learn effects not accounted for in traditional simulators.Such augmentations require less data to train and generalize better compared to entirely data-driven models. Through extensive experiments, we demonstrate the ability of our hybrid simulator to learn complex dynamics involving frictional contacts from real data, as well as match known models of viscous friction, and present an approach for automatically discovering useful augmentations. We show that, besides benefiting dynamics modeling, inserting neural networks can accelerate model-based control architectures. We observe a ten-fold speed-up when replacing the QP solver inside a model-predictive gait controller for quadruped robots with a neural network, allowing us to significantly improve control delays as we demonstrate in real-hardware experiments. We publish code, additional results and videos from our experiments on our project webpage at https://sites.google.com/usc.edu/neuralsim.

* Submitted to ICRA 2021

Via

Access Paper or Ask Questions

Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments

Oct 22, 2020

Jun Yamada, Youngwoon Lee, Gautam Salhotra, Karl Pertsch, Max Pflueger, Gaurav S. Sukhatme, Joseph J. Lim, Peter Englert

Figure 1 for Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments

Figure 2 for Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments

Figure 3 for Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments

Figure 4 for Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments

Abstract:Deep reinforcement learning (RL) agents are able to learn contact-rich manipulation tasks by maximizing a reward signal, but require large amounts of experience, especially in environments with many obstacles that complicate exploration. In contrast, motion planners use explicit models of the agent and environment to plan collision-free paths to faraway goals, but suffer from inaccurate models in tasks that require contacts with the environment. To combine the benefits of both approaches, we propose motion planner augmented RL (MoPA-RL) which augments the action space of an RL agent with the long-horizon planning capabilities of motion planners. Based on the magnitude of the action, our approach smoothly transitions between directly executing the action and invoking a motion planner. We evaluate our approach on various simulated manipulation tasks and compare it to alternative action spaces in terms of learning efficiency and safety. The experiments demonstrate that MoPA-RL increases learning efficiency, leads to a faster exploration, and results in safer policies that avoid collisions with the environment. Videos and code are available at https://clvrai.com/mopa-rl .

* Published at the Conference on Robot Learning (CoRL) 2020

Via

Access Paper or Ask Questions