Dieter Fox

University of Washington

Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop

Dec 07, 2020

Sebastian Höfer, Kostas Bekris, Ankur Handa, Juan Camilo Gamboa, Florian Golemo, Melissa Mozifian, Chris Atkeson, Dieter Fox, Ken Goldberg, John Leonard(+5 more)

Abstract: This report presents the debates, posters, and discussions of the Sim2Real workshop held in conjunction with the 2020 edition of the "Robotics: Science and Systems" conference. Twelve leaders of the field took competing debate positions on the definition, viability, and importance of transferring skills from simulation to the real world in the context of robotics problems. The debaters also joined a large panel discussion, answering audience questions and outlining the future of Sim2Real in robotics. Furthermore, we invited extended abstracts to this workshop, which are summarized in this report. Based on the workshop, this report concludes with directions for practitioners exploiting this technology and for researchers further exploring open problems in this area.

* Summary of the "2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics" held in conjunction with "Robotics: Science and Systems 2020". Website: https://sim2real.github.io/

Object Rearrangement Using Learned Implicit Collision Functions

Nov 21, 2020

Michael Danielczuk, Arsalan Mousavian, Clemens Eppner, Dieter Fox

Abstract: Robotic object rearrangement combines the skills of picking and placing objects. When object models are unavailable, typical collision-checking models may be unable to predict collisions in partial point clouds with occlusions, making generation of collision-free grasping or placement trajectories challenging. We propose a learned collision model that accepts scene and query object point clouds and predicts collisions for 6DOF object poses within the scene. We train the model on a synthetic set of 1 million scene/object point cloud pairs and 2 billion collision queries. We leverage the learned collision model as part of a model predictive path integral (MPPI) policy in a tabletop rearrangement task and show that the policy can plan collision-free grasps and placements for objects unseen in training in both simulated and physical cluttered scenes with a Franka Panda robot. The learned model outperforms both traditional pipelines and learned ablations by 9.8% in accuracy on a dataset of simulated collision queries and is 75x faster than the best-performing baseline. Videos and supplementary material are available at https://sites.google.com/nvidia.com/scenecollisionnet.

* Michael Danielczuk and Arsalan Mousavian contributed equally
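
As a rough illustration of how a collision predictor can sit inside an MPPI loop, the sketch below runs one MPPI iteration for a 2D point robot. It is not the authors' implementation: `toy_collision_cost` is a hypothetical hand-written stand-in for a learned collision model, and the dynamics, goal, obstacle, and all parameter values are assumptions chosen only to keep the example small.

```python
import math
import random

def mppi_weights(costs, temperature=1.0):
    """Exponential MPPI weights: lower rollout cost gives higher weight."""
    best = min(costs)  # subtract the minimum for numerical stability
    unnorm = [math.exp(-(c - best) / temperature) for c in costs]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def toy_collision_cost(x, y):
    """Hypothetical stand-in for a learned collision predictor:
    penalize positions inside a disc obstacle centered at (0.5, 0.0)."""
    return 100.0 if math.hypot(x - 0.5, y) < 0.2 else 0.0

def rollout_cost(controls, goal=(1.0, 0.0)):
    """Integrate 2D displacement commands from the origin and sum the cost."""
    x = y = 0.0
    cost = 0.0
    for vx, vy in controls:
        x, y = x + vx, y + vy
        cost += toy_collision_cost(x, y)
    # Terminal cost: distance of the final position to the goal.
    cost += 10.0 * math.hypot(x - goal[0], y - goal[1])
    return cost

def mppi_step(nominal, num_samples=256, sigma=0.05, temperature=1.0, rng=random):
    """One MPPI iteration: perturb the nominal controls, score each
    perturbed rollout, and shift the nominal by the weighted noise."""
    noises = [[(rng.gauss(0, sigma), rng.gauss(0, sigma)) for _ in nominal]
              for _ in range(num_samples)]
    costs = [rollout_cost([(u[0] + n[0], u[1] + n[1])
                           for u, n in zip(nominal, noise)])
             for noise in noises]
    w = mppi_weights(costs, temperature)
    return [(u[0] + sum(wk * noise[t][0] for wk, noise in zip(w, noises)),
             u[1] + sum(wk * noise[t][1] for wk, noise in zip(w, noises)))
            for t, u in enumerate(nominal)]

# Usage: start from a straight-line plan that passes through the obstacle.
nominal = [(0.1, 0.0)] * 10
for _ in range(20):
    nominal = mppi_step(nominal)
```

In a real system the cost function would query the learned model for each sampled object pose; batching those queries is where a fast collision predictor would matter most.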

ACRONYM: A Large-Scale Grasp Dataset Based on Simulation

Nov 18, 2020

Clemens Eppner, Arsalan Mousavian, Dieter Fox

Abstract: We introduce ACRONYM, a dataset for robot grasp planning based on physics simulation. The dataset contains 17.7M parallel-jaw grasps, spanning 8872 objects from 262 different categories, each labeled with the grasp result obtained from a physics simulator. We show the value of this large and diverse dataset by using it to train two state-of-the-art learning-based grasp planning algorithms. Grasp performance improves significantly when compared to the original smaller dataset. Data and tools can be accessed at https://sites.google.com/nvidia.com/graspdataset.

A User's Guide to Calibrating Robotics Simulators

Nov 17, 2020

Bhairav Mehta, Ankur Handa, Dieter Fox, Fabio Ramos

Abstract: Simulators are a critical component of modern robotics research. Strategies for both perception and decision making can be studied in simulation first before being deployed to real-world systems, saving time and cost. Despite significant progress on the development of sim-to-real algorithms, the analysis of different methods is still conducted in an ad-hoc manner, without a consistent set of tests and metrics for comparison. This paper fills this gap and proposes a set of benchmarks and a framework for the study of various algorithms that aim to transfer models and policies learnt in simulation to the real world. We conduct experiments on a wide range of well-known simulated environments to characterize and offer insights into the performance of different algorithms. Our analysis can be useful for practitioners working in this area and can help make informed choices about the behavior and main properties of sim-to-real algorithms. We open-source the benchmark, training data, and trained models, which can be found at https://github.com/NVlabs/sim-parameter-estimation.

* Accepted at Conference on Robot Learning 2020

Reactive Human-to-Robot Handovers of Arbitrary Objects

Nov 17, 2020

Wei Yang, Chris Paxton, Arsalan Mousavian, Yu-Wei Chao, Maya Cakmak, Dieter Fox

Abstract: Human-robot object handovers have been an actively studied area of robotics over the past decade; however, very few techniques and systems have addressed the challenge of handing over diverse objects with arbitrary appearance, size, shape, and rigidity. In this paper, we present a vision-based system that enables reactive human-to-robot handovers of unknown objects. Our approach combines closed-loop motion planning with real-time, temporally-consistent grasp generation to ensure reactivity and motion smoothness. Our system is robust to different object positions and orientations, and can grasp both rigid and non-rigid objects. We demonstrate the generalizability, usability, and robustness of our approach on a novel benchmark set of 26 diverse household objects, a user study with naive users (N=6) handing over a subset of 15 objects, and a systematic evaluation examining different ways of handing objects. More results and videos can be found at https://sites.google.com/nvidia.com/handovers-of-arbitrary-objects.

Sim-to-Real Task Planning and Execution from Perception via Reactivity and Recovery

Nov 17, 2020

Shohin Mukherjee, Chris Paxton, Arsalan Mousavian, Adam Fishman, Maxim Likhachev, Dieter Fox

Abstract: Zero-shot execution of unseen robotic tasks is an important problem in robotics. One potential approach is through task planning: combining known skills based on their preconditions and effects to achieve a user-specified goal. In this work, we propose such a task planning approach to build a reactive system for multi-step manipulation tasks that can be trained on simulation data and applied in the real world. We explore a block-stacking task because it has a clear structure, where multiple skills must be chained together: pick up a block, place it on top of another block, etc. We learn these skills, along with a set of predicate preconditions and termination conditions, entirely in simulation. All components are learned as PointNet++ models, parameterized by the masks of relevant objects. The predicates allow us to create high-level plans combining different skills. They also serve as precondition functions for the skills, which enables the system to recognize failures and accomplish long-horizon tasks from perceptual input, which is critical for real-world execution. We evaluate our proposed approach both in simulation and in the real world, showing an increase in success rate from 91.6% to 98% in simulation and from 10% to 80% in the real world as compared with naive baselines.

* Under review
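
To make the idea of chaining skills by their preconditions and effects concrete, here is a minimal symbolic sketch for block stacking. It is an illustrative assumption, not the paper's system: the predicates are hand-written strings rather than learned PointNet++ models, the skill set (`pick` from the table, `place` onto another block) is hypothetical, and the planner is a plain breadth-first search.

```python
from collections import deque

def make_skills(blocks):
    """Build (name, preconditions, add effects, delete effects) tuples.
    All predicates are plain strings; pick only lifts blocks off the table."""
    skills = []
    for b in blocks:
        skills.append((f"pick({b})",
                       {f"clear({b})", f"on_table({b})", "hand_empty"},
                       {f"holding({b})"},
                       {f"on_table({b})", f"clear({b})", "hand_empty"}))
        for t in blocks:
            if t != b:
                skills.append((f"place({b},{t})",
                               {f"holding({b})", f"clear({t})"},
                               {f"on({b},{t})", f"clear({b})", "hand_empty"},
                               {f"holding({b})", f"clear({t})"}))
    return skills

def plan(start, goal, skills):
    """Breadth-first search over predicate states; returns a shortest
    list of skill names reaching the goal, or None if none exists."""
    start = frozenset(start)
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        state, steps = queue.popleft()
        if goal <= state:  # every goal predicate holds
            return steps
        for name, pre, add, delete in skills:
            if pre <= state:  # skill is applicable in this state
                nxt = frozenset((state - delete) | add)
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append((nxt, steps + [name]))
    return None

# Usage: three blocks on the table, goal is the tower A on B on C.
blocks = ["A", "B", "C"]
start = {"hand_empty"} | {f"clear({b})" for b in blocks} \
                       | {f"on_table({b})" for b in blocks}
goal = {"on(A,B)", "on(B,C)"}
print(plan(start, goal, make_skills(blocks)))
# ['pick(B)', 'place(B,C)', 'pick(A)', 'place(A,B)']
```

Because the same predicates gate each skill, re-evaluating them after every step is one simple way a system could notice a failed action and replan from the observed state.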

Geometric Fabrics for the Acceleration-based Design of Robotic Motion

Nov 11, 2020

Mandy Xie, Karl Van Wyk, Anqi Li, Muhammad Asif Rana, Dieter Fox, Byron Boots, Nathan Ratliff

Abstract: This paper describes the pragmatic design and construction of geometric fabrics for shaping a robot's task-independent nominal behavior, capturing behavioral components such as obstacle avoidance, joint limit avoidance, redundancy resolution, global navigation heuristics, etc. Geometric fabrics constitute the most concrete incarnation of a new mathematical formulation for reactive behavior called optimization fabrics. Fabrics generalize recent work on Riemannian Motion Policies (RMPs); they add provable stability guarantees and improve design consistency while promoting the intuitive acceleration-based principles of modular design that make RMPs successful. We describe a suite of mathematical modeling tools that practitioners can employ and demonstrate both how to mitigate system complexity by constructing behaviors layer-wise and how to employ these tools to design robust, strongly generalizing policies that solve practical problems one would expect to find in industry applications. Our system exhibits intelligent global navigation behaviors expressed entirely as fabrics with zero planning or state machine governance.

Multimodal Trajectory Prediction via Topological Invariance for Navigation at Uncontrolled Intersections

Nov 08, 2020

Junha Roh, Christoforos Mavrogiannis, Rishabh Madan, Dieter Fox, Siddhartha S. Srinivasa

Abstract: We focus on decentralized navigation among multiple non-communicating rational agents at uncontrolled intersections, i.e., street intersections without traffic signs or signals. Avoiding collisions in such domains relies on the ability of agents to predict each other's intentions reliably, and react quickly. Multiagent trajectory prediction is NP-hard, whereas the sample complexity of existing data-driven approaches limits their applicability. Our key insight is that the geometric structure of the intersection and the incentive of agents to move efficiently and avoid collisions (rationality) reduce the space of likely behaviors, effectively relaxing the problem of trajectory prediction. In this paper, we collapse the space of multiagent trajectories at an intersection into a set of modes representing different classes of multiagent behavior, formalized using a notion of topological invariance. Based on this formalism, we design Multiple Topologies Prediction (MTP), a data-driven trajectory-prediction mechanism that reconstructs trajectory representations of high-likelihood modes in multiagent intersection scenes. We show that MTP outperforms a state-of-the-art multimodal trajectory prediction baseline (MFP) in terms of prediction accuracy by 78.24% on a challenging simulated dataset. Finally, we show that MTP enables our optimization-based planner, MTPnav, to achieve collision-free and time-efficient navigation across a variety of challenging intersection scenarios on the CARLA simulator.

* Preprint of a paper with the same title, accepted to the Conference on Robot Learning 2020

STReSSD: Sim-To-Real from Sound for Stochastic Dynamics

Nov 05, 2020

Carolyn Matl, Yashraj Narang, Dieter Fox, Ruzena Bajcsy, Fabio Ramos

Abstract: Sound is an information-rich medium that captures dynamic physical events. This work presents STReSSD, a framework that uses sound to bridge the simulation-to-reality gap for stochastic dynamics, demonstrated for the canonical case of a bouncing ball. A physically-motivated noise model is presented to capture stochastic behavior of the balls upon collision with the environment. A likelihood-free Bayesian inference framework is used to infer the parameters of the noise model, as well as a material property called the coefficient of restitution, from audio observations. The same inference framework and the calibrated stochastic simulator are then used to learn a probabilistic model of ball dynamics. The predictive capabilities of the dynamics model are tested in two robotic experiments. First, open-loop predictions anticipate probabilistic success of bouncing a ball into a cup. The second experiment integrates audio perception with a robotic arm to track and deflect a bouncing ball in real-time. We envision that this work is a step towards integrating audio-based inference for dynamic robotic tasks. Experimental results can be viewed at https://youtu.be/b7pOrgZrArk.

* 25 pages, 35 figures, The Conference on Robot Learning (CoRL) 2020
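
The sketch below illustrates likelihood-free inference of a coefficient of restitution from bounce timing, using plain rejection ABC (approximate Bayesian computation). This is a deliberately simple stand-in and may differ from the inference method used in the paper. The simulator, the Gaussian noise on the restitution, the uniform prior, the tolerance, and the assumption that audio has already been reduced to impact times are all hypothetical choices for illustration.

```python
import math
import random

G = 9.81  # gravitational acceleration in m/s^2

def bounce_times(e, drop_height=1.0, n_bounces=5, noise_std=0.0, rng=random):
    """Impact times of a ball dropped from rest. Each bounce scales the
    impact speed by a (possibly noisy) coefficient of restitution e."""
    v = math.sqrt(2 * G * drop_height)  # speed at the first impact
    t = v / G                           # time of the first impact
    times = [t]
    for _ in range(n_bounces - 1):
        e_k = min(max(e + rng.gauss(0, noise_std), 0.0), 1.0)
        v *= e_k
        t += 2 * v / G                  # flight time between impacts
        times.append(t)
    return times

def abc_rejection(observed, n_draws=20000, tol=0.05, noise_std=0.01, rng=random):
    """Rejection ABC: draw e from the prior, simulate, and keep the draw
    if every simulated impact time is within tol seconds of the data."""
    accepted = []
    for _ in range(n_draws):
        e = rng.uniform(0.3, 1.0)  # uniform prior (an assumption)
        sim = bounce_times(e, n_bounces=len(observed),
                           noise_std=noise_std, rng=rng)
        dist = max(abs(a - b) for a, b in zip(sim, observed))
        if dist < tol:
            accepted.append(e)
    return accepted

# Usage: synthetic "observed" impact times generated with e = 0.8.
observed = bounce_times(0.8)
posterior = abc_rejection(observed)
if posterior:
    print("posterior mean of e:", sum(posterior) / len(posterior))
```

The accepted draws approximate a posterior over the restitution without ever writing down a likelihood, which is what makes this family of methods usable with a black-box stochastic simulator.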

Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds

Oct 02, 2020

Lirui Wang, Yu Xiang, Dieter Fox

Abstract: 6D robotic grasping beyond top-down bin-picking scenarios is a challenging task. Previous solutions based on 6D grasp synthesis with robot motion planning usually operate in an open-loop setting without considering the dynamics and contacts of objects, which makes them sensitive to grasp synthesis errors. In this work, we propose a novel method for learning closed-loop control policies for 6D robotic grasping using point clouds from an egocentric camera. We combine imitation learning and reinforcement learning in order to grasp unseen objects and handle the continuous 6D action space, where expert demonstrations are obtained from a joint motion and grasp planner. We introduce a goal-auxiliary actor-critic algorithm, which uses grasping goal prediction as an auxiliary task to facilitate policy learning. The supervision on grasping goals can be obtained from the expert planner for known objects or from hindsight goals for unknown objects. Overall, our learned closed-loop policy achieves over 90% success rates on grasping various ShapeNet objects and YCB objects in simulation. Our video can be found at https://www.youtube.com/watch?v=rKsCRXLykiY&t=1s.
