Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

David Held

Neural Grasp Distance Fields for Robot Manipulation

Nov 04, 2022

Thomas Weng, David Held, Franziska Meier, Mustafa Mukadam

Abstract:We formulate grasp learning as a neural field and present Neural Grasp Distance Fields (NGDF). Here, the input is a 6D pose of a robot end effector and output is a distance to a continuous manifold of valid grasps for an object. In contrast to current approaches that predict a set of discrete candidate grasps, the distance-based NGDF representation is easily interpreted as a cost, and minimizing this cost produces a successful grasp pose. This grasp distance cost can be incorporated directly into a trajectory optimizer for joint optimization with other costs such as trajectory smoothness and collision avoidance. During optimization, as the various costs are balanced and minimized, the grasp target is allowed to smoothly vary, as the learned grasp field is continuous. In simulation benchmarks with a Franka arm, we find that joint grasping and planning with NGDF outperforms baselines by 63% execution success while generalizing to unseen query poses and unseen object shapes. Project page: https://sites.google.com/view/neural-grasp-distance-fields.

Via

Access Paper or Ask Questions

Learning to Grasp the Ungraspable with Emergent Extrinsic Dexterity

Nov 02, 2022

Wenxuan Zhou, David Held

Abstract:A simple gripper can solve more complex manipulation tasks if it can utilize the external environment such as pushing the object against the table or a vertical wall, known as "Extrinsic Dexterity." Previous work in extrinsic dexterity usually has careful assumptions about contacts which impose restrictions on robot design, robot motions, and the variations of the physical parameters. In this work, we develop a system based on reinforcement learning (RL) to address these limitations. We study the task of "Occluded Grasping" which aims to grasp the object in configurations that are initially occluded; the robot needs to move the object into a configuration from which these grasps can be achieved. We present a system with model-free RL that successfully achieves this task using a simple gripper with extrinsic dexterity. The policy learns emergent behaviors of pushing the object against the wall to rotate and then grasp it without additional reward terms on extrinsic dexterity. We discuss important components of the system including the design of the RL problem, multi-grasp training and selection, and policy generalization with automatic curriculum. Most importantly, the policy trained in simulation is zero-shot transferred to a physical robot. It demonstrates dynamic and contact-rich motions with a simple gripper that generalizes across objects with various size, density, surface friction, and shape with a 78% success rate. Videos can be found at https://sites.google.com/view/grasp-ungraspable/.

* 6th Conference on Robot Learning (CoRL 2022)

Via

Access Paper or Ask Questions

AutoBag: Learning to Open Plastic Bags and Insert Objects

Oct 31, 2022

Lawrence Yunliang Chen, Baiyu Shi, Daniel Seita, Richard Cheng, Thomas Kollar, David Held, Ken Goldberg

Figure 1 for AutoBag: Learning to Open Plastic Bags and Insert Objects

Figure 2 for AutoBag: Learning to Open Plastic Bags and Insert Objects

Figure 3 for AutoBag: Learning to Open Plastic Bags and Insert Objects

Figure 4 for AutoBag: Learning to Open Plastic Bags and Insert Objects

Abstract:Thin plastic bags are ubiquitous in retail stores, healthcare, food handling, recycling, homes, and school lunchrooms. They are challenging both for perception (due to specularities and occlusions) and for manipulation (due to the dynamics of their 3D deformable structure). We formulate the task of manipulating common plastic shopping bags with two handles from an unstructured initial state to a state where solid objects can be inserted into the bag for transport. We propose a self-supervised learning framework where a dual-arm robot learns to recognize the handles and rim of plastic bags using UV-fluorescent markings; at execution time, the robot does not use UV markings or UV light. We propose Autonomous Bagging (AutoBag), where the robot uses the learned perception model to open plastic bags through iterative manipulation. We present novel metrics to evaluate the quality of a bag state and new motion primitives for reorienting and opening bags from visual observations. In physical experiments, a YuMi robot using AutoBag is able to open bags and achieve a success rate of 16/30 for inserting at least one item across a variety of initial bag configurations. Supplementary material is available at https://sites.google.com/view/autobag .

Via

Access Paper or Ask Questions

Planning with Spatial-Temporal Abstraction from Point Clouds for Deformable Object Manipulation

Oct 27, 2022

Xingyu Lin, Carl Qi, Yunchu Zhang, Zhiao Huang, Katerina Fragkiadaki, Yunzhu Li, Chuang Gan, David Held

Abstract:Effective planning of long-horizon deformable object manipulation requires suitable abstractions at both the spatial and temporal levels. Previous methods typically either focus on short-horizon tasks or make strong assumptions that full-state information is available, which prevents their use on deformable objects. In this paper, we propose PlAnning with Spatial-Temporal Abstraction (PASTA), which incorporates both spatial abstraction (reasoning about objects and their relations to each other) and temporal abstraction (reasoning over skills instead of low-level actions). Our framework maps high-dimension 3D observations such as point clouds into a set of latent vectors and plans over skill sequences on top of the latent set representation. We show that our method can effectively perform challenging sequential deformable object manipulation tasks in the real world, which require combining multiple tool-use skills such as cutting with a knife, pushing with a pusher, and spreading the dough with a roller.

Via

Access Paper or Ask Questions

Dfferentiable Raycasting for Self-supervised Occupancy Forecasting

Oct 04, 2022

Tarasha Khurana, Peiyun Hu, Achal Dave, Jason ZIglar, David Held, Deva Ramanan

Figure 1 for Dfferentiable Raycasting for Self-supervised Occupancy Forecasting

Figure 2 for Dfferentiable Raycasting for Self-supervised Occupancy Forecasting

Figure 3 for Dfferentiable Raycasting for Self-supervised Occupancy Forecasting

Figure 4 for Dfferentiable Raycasting for Self-supervised Occupancy Forecasting

Abstract:Motion planning for safe autonomous driving requires learning how the environment around an ego-vehicle evolves with time. Ego-centric perception of driveable regions in a scene not only changes with the motion of actors in the environment, but also with the movement of the ego-vehicle itself. Self-supervised representations proposed for large-scale planning, such as ego-centric freespace, confound these two motions, making the representation difficult to use for downstream motion planners. In this paper, we use geometric occupancy as a natural alternative to view-dependent representations such as freespace. Occupancy maps naturally disentangle the motion of the environment from the motion of the ego-vehicle. However, one cannot directly observe the full 3D occupancy of a scene (due to occlusion), making it difficult to use as a signal for learning. Our key insight is to use differentiable raycasting to "render" future occupancy predictions into future LiDAR sweep predictions, which can be compared with ground-truth sweeps for self-supervised learning. The use of differentiable raycasting allows occupancy to emerge as an internal representation within the forecasting network. In the absence of groundtruth occupancy, we quantitatively evaluate the forecasting of raycasted LiDAR sweeps and show improvements of upto 15 F1 points. For downstream motion planners, where emergent occupancy can be directly used to guide non-driveable regions, this representation relatively reduces the number of collisions with objects by up to 17% as compared to freespace-centric motion planners.

* ECCV 2022. Code available at https://github.com/tarashakhurana/emergent-occ-forecasting

Via

Access Paper or Ask Questions

EDO-Net: Learning Elastic Properties of Deformable Objects from Graph Dynamics

Sep 19, 2022

Alberta Longhini, Marco Moletta, Alfredo Reichlin, Michael C. Welle, David Held, Zackory Erickson, Danica Kragic

Figure 1 for EDO-Net: Learning Elastic Properties of Deformable Objects from Graph Dynamics

Figure 2 for EDO-Net: Learning Elastic Properties of Deformable Objects from Graph Dynamics

Figure 3 for EDO-Net: Learning Elastic Properties of Deformable Objects from Graph Dynamics

Figure 4 for EDO-Net: Learning Elastic Properties of Deformable Objects from Graph Dynamics

Abstract:We study the problem of learning graph dynamics of deformable objects which generalize to unknown physical properties. In particular, we leverage a latent representation of elastic physical properties of cloth-like deformable objects which we explore through a pulling interaction. We propose EDO-Net (Elastic Deformable Object - Net), a model trained in a self-supervised fashion on a large variety of samples with different elastic properties. EDO-Net jointly learns an adaptation module, responsible for extracting a latent representation of the physical properties of the object, and a forward-dynamics module, which leverages the latent representation to predict future states of cloth-like objects, represented as graphs. We evaluate EDO-Net both in simulation and real world, assessing its capabilities of: 1) generalizing to unknown physical properties of cloth-like deformable objects, 2) transferring the learned representation to new downstream tasks.

Via

Access Paper or Ask Questions

Elastic Context: Encoding Elasticity for Data-driven Models of Textiles

Sep 19, 2022

Alberta Longhini, Marco Moletta, Alfredo Reichlin, Michael C. Welle, Alexander Kravberg, Yufei Wang, David Held, Zackory Erickson, Danica Kragic

Figure 1 for Elastic Context: Encoding Elasticity for Data-driven Models of Textiles

Figure 2 for Elastic Context: Encoding Elasticity for Data-driven Models of Textiles

Figure 3 for Elastic Context: Encoding Elasticity for Data-driven Models of Textiles

Figure 4 for Elastic Context: Encoding Elasticity for Data-driven Models of Textiles

Abstract:Physical interaction with textiles, such as assistive dressing, relies on advanced dextreous capabilities. The underlying complexity in textile behavior when being pulled and stretched, is due to both the yarn material properties and the textile construction technique. Today, there are no commonly adopted and annotated datasets on which the various interaction or property identification methods are assessed. One important property that affects the interaction is material elasticity that results from both the yarn material and construction technique: these two are intertwined and, if not known a-priori, almost impossible to identify through sensing commonly available on robotic platforms. We introduce Elastic Context (EC), a concept that integrates various properties that affect elastic behavior, to enable a more effective physical interaction with textiles. The definition of EC relies on stress/strain curves commonly used in textile engineering, which we reformulated for robotic applications. We employ EC using Graph Neural Network (GNN) to learn generalized elastic behaviors of textiles. Furthermore, we explore the effect the dimension of the EC has on accurate force modeling of non-linear real-world elastic behaviors, highlighting the challenges of current robotic setups to sense textile properties.

Via

Access Paper or Ask Questions

Visual Haptic Reasoning: Estimating Contact Forces by Observing Deformable Object Interactions

Aug 11, 2022

Yufei Wang, David Held, Zackory Erickson

Figure 1 for Visual Haptic Reasoning: Estimating Contact Forces by Observing Deformable Object Interactions

Figure 2 for Visual Haptic Reasoning: Estimating Contact Forces by Observing Deformable Object Interactions

Figure 3 for Visual Haptic Reasoning: Estimating Contact Forces by Observing Deformable Object Interactions

Figure 4 for Visual Haptic Reasoning: Estimating Contact Forces by Observing Deformable Object Interactions

Abstract:Robotic manipulation of highly deformable cloth presents a promising opportunity to assist people with several daily tasks, such as washing dishes; folding laundry; or dressing, bathing, and hygiene assistance for individuals with severe motor impairments. In this work, we introduce a formulation that enables a collaborative robot to perform visual haptic reasoning with cloth -- the act of inferring the location and magnitude of applied forces during physical interaction. We present two distinct model representations, trained in physics simulation, that enable haptic reasoning using only visual and robot kinematic observations. We conducted quantitative evaluations of these models in simulation for robot-assisted dressing, bathing, and dish washing tasks, and demonstrate that the trained models can generalize across different tasks with varying interactions, human body sizes, and object shapes. We also present results with a real-world mobile manipulator, which used our simulation-trained models to estimate applied contact forces while performing physically assistive tasks with cloth. Videos can be found at our project webpage.

* RA-L with presentation at IROS 2022. Project website: https://sites.google.com/view/visualhapticreasoning/home

Via

Access Paper or Ask Questions

Learning to Singulate Layers of Cloth using Tactile Feedback

Jul 22, 2022

Sashank Tirumala, Thomas Weng, Daniel Seita, Oliver Kroemer, Zeynep Temel, David Held

Figure 1 for Learning to Singulate Layers of Cloth using Tactile Feedback

Figure 2 for Learning to Singulate Layers of Cloth using Tactile Feedback

Figure 3 for Learning to Singulate Layers of Cloth using Tactile Feedback

Figure 4 for Learning to Singulate Layers of Cloth using Tactile Feedback

Abstract:Robotic manipulation of cloth has applications ranging from fabrics manufacturing to handling blankets and laundry. Cloth manipulation is challenging for robots largely due to their high degrees of freedom, complex dynamics, and severe self-occlusions when in folded or crumpled configurations. Prior work on robotic manipulation of cloth relies primarily on vision sensors alone, which may pose challenges for fine-grained manipulation tasks such as grasping a desired number of cloth layers from a stack of cloth. In this paper, we propose to use tactile sensing for cloth manipulation; we attach a tactile sensor (ReSkin) to one of the two fingertips of a Franka robot and train a classifier to determine whether the robot is grasping a specific number of cloth layers. During test-time experiments, the robot uses this classifier as part of its policy to grasp one or two cloth layers using tactile feedback to determine suitable grasping points. Experimental results over 180 physical trials suggest that the proposed method outperforms baselines that do not use tactile feedback and has better generalization to unseen cloth compared to methods that use image classifiers. Code, data, and videos are available at https://sites.google.com/view/reskin-cloth.

* IROS 2022. See https://sites.google.com/view/reskin-cloth for supplementary material

Via

Access Paper or Ask Questions

Learning Closed-loop Dough Manipulation Using a Differentiable Reset Module

Jul 11, 2022

Carl Qi, Xingyu Lin, David Held

Figure 1 for Learning Closed-loop Dough Manipulation Using a Differentiable Reset Module

Figure 2 for Learning Closed-loop Dough Manipulation Using a Differentiable Reset Module

Figure 3 for Learning Closed-loop Dough Manipulation Using a Differentiable Reset Module

Figure 4 for Learning Closed-loop Dough Manipulation Using a Differentiable Reset Module

Abstract:Deformable object manipulation has many applications such as cooking and laundry folding in our daily lives. Manipulating elastoplastic objects such as dough is particularly challenging because dough lacks a compact state representation and requires contact-rich interactions. We consider the task of flattening a piece of dough into a specific shape from RGB-D images. While the task is seemingly intuitive for humans, there exist local optima for common approaches such as naive trajectory optimization. We propose a novel trajectory optimizer that optimizes through a differentiable "reset" module, transforming a single-stage, fixed-initialization trajectory into a multistage, multi-initialization trajectory where all stages are optimized jointly. We then train a closed-loop policy on the demonstrations generated by our trajectory optimizer. Our policy receives partial point clouds as input, allowing ease of transfer from simulation to the real world. We show that our policy can perform real-world dough manipulation, flattening a ball of dough into a target shape.

Via

Access Paper or Ask Questions