Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Russ Tedrake

PoCo: Policy Composition from and for Heterogeneous Robot Learning

Feb 04, 2024

Lirui Wang, Jialiang Zhao, Yilun Du, Edward H. Adelson, Russ Tedrake

Figure 1 for PoCo: Policy Composition from and for Heterogeneous Robot Learning

Figure 2 for PoCo: Policy Composition from and for Heterogeneous Robot Learning

Figure 3 for PoCo: Policy Composition from and for Heterogeneous Robot Learning

Figure 4 for PoCo: Policy Composition from and for Heterogeneous Robot Learning

Abstract:Training general robotic policies from heterogeneous data for different tasks is a significant challenge. Existing robotic datasets vary in different modalities such as color, depth, tactile, and proprioceptive information, and collected in different domains such as simulation, real robots, and human videos. Current methods usually collect and pool all data from one domain to train a single policy to handle such heterogeneity in tasks and domains, which is prohibitively expensive and difficult. In this work, we present a flexible approach, dubbed Policy Composition, to combine information across such diverse modalities and domains for learning scene-level and task-level generalized manipulation skills, by composing different data distributions represented with diffusion models. Our method can use task-level composition for multi-task manipulation and be composed with analytic cost functions to adapt policy behaviors at inference time. We train our method on simulation, human, and real robot data and evaluate in tool-use tasks. The composed policy achieves robust and dexterous performance under varying scenes and tasks and outperforms baselines from a single data source in both simulation and real-world experiments. See https://liruiw.github.io/policycomp for more details .

Via

Access Paper or Ask Questions

Certifying Bimanual RRT Motion Plans in a Second

Oct 25, 2023

Alexandre Amice, Peter Werner, Russ Tedrake

Abstract:We present an efficient method for certifying non-collision for piecewise-polynomial motion plans in algebraic reparametrizations of configuration space. Such motion plans include those generated by popular randomized methods including RRTs and PRMs, as well as those generated by many methods in trajectory optimization. Based on Sums-of-Squares optimization, our method provides exact, rigorous certificates of non-collision; it can never falsely claim that a motion plan containing collisions is collision-free. We demonstrate that our formulation is practical for real world deployment, certifying the safety of a twelve degree of freedom motion plan in just over a second. Moreover, the method is capable of discriminating the safety or lack thereof of two motion plans which differ by only millimeters.

* 7 pages, 5 figures, 1 table

Via

Access Paper or Ask Questions

Approximating Robot Configuration Spaces with few Convex Sets using Clique Covers of Visibility Graphs

Oct 04, 2023

Peter Werner, Alexandre Amice, Tobia Marcucci, Daniela Rus, Russ Tedrake

Figure 1 for Approximating Robot Configuration Spaces with few Convex Sets using Clique Covers of Visibility Graphs

Figure 2 for Approximating Robot Configuration Spaces with few Convex Sets using Clique Covers of Visibility Graphs

Figure 3 for Approximating Robot Configuration Spaces with few Convex Sets using Clique Covers of Visibility Graphs

Figure 4 for Approximating Robot Configuration Spaces with few Convex Sets using Clique Covers of Visibility Graphs

Abstract:Many computations in robotics can be dramatically accelerated if the robot configuration space is described as a collection of simple sets. For example, recently developed motion planners rely on a convex decomposition of the free space to design collision-free trajectories using fast convex optimization. In this work, we present an efficient method for approximately covering complex configuration spaces with a small number of polytopes. The approach constructs a visibility graph using sampling and generates a clique cover of this graph to find clusters of samples that have mutual line of sight. These clusters are then inflated into large, full-dimensional, polytopes. We evaluate our method on a variety of robotic systems and show that it consistently covers larger portions of free configuration space, with fewer polytopes, and in a fraction of the time compared to previous methods.

* 7 pages, 6 figures, under review for possible publication at ICRA 2024

Via

Access Paper or Ask Questions

Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Oct 02, 2023

Lirui Wang, Kaiqing Zhang, Allan Zhou, Max Simchowitz, Russ Tedrake

Figure 1 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Figure 2 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Figure 3 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Figure 4 for Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Abstract:Fleets of robots ingest massive amounts of streaming data generated by interacting with their environments, far more than those that can be stored or transmitted with ease. At the same time, we hope that teams of robots can co-acquire diverse skills through their experiences in varied settings. How can we enable such fleet-level learning without having to transmit or centralize fleet-scale data? In this paper, we investigate distributed learning of policies as a potential solution. To efficiently merge policies in the distributed setting, we propose fleet-merge, an instantiation of distributed learning that accounts for the symmetries that can arise in learning policies that are parameterized by recurrent neural networks. We show that fleet-merge consolidates the behavior of policies trained on 50 tasks in the Meta-World environment, with the merged policy achieving good performance on nearly all training tasks at test time. Moreover, we introduce a novel robotic tool-use benchmark, fleet-tools, for fleet policy learning in compositional and contact-rich robot manipulation tasks, which might be of broader interest, and validate the efficacy of fleet-merge on the benchmark.

* See the code https://github.com/liruiw/Fleet-Tools for more details

Via

Access Paper or Ask Questions

Constrained Bimanual Planning with Analytic Inverse Kinematics

Sep 15, 2023

Thomas Cohn, Seiji Shaw, Max Simchowitz, Russ Tedrake

Abstract:In order for a bimanual robot to manipulate an object that is held by both hands, it must construct motion plans such that the transformation between its end effectors remains fixed. This amounts to complicated nonlinear equality constraints in the configuration space, which are difficult for trajectory optimizers. In addition, the set of feasible configurations becomes a measure zero set, which presents a challenge to sampling-based motion planners. We leverage an analytic solution to the inverse kinematics problem to parametrize the configuration space, resulting in a lower-dimensional representation where the set of valid configurations has positive measure. We describe how to use this parametrization with existing algorithms for motion planning, including sampling-based approaches, trajectory optimizers, and techniques that plan through convex inner-approximations of collision-free space.

* Submitted to ICRA 2024. 8 pages, 5 figures. Interactive results available at https://cohnt.github.io/Bimanual-Web/index.html

Via

Access Paper or Ask Questions

Proximity and Visuotactile Point Cloud Fusion for Contact Patches in Extreme Deformation

Jul 07, 2023

Jessica Yin, Paarth Shah, Naveen Kuppuswamy, Andrew Beaulieu, Avinash Uttamchandani, Alejandro Castro, James Pikul, Russ Tedrake

Figure 1 for Proximity and Visuotactile Point Cloud Fusion for Contact Patches in Extreme Deformation

Figure 2 for Proximity and Visuotactile Point Cloud Fusion for Contact Patches in Extreme Deformation

Figure 3 for Proximity and Visuotactile Point Cloud Fusion for Contact Patches in Extreme Deformation

Figure 4 for Proximity and Visuotactile Point Cloud Fusion for Contact Patches in Extreme Deformation

Abstract:Equipping robots with the sense of touch is critical to emulating the capabilities of humans in real world manipulation tasks. Visuotactile sensors are a popular tactile sensing strategy due to data output compatible with computer vision algorithms and accurate, high resolution estimates of local object geometry. However, these sensors struggle to accommodate high deformations of the sensing surface during object interactions, hindering more informative contact with cm-scale objects frequently encountered in the real world. The soft interfaces of visuotactile sensors are often made of hyperelastic elastomers, which are difficult to simulate quickly and accurately when extremely deformed for tactile information. Additionally, many visuotactile sensors that rely on strict internal light conditions or pattern tracking will fail if the surface is highly deformed. In this work, we propose an algorithm that fuses proximity and visuotactile point clouds for contact patch segmentation that is entirely independent from membrane mechanics. This algorithm exploits the synchronous, high-res proximity and visuotactile modalities enabled by an extremely deformable, selectively transmissive soft membrane, which uses visible light for visuotactile sensing and infrared light for proximity depth. We present the hardware design, membrane fabrication, and evaluation of our contact patch algorithm in low (10%), medium (60%), and high (100%+) membrane strain states. We compare our algorithm against three baselines: proximity-only, tactile-only, and a membrane mechanics model. Our proposed algorithm outperforms all baselines with an average RMSE under 2.8mm of the contact patch geometry across all strain ranges. We demonstrate our contact patch algorithm in four applications: varied stiffness membranes, torque and shear-induced wrinkling, closed loop control for whole body manipulation, and pose estimation.

Via

Access Paper or Ask Questions

Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

Jun 24, 2023

H. J. Terry Suh, Glen Chou, Hongkai Dai, Lujie Yang, Abhishek Gupta, Russ Tedrake

Figure 1 for Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

Figure 2 for Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

Figure 3 for Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

Figure 4 for Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

Abstract:Offline optimization paradigms such as offline Reinforcement Learning (RL) or Imitation Learning (IL) allow policy search algorithms to make use of offline data, but require careful incorporation of uncertainty in order to circumvent the challenges of distribution shift. Gradient-based policy search methods are a promising direction due to their effectiveness in high dimensions; however, we require a more careful consideration of how these methods interplay with uncertainty estimation. We claim that in order for an uncertainty metric to be amenable for gradient-based optimization, it must be (i) stably convergent to data when uncertainty is minimized with gradients, and (ii) not prone to underestimation of true uncertainty. We investigate smoothed distance to data as a metric, and show that it not only stably converges to data, but also allows us to analyze model bias with Lipschitz constants. Moreover, we establish an equivalence between smoothed distance to data and data likelihood, which allows us to use score-matching techniques to learn gradients of distance to data. Importantly, we show that offline model-based policy search problems that maximize data likelihood do not require values of likelihood; but rather only the gradient of the log likelihood (the score function). Using this insight, we propose Score-Guided Planning (SGP), a planning algorithm for offline RL that utilizes score-matching to enable first-order planning in high-dimensional problems, where zeroth-order methods were unable to scale, and ensembles were unable to overcome local minima. Website: https://sites.google.com/view/score-guided-planning/home

* Glen Chou, Hongkai Dai, and Lujie Yang contributed equally to this work

Via

Access Paper or Ask Questions

Non-Euclidean Motion Planning with Graphs of Geodesically-Convex Sets

May 11, 2023

Thomas Cohn, Mark Petersen, Max Simchowitz, Russ Tedrake

Figure 1 for Non-Euclidean Motion Planning with Graphs of Geodesically-Convex Sets

Figure 2 for Non-Euclidean Motion Planning with Graphs of Geodesically-Convex Sets

Figure 3 for Non-Euclidean Motion Planning with Graphs of Geodesically-Convex Sets

Figure 4 for Non-Euclidean Motion Planning with Graphs of Geodesically-Convex Sets

Abstract:Computing optimal, collision-free trajectories for high-dimensional systems is a challenging problem. Sampling-based planners struggle with the dimensionality, whereas trajectory optimizers may get stuck in local minima due to inherent nonconvexities in the optimization landscape. The use of mixed-integer programming to encapsulate these nonconvexities and find globally optimal trajectories has recently shown great promise, thanks in part to tight convex relaxations and efficient approximation strategies that greatly reduce runtimes. These approaches were previously limited to Euclidean configuration spaces, precluding their use with mobile bases or continuous revolute joints. In this paper, we handle such scenarios by modeling configuration spaces as Riemannian manifolds, and we describe a reduction procedure for the zero-curvature case to a mixed-integer convex optimization problem. We demonstrate our results on various robot platforms, including producing efficient collision-free trajectories for a PR2 bimanual mobile manipulator.

* 14 pages, 11 figures. To appear at RSS 2023. Interactive results available at https://ggcs-anonymous-submission.github.io/

Via

Access Paper or Ask Questions

Fast Path Planning Through Large Collections of Safe Boxes

May 01, 2023

Tobia Marcucci, Parth Nobel, Russ Tedrake, Stephen Boyd

Figure 1 for Fast Path Planning Through Large Collections of Safe Boxes

Figure 2 for Fast Path Planning Through Large Collections of Safe Boxes

Figure 3 for Fast Path Planning Through Large Collections of Safe Boxes

Figure 4 for Fast Path Planning Through Large Collections of Safe Boxes

Abstract:We present a fast algorithm for the design of smooth paths (or trajectories) that are constrained to lie in a collection of axis-aligned boxes. We consider the case where the number of these safe boxes is large, and basic preprocessing of them (such as finding their intersections) can be done offline. At runtime we quickly generate a smooth path between given initial and terminal positions. Our algorithm designs trajectories that are guaranteed to be safe at all times, and it detects infeasibility whenever such a trajectory does not exist. Our algorithm is based on two subproblems that we can solve very efficiently: finding a shortest path in a weighted graph, and solving (multiple) convex optimal control problems. We demonstrate the proposed path planner on large-scale numerical examples, and we provide an efficient open-source software implementation, fastpathplanning.

Via

Access Paper or Ask Questions

Suboptimal Controller Synthesis for Cart-Poles and Quadrotors via Sums-of-Squares

Apr 25, 2023

Lujie Yang, Hongkai Dai, Alexandre Amice, Russ Tedrake

Figure 1 for Suboptimal Controller Synthesis for Cart-Poles and Quadrotors via Sums-of-Squares

Figure 2 for Suboptimal Controller Synthesis for Cart-Poles and Quadrotors via Sums-of-Squares

Figure 3 for Suboptimal Controller Synthesis for Cart-Poles and Quadrotors via Sums-of-Squares

Figure 4 for Suboptimal Controller Synthesis for Cart-Poles and Quadrotors via Sums-of-Squares

Abstract:Sums-of-squares (SOS) optimization is a promising tool to synthesize certifiable controllers, but most examples to date have been limited to relatively simple systems. Here we demonstrate that SOS can synthesize controllers with bounded suboptimal performance for various underactuated robotic systems by finding good approximations of the value function. We summarize a unified SOS framework to synthesize both under- and over- approximations of the value function for continuous-time, control-affine systems, use these approximations to generate suboptimal controllers, and perform regional analysis on the closed-loop system driven by these controllers. We then extend the formulation to handle hybrid systems with contacts. We demonstrate that our method can generate tight under- and over- approximations of the value function with low-degree polynomials, which are used to provide stabilizing controllers for continuous-time systems including the inverted pendulum, the cart-pole, and the 3D quadrotor as well as a hybrid system, the planar pusher. To the best of our knowledge, this is the first time that a SOS-based time-invariant controller can swing up and stabilize a cart-pole, and push the planar slider to the desired pose.

Via

Access Paper or Ask Questions