Alert button
Picture for Rishabh Madan

Rishabh Madan

Alert button

Real Robot Challenge 2021: Cartesian Position Control with Triangle Grasp and Trajectory Interpolation

Mar 19, 2022
Rishabh Madan, Harshit Sikchi, Ethan K. Gordon, Tapomayukh Bhattacharjee

Figure 1 for Real Robot Challenge 2021: Cartesian Position Control with Triangle Grasp and Trajectory Interpolation
Figure 2 for Real Robot Challenge 2021: Cartesian Position Control with Triangle Grasp and Trajectory Interpolation
Figure 3 for Real Robot Challenge 2021: Cartesian Position Control with Triangle Grasp and Trajectory Interpolation
Figure 4 for Real Robot Challenge 2021: Cartesian Position Control with Triangle Grasp and Trajectory Interpolation

We present our runner-up approach for the Real Robot Challenge 2021. We build upon our previous approach used in Real Robot Challenge 2020. To solve the task of sequential goal-reaching we focus on two aspects to achieving near-optimal trajectory: Grasp stability and Controller performance. In the RRC 2021 simulated challenge, our method relied on a hand-designed Pinch grasp combined with Trajectory Interpolation for better stability during the motion for fast goal-reaching. In Stage 1, we observe reverting to a Triangular grasp to provide a more stable grasp when combined with Trajectory Interpolation, possibly due to the sim2real gap. The video demonstration for our approach is available at https://youtu.be/dlOueoaRWrM. The code is publicly available at https://github.com/madan96/benchmark-rrc.

Viaarxiv icon

A Robot Cluster for Reproducible Research in Dexterous Manipulation

Sep 22, 2021
Stefan Bauer, Felix Widmaier, Manuel Wüthrich, Niklas Funk, Julen Urain De Jesus, Jan Peters, Joe Watson, Claire Chen, Krishnan Srinivasan, Junwu Zhang, Jeffrey Zhang, Matthew R. Walter, Rishabh Madan, Charles Schaff, Takahiro Maeda, Takuma Yoneda, Denis Yarats, Arthur Allshire, Ethan K. Gordon, Tapomayukh Bhattacharjee, Siddhartha S. Srinivasa, Animesh Garg, Annika Buchholz, Sebastian Stark, Thomas Steinbrenner, Joel Akpo, Shruti Joshi, Vaibhav Agrawal, Bernhard Schölkopf

Figure 1 for A Robot Cluster for Reproducible Research in Dexterous Manipulation
Figure 2 for A Robot Cluster for Reproducible Research in Dexterous Manipulation
Figure 3 for A Robot Cluster for Reproducible Research in Dexterous Manipulation
Figure 4 for A Robot Cluster for Reproducible Research in Dexterous Manipulation

Dexterous manipulation remains an open problem in robotics. To coordinate efforts of the research community towards tackling this problem, we propose a shared benchmark. We designed and built robotic platforms that are hosted at the MPI-IS and can be accessed remotely. Each platform consists of three robotic fingers that are capable of dexterous object manipulation. Users are able to control the platforms remotely by submitting code that is executed automatically, akin to a computational cluster. Using this setup, i) we host robotics competitions, where teams from anywhere in the world access our platforms to tackle challenging tasks, ii) we publish the datasets collected during these competitions (consisting of hundreds of robot hours), and iii) we give researchers access to these platforms for their own projects.

Viaarxiv icon

Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation

May 05, 2021
Niklas Funk, Charles Schaff, Rishabh Madan, Takuma Yoneda, Julen Urain De Jesus, Joe Watson, Ethan K. Gordon, Felix Widmaier, Stefan Bauer, Siddhartha S. Srinivasa, Tapomayukh Bhattacharjee, Matthew R. Walter, Jan Peters

Figure 1 for Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Figure 2 for Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Figure 3 for Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Figure 4 for Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation

Dexterous manipulation is a challenging and important problem in robotics. While data-driven methods are a promising approach, current benchmarks require simulation or extensive engineering support due to the sample inefficiency of popular methods. We present benchmarks for the TriFinger system, an open-source robotic platform for dexterous manipulation and the focus of the 2020 Real Robot Challenge. The benchmarked methods, which were successful in the challenge, can be generally described as structured policies, as they combine elements of classical robotics and modern policy optimization. This inclusion of inductive biases facilitates sample efficiency, interpretability, reliability and high performance. The key aspects of this benchmarking is validation of the baselines across both simulation and the real system, thorough ablation study over the core features of each solution, and a retrospective analysis of the challenge as a manipulation benchmark. The code and demo videos for this work can be found on our website (https://sites.google.com/view/benchmark-rrc).

Viaarxiv icon

Multimodal Trajectory Prediction via Topological Invariance for Navigation at Uncontrolled Intersections

Nov 08, 2020
Junha Roh, Christoforos Mavrogiannis, Rishabh Madan, Dieter Fox, Siddhartha S. Srinivasa

Figure 1 for Multimodal Trajectory Prediction via Topological Invariance for Navigation at Uncontrolled Intersections
Figure 2 for Multimodal Trajectory Prediction via Topological Invariance for Navigation at Uncontrolled Intersections
Figure 3 for Multimodal Trajectory Prediction via Topological Invariance for Navigation at Uncontrolled Intersections
Figure 4 for Multimodal Trajectory Prediction via Topological Invariance for Navigation at Uncontrolled Intersections

We focus on decentralized navigation among multiple non-communicating rational agents at \emph{uncontrolled} intersections, i.e., street intersections without traffic signs or signals. Avoiding collisions in such domains relies on the ability of agents to predict each others' intentions reliably, and react quickly. Multiagent trajectory prediction is NP-hard whereas the sample complexity of existing data-driven approaches limits their applicability. Our key insight is that the geometric structure of the intersection and the incentive of agents to move efficiently and avoid collisions (rationality) reduces the space of likely behaviors, effectively relaxing the problem of trajectory prediction. In this paper, we collapse the space of multiagent trajectories at an intersection into a set of modes representing different classes of multiagent behavior, formalized using a notion of topological invariance. Based on this formalism, we design Multiple Topologies Prediction (MTP), a data-driven trajectory-prediction mechanism that reconstructs trajectory representations of high-likelihood modes in multiagent intersection scenes. We show that MTP outperforms a state-of-the-art multimodal trajectory prediction baseline (MFP) in terms of prediction accuracy by 78.24% on a challenging simulated dataset. Finally, we show that MTP enables our optimization-based planner, MTPnav, to achieve collision-free and time-efficient navigation across a variety of challenging intersection scenarios on the CARLA simulator.

* Preprint of a paper with the same title, accepted to the Conference on Robot Learning 2020 
Viaarxiv icon

ExTra: Transfer-guided Exploration

Jun 27, 2019
Anirban Santara, Rishabh Madan, Balaraman Ravindran, Pabitra Mitra

Figure 1 for ExTra: Transfer-guided Exploration
Figure 2 for ExTra: Transfer-guided Exploration
Figure 3 for ExTra: Transfer-guided Exploration
Figure 4 for ExTra: Transfer-guided Exploration

In this work we present a novel approach for transfer-guided exploration in reinforcement learning that is inspired by the human tendency to leverage experiences from similar encounters in the past while navigating a new task. Given an optimal policy in a related task-environment, we show that its bisimulation distance from the current task-environment gives a lower bound on the optimal advantage of state-action pairs in the current task-environment. Transfer-guided Exploration (ExTra) samples actions from a Softmax distribution over these lower bounds. In this way, actions with potentially higher optimum advantage are sampled more frequently. In our experiments on gridworld environments, we demonstrate that given access to an optimal policy in a related task-environment, ExTra can outperform popular domain-specific exploration strategies viz. epsilon greedy, Model-Based Interval Estimation - Exploration Based (MBIE-EB), Pursuit and Boltzmann in terms of sample complexity and rate of convergence. We further show that ExTra is robust to choices of source task and shows a graceful degradation of performance as the dissimilarity of the source task increases. We also demonstrate that ExTra, when used alongside traditional exploration algorithms, improves their rate of convergence. Thus it is capable of complimenting the efficacy of traditional exploration algorithms.

* Under review at NeurIPS 2019 
Viaarxiv icon