Abstract: This work presents an overview of the technical details behind a high-performance reinforcement learning policy deployment with the Spot RL Researcher Development Kit for low-level motor access on Boston Dynamics Spot. This represents the first public demonstration of an end-to-end reinforcement learning policy deployed on Spot hardware, with training code publicly available through Nvidia IsaacLab and deployment code available through Boston Dynamics. We utilize Wasserstein Distance and Maximum Mean Discrepancy to quantify the distributional dissimilarity of data collected on hardware and in simulation in order to measure our sim2real gap. We use these measures as a scoring function for the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) to optimize simulated parameters that are unknown or difficult to measure from Spot. Our procedure for modeling and training produces high-quality reinforcement learning policies capable of multiple gaits, including a flight phase. We deploy policies capable of over 5.2 m/s locomotion, more than triple the maximum speed of Spot's default controller, robustness to slippery surfaces, disturbance rejection, and overall agility previously unseen on Spot. We detail our method and release our code to support future work on Spot with the low-level API.
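The parameter-identification loop this abstract describes can be sketched as follows. This is a minimal illustration under stated assumptions, not the released code: the `rollout` function and the hardware log are hypothetical placeholders, and it assumes the `cma` (pycma) and SciPy packages; the real pipeline rolls out in Nvidia IsaacLab and logs states from Spot hardware.

```python
import numpy as np
import cma                                  # pycma package
from scipy.stats import wasserstein_distance

def mmd_rbf(x, y, sigma=1.0):
    """Maximum Mean Discrepancy between sample sets x, y (RBF kernel)."""
    def k(a, b):
        d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
        return np.exp(-d2 / (2.0 * sigma**2))
    return k(x, x).mean() + k(y, y).mean() - 2.0 * k(x, y).mean()

def rollout(params):
    """Hypothetical stand-in for a simulated rollout under candidate
    parameters (e.g. friction, mass offsets); returns states (T x dim)."""
    rng = np.random.default_rng(0)
    return rng.normal(loc=params, scale=0.5, size=(500, 4))

def sim2real_score(params, real_traj):
    """Gap score: per-dimension 1-D Wasserstein plus full-state MMD."""
    sim_traj = rollout(params)
    w = sum(wasserstein_distance(sim_traj[:, i], real_traj[:, i])
            for i in range(real_traj.shape[1]))
    return w + mmd_rbf(sim_traj, real_traj)

# Hypothetical hardware log (would come from Spot's low-level telemetry).
real_traj = np.random.default_rng(1).normal(loc=0.2, scale=0.6, size=(500, 4))

# CMA-ES searches parameter space to minimize the distributional gap.
es = cma.CMAEvolutionStrategy(x0=np.zeros(4), sigma0=0.3)
while not es.stop():
    candidates = es.ask()
    es.tell(candidates, [sim2real_score(p, real_traj) for p in candidates])
print("identified sim parameters:", es.result.xbest)
```

Using both measures hedges their weaknesses: the 1-D Wasserstein terms are cheap and interpretable per state dimension, while the kernel MMD captures joint structure across dimensions.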
Abstract: We propose MIMOC: Motion Imitation from Model-Based Optimal Control. MIMOC is a Reinforcement Learning (RL) controller that learns agile locomotion by imitating reference trajectories from model-based optimal control. MIMOC mitigates challenges faced by other motion imitation RL approaches because the references are dynamically consistent, require no motion retargeting, and include torque references. Hence, MIMOC does not require fine-tuning. MIMOC is also less sensitive to modeling and state estimation inaccuracies than model-based controllers. We validate MIMOC on the Mini-Cheetah in outdoor environments over a wide variety of challenging terrain, and on the MIT Humanoid in simulation. We show cases where MIMOC outperforms model-based optimal controllers, and show that imitating torque references improves the policy's performance.
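The imitation objective the abstract alludes to, including the torque-reference term it credits for improved performance, can be sketched as a per-step tracking reward. This is a hedged reading of the abstract, not the paper's actual reward: the exponentiated-error form, weights, and names are illustrative assumptions.

```python
import numpy as np

def imitation_reward(q, dq, tau, ref, w_q=5.0, w_dq=0.1, w_tau=0.01):
    """Illustrative imitation reward (form and weights are assumptions).

    q, dq : measured joint positions / velocities
    tau   : torques commanded by the policy
    ref   : dict of reference 'q', 'dq', 'tau' from model-based optimal
            control; dynamically consistent, so no retargeting is needed.
    """
    r_q   = np.exp(-w_q   * np.sum((q   - ref["q"])   ** 2))
    r_dq  = np.exp(-w_dq  * np.sum((dq  - ref["dq"])  ** 2))
    r_tau = np.exp(-w_tau * np.sum((tau - ref["tau"]) ** 2))  # torque term
    return r_q + r_dq + r_tau
```

Because the references come from an optimal controller rather than motion capture, a torque target exists alongside the kinematic targets, which is what lets a term like `r_tau` be included at all.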