Gabriele Fadini

Differentiable Environment-Trajectory Co-Optimization for Safe Multi-Agent Navigation

Apr 08, 2026

Zhan Gao, Gabriele Fadini, Stelian Coros, Amanda Prorok

Abstract: The environment plays a critical role in multi-agent navigation by imposing spatial constraints, rules, and limitations that agents must navigate around. Traditional approaches treat the environment as fixed, without exploring its impact on agents' performance. This work considers environment configurations as decision variables, alongside agent actions, to jointly achieve safe navigation. We formulate a bi-level problem, where the lower-level sub-problem optimizes agent trajectories that minimize navigation cost and the upper-level sub-problem optimizes environment configurations that maximize navigation safety. We develop a differentiable optimization method that iteratively solves the lower-level sub-problem with interior point methods and the upper-level sub-problem with gradient ascent. A key challenge lies in analytically coupling these two levels. We address this by leveraging KKT conditions and the Implicit Function Theorem to compute gradients of agent trajectories w.r.t. environment parameters, enabling differentiation throughout the bi-level structure. Moreover, we propose a novel metric that quantifies navigation safety as a criterion for the upper-level environment optimization, and prove its validity through measure theory. Our experiments validate the effectiveness of the proposed framework in a variety of safety-critical navigation scenarios, inspired by settings ranging from warehouse logistics to urban transportation. The results demonstrate that optimized environments provide navigation guidance, improving both agents' safety and efficiency.
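The gradient coupling described in the abstract can be illustrated on a toy problem. The sketch below is not the paper's method: it assumes a hypothetical one-dimensional, unconstrained lower-level problem, so the KKT conditions reduce to a single stationarity equation g(x, theta) = 0, and it uses Newton's method in place of an interior point solver. The cost function and the names `solve_lower`, `implicit_grad`, `goal`, and `k` are all invented for illustration. The Implicit Function Theorem then gives dx*/dtheta = -(dg/dx)^(-1) dg/dtheta.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def solve_lower(theta, goal=1.0, k=2.0, iters=50):
    """Toy lower level (assumed, not from the paper): minimize over x
    0.5 * (x - goal)**2 + k * softplus(theta - x),
    i.e. reach `goal` while being softly pushed above a wall at `theta`."""
    x = goal
    for _ in range(iters):
        s = sigmoid(theta - x)
        g = (x - goal) - k * s           # stationarity residual d(cost)/dx
        h = 1.0 + k * s * (1.0 - s)      # dg/dx, always >= 1
        x -= g / h                       # Newton step
    return x


def implicit_grad(theta, goal=1.0, k=2.0):
    """dx*/dtheta from the Implicit Function Theorem applied to g(x, theta) = 0."""
    x = solve_lower(theta, goal, k)
    s = sigmoid(theta - x)
    dg_dx = 1.0 + k * s * (1.0 - s)
    dg_dtheta = -k * s * (1.0 - s)
    return -dg_dtheta / dg_dx


def finite_diff_grad(theta, eps=1e-6):
    """Central finite difference through the solver, used only as a check."""
    return (solve_lower(theta + eps) - solve_lower(theta - eps)) / (2.0 * eps)
```

Comparing `implicit_grad(0.5)` with `finite_diff_grad(0.5)` is a quick sanity check: the implicit gradient needs one solve and no re-solving of the lower level, which is what makes an outer gradient step on the environment parameter cheap.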

Differentiable Material Point Method for the Control of Deformable Objects

Dec 15, 2025

Diego Bolliger, Gabriele Fadini, Markus Bambach, Alisa Rupenyan

Abstract: Controlling the deformation of flexible objects is challenging due to their non-linear dynamics and high-dimensional configuration space. This work presents a differentiable Material Point Method (MPM) simulator targeted at control applications. We exploit the differentiability of the simulator to optimize a control trajectory in an active damping problem for a hyperelastic rope. The simulator effectively minimizes the kinetic energy of the rope around 2$\times$ faster than a baseline MPPI method and to a 20% lower energy level, while using about 3% of the computation time.

* 7 Pages, 4 Figures, 1 Table

RAMBO: RL-augmented Model-based Optimal Control for Whole-body Loco-manipulation

Apr 09, 2025

Jin Cheng, Dongho Kang, Gabriele Fadini, Guanya Shi, Stelian Coros

Abstract: Loco-manipulation -- coordinated locomotion and physical interaction with objects -- remains a major challenge for legged robots due to the need for both accurate force interaction and robustness to unmodeled dynamics. While model-based controllers provide interpretable dynamics-level planning and optimization, they are limited by model inaccuracies and computational cost. In contrast, learning-based methods offer robustness while struggling with precise modulation of interaction forces. We introduce RAMBO -- RL-Augmented Model-Based Optimal Control -- a hybrid framework that integrates model-based reaction force optimization using a simplified dynamics model and a feedback policy trained with reinforcement learning. The model-based module generates feedforward torques by solving a quadratic program, while the policy provides feedback residuals to enhance robustness in control execution. We validate our framework on a quadruped robot across a diverse set of real-world loco-manipulation tasks -- such as pushing a shopping cart, balancing a plate, and holding soft objects -- in both quadrupedal and bipedal walking. Our experiments demonstrate that RAMBO enables precise manipulation while achieving robust and dynamic locomotion, surpassing the performance of policies trained with an end-to-end scheme. In addition, our method enables a flexible trade-off between end-effector tracking accuracy and compliance.

* 9 pages, 6 figures

Improving generalization of robot locomotion policies via Sharpness-Aware Reinforcement Learning

Nov 29, 2024

Severin Bochem, Eduardo Gonzalez-Sanchez, Yves Bicker, Gabriele Fadini

Abstract: Reinforcement learning often requires extensive training data. Simulation-to-real transfer offers a promising approach to address this challenge in robotics. While differentiable simulators offer improved sample efficiency through exact gradients, they can be unstable in contact-rich environments and may lead to poor generalization. This paper introduces a novel approach integrating sharpness-aware optimization into gradient-based reinforcement learning algorithms. Our simulation results demonstrate that our method, tested on contact-rich environments, significantly enhances policy robustness to environmental variations and action perturbations while maintaining the sample efficiency of first-order methods. Specifically, our approach improves action noise tolerance compared to standard first-order methods and achieves generalization comparable to zeroth-order methods. This improvement stems from finding flatter minima in the loss landscape, associated with better generalization. Our work offers a promising solution to balance efficient learning and robust sim-to-real transfer in robotics, potentially bridging the gap between simulation and real-world performance.
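For readers unfamiliar with sharpness-aware optimization, the sketch below shows the mechanics of a generic sharpness-aware update on a toy loss. It is not the paper's algorithm: the quadratic `loss`, the step size `lr`, and the perturbation radius `rho` are assumptions chosen only for illustration, and no simulator or policy is involved. Each step first moves the parameters a distance `rho` along the normalized gradient (the locally worst direction), then applies the gradient measured at that perturbed point to the original parameters.

```python
import math


def loss(w):
    # Toy stand-in for a policy loss (assumed); minimum at w = [1, 1, ...].
    return sum((wi - 1.0) ** 2 for wi in w)


def grad(w):
    # Analytic gradient of the toy loss above.
    return [2.0 * (wi - 1.0) for wi in w]


def sam_step(w, lr=0.1, rho=0.05):
    """One sharpness-aware update: perturb, re-evaluate the gradient, descend."""
    g = grad(w)
    norm = math.sqrt(sum(gi * gi for gi in g))
    if norm == 0.0:
        return list(w)  # already at a stationary point
    # Ascent step of radius rho along the normalized gradient.
    w_adv = [wi + rho * gi / norm for wi, gi in zip(w, g)]
    # Gradient at the perturbed point, applied to the original parameters.
    g_adv = grad(w_adv)
    return [wi - lr * gi for wi, gi in zip(w, g_adv)]


def train(w0, steps=200):
    w = list(w0)
    for _ in range(steps):
        w = sam_step(w)
    return w
```

On this convex toy loss the update simply converges to a small neighborhood of the minimum whose size scales with `rho`; the preference for flatter minima mentioned in the abstract only becomes visible on losses that have both sharp and flat basins.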

* 9 pages, 6 figures
