Daan Brinks

Modular Recurrence in Contextual MDPs for Universal Morphology Control

Jun 10, 2025

Laurens Engwegen, Daan Brinks, Wendelin Böhmer

Abstract: A universal controller for any robot morphology would greatly improve computational and data efficiency. By utilizing contextual information about the properties of individual robots and exploiting their modular structure in the architecture of deep reinforcement learning agents, steps have been made towards multi-robot control. Generalization to new, unseen robots, however, remains a challenge. In this paper we hypothesize that the relevant contextual information is partially observable, but that it can be inferred through interactions for better generalization to contexts that are not seen during training. To this end, we implement a modular recurrent architecture and evaluate its generalization performance on a large set of MuJoCo robots. The results show substantially improved performance on robots with unseen dynamics, kinematics, and topologies, in four different environments.
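To make the idea of a modular recurrent architecture concrete, here is a minimal sketch in which one bias-free GRU-style cell is shared across all modules (e.g. limbs) of a robot, so the same weights apply to morphologies with different numbers of modules and the hidden state can accumulate context from interaction. This is an illustration under assumptions, not the architecture from the paper: the class name `SharedGRUCell`, the dimensions, and the per-module observation layout are all hypothetical.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class SharedGRUCell:
    """Hypothetical GRU-style cell whose weights are shared by every module."""

    def __init__(self, obs_dim, hidden_dim, seed=0):
        rng = np.random.default_rng(seed)
        in_dim = obs_dim + hidden_dim
        # One set of weights, reused for every module of every robot.
        self.W_z = rng.normal(0.0, 0.1, (in_dim, hidden_dim))  # update gate
        self.W_r = rng.normal(0.0, 0.1, (in_dim, hidden_dim))  # reset gate
        self.W_h = rng.normal(0.0, 0.1, (in_dim, hidden_dim))  # candidate state

    def step(self, obs, h):
        """obs: (num_modules, obs_dim), h: (num_modules, hidden_dim)."""
        x = np.concatenate([obs, h], axis=-1)
        z = sigmoid(x @ self.W_z)
        r = sigmoid(x @ self.W_r)
        x_reset = np.concatenate([obs, r * h], axis=-1)
        h_candidate = np.tanh(x_reset @ self.W_h)
        # Blend the previous state with the candidate state per module.
        return (1.0 - z) * h + z * h_candidate


cell = SharedGRUCell(obs_dim=4, hidden_dim=8)
h_small = cell.step(np.ones((3, 4)), np.zeros((3, 8)))  # robot with 3 modules
h_large = cell.step(np.ones((5, 4)), np.zeros((5, 8)))  # robot with 5 modules
print(h_small.shape, h_large.shape)
```

Because each module is processed row by row with the same weights, the cell accepts any number of modules without retraining or reshaping; a full agent would additionally need message passing between modules and a policy head, which this sketch omits.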

Learning Physics From Video: Unsupervised Physical Parameter Estimation for Continuous Dynamical Systems

Oct 02, 2024

Alejandro Castañeda Garcia, Jan van Gemert, Daan Brinks, Nergis Tömen

Abstract: Extracting physical dynamical system parameters from videos is of great interest to applications in natural science and technology. State-of-the-art automatic parameter estimation from video relies on training supervised deep networks on large datasets. Such datasets require labels, which are difficult to acquire. While some unsupervised techniques (which depend on frame prediction) exist, they suffer from long training times and instability under different initializations, and are limited to hand-picked motion problems. In this work, we propose a method to estimate the physical parameters of any known, continuous governing equation from single videos; our solution is suitable for different dynamical systems beyond motion and is robust to initialization compared to previous approaches. Moreover, we remove the need for frame prediction by implementing a KL-divergence-based loss function in the latent space, which avoids convergence to trivial solutions and reduces model size and compute.
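As a rough illustration of what a KL-divergence term in a latent space can look like, the sketch below computes the closed-form KL divergence between two diagonal Gaussians. It is a generic textbook formula, not the loss defined in the paper: the choice of Gaussian latents, the function name `kl_diag_gaussians`, and the log-variance parameterization are assumptions made for this example.

```python
import numpy as np


def kl_diag_gaussians(mu_q, logvar_q, mu_p, logvar_p):
    """KL(q || p) for diagonal Gaussians, summed over the last (latent) axis."""
    var_q = np.exp(logvar_q)
    var_p = np.exp(logvar_p)
    per_dim = 0.5 * (
        logvar_p - logvar_q + (var_q + (mu_q - mu_p) ** 2) / var_p - 1.0
    )
    return per_dim.sum(axis=-1)


# Hypothetical 2-D latent: q has mean [1, 0], p is a standard normal.
mu_q = np.array([1.0, 0.0])
zeros = np.zeros(2)
print(kl_diag_gaussians(mu_q, zeros, zeros, zeros))  # prints 0.5
```

The divergence is zero only when the two distributions match, so minimizing it pulls one latent distribution towards the other without reconstructing any frames in pixel space.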
