Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy

Aug 20, 2025

Yijin Chen, Wenqiang Xu, Zhenjun Yu, Tutian Tang, Yutong Li, Siqiong Yao, Cewu Lu

Figure 1 for FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy

Figure 2 for FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy

Figure 3 for FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy

Figure 4 for FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy

Share this with someone who'll enjoy it:

Abstract:Dexterous in-hand manipulation is a long-standing challenge in robotics due to complex contact dynamics and partial observability. While humans synergize vision and touch for such tasks, robotic approaches often prioritize one modality, therefore limiting adaptability. This paper introduces Flow Before Imitation (FBI), a visuotactile imitation learning framework that dynamically fuses tactile interactions with visual observations through motion dynamics. Unlike prior static fusion methods, FBI establishes a causal link between tactile signals and object motion via a dynamics-aware latent model. FBI employs a transformer-based interaction module to fuse flow-derived tactile features with visual inputs, training a one-step diffusion policy for real-time execution. Extensive experiments demonstrate that the proposed method outperforms the baseline methods in both simulation and the real world on two customized in-hand manipulation tasks and three standard dexterous manipulation tasks. Code, models, and more results are available in the website https://sites.google.com/view/dex-fbi.

View paper on

Share this with someone who'll enjoy it:

Title:FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy

Paper and Code