Accurate in-hand pose estimation is crucial for robotic object manipulation, but visual occlusion remains a major challenge for vision-based approaches. This paper presents an approach to robotic in-hand object pose estimation that combines visual and tactile information to accurately determine the position and orientation of objects grasped by a robotic hand. To mitigate visual occlusion, we fuse visual information from a wrist-mounted RGB-D camera with tactile information from vision-based tactile sensors mounted on the fingertips of a robotic gripper. A weighting and sensor fusion module combines point clouds from the heterogeneous sensor types and controls each modality's contribution to the pose estimate, and an augmented Iterative Closest Point (ICP) algorithm adapted for weighted point clouds estimates the 6D object pose. Our experiments show that incorporating tactile information significantly improves pose estimation accuracy, particularly under heavy occlusion. Our method achieves an average pose estimation error of 7.5 mm and 16.7 degrees, outperforming vision-only baselines by up to 20%. We also demonstrate that our method enables precise object manipulation in a real-world insertion task.
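To make the weighted registration step concrete, the following is a minimal sketch of a weighted point-to-point ICP iteration, assuming per-point confidence weights produced by a fusion stage (e.g. higher weights for tactile points under heavy occlusion). The function name `weighted_icp` and its structure are illustrative assumptions, not the paper's implementation.

```python
# Illustrative weighted point-to-point ICP sketch (assumed structure,
# not the authors' implementation).
import numpy as np
from scipy.spatial import cKDTree


def weighted_icp(source, target, weights, iters=30):
    """Align `source` points (N,3) to `target` points (M,3).

    `weights` (N,) encodes per-point confidence, e.g. from a sensor
    fusion module. Returns a 4x4 homogeneous transform.
    """
    T = np.eye(4)
    src = source.copy()
    tree = cKDTree(target)
    w = weights / weights.sum()
    for _ in range(iters):
        # 1. Correspondences: nearest target point for each source point.
        _, idx = tree.query(src)
        tgt = target[idx]
        # 2. Weighted centroids of the corresponding clouds.
        c_src = (w[:, None] * src).sum(axis=0)
        c_tgt = (w[:, None] * tgt).sum(axis=0)
        # 3. Weighted cross-covariance and SVD (Kabsch) for the rotation.
        H = (src - c_src).T @ (w[:, None] * (tgt - c_tgt))
        U, _, Vt = np.linalg.svd(H)
        D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])
        R = Vt.T @ D @ U.T
        t = c_tgt - R @ c_src
        # 4. Apply the incremental transform and accumulate the pose.
        src = src @ R.T + t
        step = np.eye(4)
        step[:3, :3], step[:3, 3] = R, t
        T = step @ T
    return T
```

In this sketch, down-weighting occluded or noisy visual points shifts the weighted centroids and cross-covariance toward the more reliable tactile contacts, which is the intended effect of modality weighting in the fusion step.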