Weiyao Wang

Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success

Jun 12, 2025

Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models

May 22, 2025

HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models

Mar 24, 2025

VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment

Jan 03, 2025

OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB

Oct 09, 2024

ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation

Aug 16, 2024

Adapting Image-based RL Policies via Predicted Rewards

Jul 23, 2024

Domain Adaptation of Visual Policies with a Single Demonstration

Jul 23, 2024

3x2: 3D Object Part Segmentation by 2D Semantic Correspondences

Jul 12, 2024

VIHE: Virtual In-Hand Eye Transformer for 3D Robotic Manipulation

Mar 19, 2024