Hanbo Zhang

Robot Operation of Home Appliances by Reading User Manuals

May 26, 2025
Via arXiv

RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video

May 04, 2025
Via arXiv

MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning

Apr 05, 2025
Via arXiv

FUNCTO: Function-Centric One-Shot Imitation Learning for Tool Manipulation

Feb 17, 2025
Via arXiv

Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models

Dec 18, 2024
Via arXiv

REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds

Oct 12, 2024
Via arXiv

GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation

Oct 08, 2024
Via arXiv

DexDiff: Towards Extrinsic Dexterity Manipulation of Ungraspable Objects in Unrestricted Environments

Sep 09, 2024
Via arXiv

SInViG: A Self-Evolving Interactive Visual Agent for Human-Robot Interaction

Feb 20, 2024
Via arXiv

Towards Unified Interactive Visual Grounding in The Wild

Jan 30, 2024
Via arXiv