Haoqiang Fan

SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation

Nov 12, 2025

Running VLAs at Real-time Speed

Oct 30, 2025

MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation

Aug 26, 2025

ROSA: Harnessing Robot States for Vision-Language and Action Alignment

Jun 16, 2025

Grounding Beyond Detection: Enhancing Contextual Understanding in Embodied 3D Grounding

Jun 05, 2025

Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness

Apr 02, 2025

Multi-GraspLLM: A Multimodal LLM for Multi-Hand Semantic Guided Grasp Generation

Dec 11, 2024

FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation

Dec 06, 2024

RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World

Nov 29, 2024

RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator

Nov 18, 2024