Ruimao Zhang

ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation

Apr 13, 2026
Via arXiv

Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation

Apr 07, 2026
Via arXiv

Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning

Mar 16, 2026
Via arXiv

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance

Jan 28, 2026
Via arXiv

Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge

Jan 26, 2026
Via arXiv

CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion

Jun 17, 2025
Via arXiv

WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning

Jun 09, 2025
Via arXiv

DFVO: Learning Darkness-free Visible and Infrared Image Disentanglement and Fusion All at Once

May 07, 2025
Via arXiv

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Mar 20, 2025
Via arXiv

DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

Mar 14, 2025
Via arXiv