Picture for Deli Zhao

Deli Zhao

GP3: A 3D Geometry-Aware Policy with Multi-View Images for Robotic Manipulation

Add code
Sep 19, 2025
Viaarxiv icon

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Add code
Sep 18, 2025
Viaarxiv icon

RynnEC: Bringing MLLMs into Embodied World

Add code
Aug 19, 2025
Viaarxiv icon

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Add code
Aug 12, 2025
Viaarxiv icon

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Add code
Jul 30, 2025
Figure 1 for VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Figure 2 for VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Figure 3 for VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Figure 4 for VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Viaarxiv icon

DiffSpectra: Molecular Structure Elucidation from Spectra using Diffusion Models

Add code
Jul 09, 2025
Viaarxiv icon

WorldVLA: Towards Autoregressive Action World Model

Add code
Jun 26, 2025
Viaarxiv icon

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Add code
Jun 08, 2025
Viaarxiv icon

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

Add code
Jun 05, 2025
Viaarxiv icon

STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs

Add code
May 26, 2025
Viaarxiv icon