Picture for Ping Luo

Ping Luo

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Add code
May 19, 2025
Viaarxiv icon

DanceGRPO: Unleashing GRPO on Visual Generation

Add code
May 12, 2025
Figure 1 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 2 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 3 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 4 for DanceGRPO: Unleashing GRPO on Visual Generation
Viaarxiv icon

UniVLA: Learning to Act Anywhere with Task-centric Latent Actions

Add code
May 09, 2025
Viaarxiv icon

PixelFlow: Pixel-Space Generative Models with Flow

Add code
Apr 10, 2025
Figure 1 for PixelFlow: Pixel-Space Generative Models with Flow
Figure 2 for PixelFlow: Pixel-Space Generative Models with Flow
Figure 3 for PixelFlow: Pixel-Space Generative Models with Flow
Figure 4 for PixelFlow: Pixel-Space Generative Models with Flow
Viaarxiv icon

Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models

Add code
Mar 19, 2025
Figure 1 for Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
Figure 2 for Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
Figure 3 for Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
Figure 4 for Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
Viaarxiv icon

Centaur: Robust End-to-End Autonomous Driving with Test-Time Training

Add code
Mar 14, 2025
Figure 1 for Centaur: Robust End-to-End Autonomous Driving with Test-Time Training
Figure 2 for Centaur: Robust End-to-End Autonomous Driving with Test-Time Training
Figure 3 for Centaur: Robust End-to-End Autonomous Driving with Test-Time Training
Figure 4 for Centaur: Robust End-to-End Autonomous Driving with Test-Time Training
Viaarxiv icon

JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data

Add code
Mar 13, 2025
Viaarxiv icon

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Add code
Mar 10, 2025
Viaarxiv icon

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Add code
Mar 09, 2025
Viaarxiv icon

VB-Com: Learning Vision-Blind Composite Humanoid Locomotion Against Deficient Perception

Add code
Feb 20, 2025
Viaarxiv icon