Guang Chen

LiDARDraft: Generating LiDAR Point Cloud from Versatile Inputs

Dec 23, 2025

Point What You Mean: Visually Grounded Instruction Policy

Dec 22, 2025

TUMTraf EMOT: Event-Based Multi-Object Tracking Dataset and Baseline for Traffic Scenarios

Dec 20, 2025

MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning

Dec 16, 2025

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Nov 09, 2025

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks

Oct 22, 2025

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers

Oct 08, 2025

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Jul 02, 2025

FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design

Jun 16, 2025

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Jun 09, 2025