Picture for Guang Chen

Guang Chen

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Add code
Jun 09, 2025
Viaarxiv icon

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency

Add code
Jun 09, 2025
Viaarxiv icon

UrbanCraft: Urban View Extrapolation via Hierarchical Sem-Geometric Priors

Add code
May 29, 2025
Viaarxiv icon

AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving

Add code
May 21, 2025
Viaarxiv icon

Vidi: Large Multimodal Models for Video Understanding and Editing

Add code
Apr 22, 2025
Viaarxiv icon

Beyond Intermediate States: Explaining Visual Redundancy through Language

Add code
Mar 26, 2025
Viaarxiv icon

ChatBEV: A Visual Language Model that Understands BEV Maps

Add code
Mar 21, 2025
Viaarxiv icon

EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models

Add code
Mar 14, 2025
Viaarxiv icon

Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition

Add code
Feb 17, 2025
Viaarxiv icon

Generative Multi-Agent Collaboration in Embodied AI: A Systematic Review

Add code
Feb 17, 2025
Viaarxiv icon