Picture for Junchi Yan

Junchi Yan

SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence

Add code
Jun 09, 2025
Viaarxiv icon

VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Add code
May 29, 2025
Viaarxiv icon

On the Role of Label Noise in the Feature Learning Process

Add code
May 25, 2025
Viaarxiv icon

KITINet: Kinetics Theory Inspired Network Architectures with PDE Simulation Approaches

Add code
May 23, 2025
Viaarxiv icon

Decoupled Geometric Parameterization and its Application in Deep Homography Estimation

Add code
May 22, 2025
Viaarxiv icon

DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving

Add code
May 22, 2025
Viaarxiv icon

Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)

Add code
May 22, 2025
Viaarxiv icon

Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space

Add code
May 22, 2025
Viaarxiv icon

New Evidence of the Two-Phase Learning Dynamics of Neural Networks

Add code
May 20, 2025
Viaarxiv icon

KO: Kinetics-inspired Neural Optimizer with PDE Simulation Approaches

Add code
May 20, 2025
Viaarxiv icon