Shaoyu Chen

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Dec 09, 2025
Via arXiv

DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving

Dec 08, 2025
Via arXiv

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Jun 16, 2025
Via arXiv

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Mar 10, 2025
Via arXiv

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Feb 18, 2025
Via arXiv

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

Nov 22, 2024
Via arXiv

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Oct 29, 2024
Via arXiv

ARTiST: Automated Text Simplification for Task Guidance in Augmented Reality

Feb 29, 2024
Via arXiv

VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning

Feb 20, 2024
Via arXiv

MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction

Aug 10, 2023
Via arXiv