Picture for Yunsheng Ma

Yunsheng Ma

ALN-P3: Unified Language Alignment for Perception, Prediction, and Planning in Autonomous Driving

Add code
May 21, 2025
Viaarxiv icon

LTDA-Drive: LLMs-guided Generative Models based Long-tail Data Augmentation for Autonomous Driving

Add code
May 21, 2025
Viaarxiv icon

NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models

Add code
Mar 17, 2025
Viaarxiv icon

On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation

Add code
Nov 17, 2024
Figure 1 for On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation
Figure 2 for On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation
Figure 3 for On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation
Figure 4 for On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation
Viaarxiv icon

MTA: Multimodal Task Alignment for BEV Perception and Captioning

Add code
Nov 16, 2024
Viaarxiv icon

Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving

Add code
Sep 16, 2024
Figure 1 for Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving
Figure 2 for Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving
Figure 3 for Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving
Figure 4 for Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving
Viaarxiv icon

MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs

Add code
Jun 24, 2024
Figure 1 for MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs
Figure 2 for MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs
Figure 3 for MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs
Figure 4 for MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs
Viaarxiv icon

What is the Visual Cognition Gap between Humans and Multimodal LLMs?

Add code
Jun 14, 2024
Viaarxiv icon

Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture

Add code
Apr 04, 2024
Figure 1 for Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture
Figure 2 for Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture
Figure 3 for Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture
Figure 4 for Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture
Viaarxiv icon

Spurious Correlations in Machine Learning: A Survey

Add code
Feb 20, 2024
Figure 1 for Spurious Correlations in Machine Learning: A Survey
Figure 2 for Spurious Correlations in Machine Learning: A Survey
Figure 3 for Spurious Correlations in Machine Learning: A Survey
Viaarxiv icon