Xu Cao

What is the Visual Cognition Gap between Humans and Multimodal LLMs?

Jun 14, 2024
Via arXiv

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

May 14, 2024
Via arXiv

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

Apr 10, 2024
Via arXiv

Spurious Correlations in Machine Learning: A Survey

Feb 20, 2024
Via arXiv

Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance

Feb 08, 2024
Via arXiv

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

Jan 08, 2024
Via arXiv

SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration

Dec 08, 2023
Via arXiv

LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs

Dec 07, 2023
Via arXiv

A Survey on Multimodal Large Language Models for Autonomous Driving

Nov 21, 2023
Via arXiv

MACP: Efficient Model Adaptation for Cooperative Perception

Nov 07, 2023
Via arXiv