Picture for Letian Wang

Letian Wang

DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving

Add code
Jan 04, 2026
Viaarxiv icon

OmniGen: Unified Multimodal Sensor Generation for Autonomous Driving

Add code
Dec 16, 2025
Viaarxiv icon

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Add code
Nov 18, 2025
Viaarxiv icon

ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting

Add code
Aug 09, 2025
Figure 1 for ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Figure 2 for ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Figure 3 for ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Figure 4 for ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Viaarxiv icon

OpenNav: Open-World Navigation with Multimodal Large Language Models

Add code
Jul 24, 2025
Figure 1 for OpenNav: Open-World Navigation with Multimodal Large Language Models
Figure 2 for OpenNav: Open-World Navigation with Multimodal Large Language Models
Figure 3 for OpenNav: Open-World Navigation with Multimodal Large Language Models
Figure 4 for OpenNav: Open-World Navigation with Multimodal Large Language Models
Viaarxiv icon

Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions

Add code
May 14, 2025
Figure 1 for Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions
Figure 2 for Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions
Figure 3 for Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions
Figure 4 for Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions
Viaarxiv icon

SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction

Add code
Oct 11, 2024
Figure 1 for SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
Figure 2 for SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
Figure 3 for SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
Figure 4 for SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
Viaarxiv icon

DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

Add code
Jun 17, 2024
Figure 1 for DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features
Figure 2 for DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features
Figure 3 for DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features
Figure 4 for DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features
Viaarxiv icon

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

Add code
Mar 25, 2024
Figure 1 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 2 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 3 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 4 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Viaarxiv icon

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

Add code
Mar 19, 2024
Figure 1 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 2 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 3 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 4 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Viaarxiv icon