Picture for Yuxiao Chen

Yuxiao Chen

RealDrive: Retrieval-Augmented Driving with Diffusion Models

Add code
May 30, 2025
Viaarxiv icon

Deployable and Generalizable Motion Prediction: Taxonomy, Open Challenges and Future Directions

Add code
May 14, 2025
Viaarxiv icon

LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation

Add code
Mar 18, 2025
Viaarxiv icon

STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion

Add code
Feb 10, 2025
Figure 1 for STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion
Figure 2 for STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion
Figure 3 for STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion
Figure 4 for STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion
Viaarxiv icon

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

Add code
Feb 05, 2025
Figure 1 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 2 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 3 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 4 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Viaarxiv icon

DreamDrive: Generative 4D Scene Modeling from Street View Images

Add code
Jan 03, 2025
Figure 1 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 2 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 3 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Figure 4 for DreamDrive: Generative 4D Scene Modeling from Street View Images
Viaarxiv icon

STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes

Add code
Dec 31, 2024
Figure 1 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 2 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 3 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Figure 4 for STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes
Viaarxiv icon

Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

Add code
Dec 05, 2024
Figure 1 for Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models
Figure 2 for Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models
Figure 3 for Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models
Figure 4 for Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models
Viaarxiv icon

Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection

Add code
Nov 17, 2024
Figure 1 for Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Figure 2 for Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Figure 3 for Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Figure 4 for Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Viaarxiv icon

Optimal Defenses Against Gradient Reconstruction Attacks

Add code
Nov 06, 2024
Viaarxiv icon