Picture for Zhiding Yu

Zhiding Yu

Exploring Camera Encoder Designs for Autonomous Driving Perception

Add code
Jul 09, 2024
Figure 1 for Exploring Camera Encoder Designs for Autonomous Driving Perception
Figure 2 for Exploring Camera Encoder Designs for Autonomous Driving Perception
Figure 3 for Exploring Camera Encoder Designs for Autonomous Driving Perception
Figure 4 for Exploring Camera Encoder Designs for Autonomous Driving Perception
Viaarxiv icon

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Add code
Jun 11, 2024
Figure 1 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 2 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 3 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 4 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Viaarxiv icon

X-VILA: Cross-Modality Alignment for Large Language Model

Add code
May 29, 2024
Viaarxiv icon

Memorize What Matters: Emergent Scene Decomposition from Multitraverse

Add code
May 29, 2024
Viaarxiv icon

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

Add code
May 02, 2024
Figure 1 for OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
Figure 2 for OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
Figure 3 for OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
Figure 4 for OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
Viaarxiv icon

What is Point Supervision Worth in Video Instance Segmentation?

Add code
Apr 01, 2024
Figure 1 for What is Point Supervision Worth in Video Instance Segmentation?
Figure 2 for What is Point Supervision Worth in Video Instance Segmentation?
Figure 3 for What is Point Supervision Worth in Video Instance Segmentation?
Figure 4 for What is Point Supervision Worth in Video Instance Segmentation?
Viaarxiv icon

LITA: Language Instructed Temporal-Localization Assistant

Add code
Mar 27, 2024
Figure 1 for LITA: Language Instructed Temporal-Localization Assistant
Figure 2 for LITA: Language Instructed Temporal-Localization Assistant
Figure 3 for LITA: Language Instructed Temporal-Localization Assistant
Figure 4 for LITA: Language Instructed Temporal-Localization Assistant
Viaarxiv icon

Improving Distant 3D Object Detection Using 2D Box Supervision

Add code
Mar 14, 2024
Figure 1 for Improving Distant 3D Object Detection Using 2D Box Supervision
Figure 2 for Improving Distant 3D Object Detection Using 2D Box Supervision
Figure 3 for Improving Distant 3D Object Detection Using 2D Box Supervision
Figure 4 for Improving Distant 3D Object Detection Using 2D Box Supervision
Viaarxiv icon

T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching

Add code
Feb 21, 2024
Viaarxiv icon

Fully Attentional Networks with Self-emerging Token Labeling

Add code
Jan 08, 2024
Figure 1 for Fully Attentional Networks with Self-emerging Token Labeling
Figure 2 for Fully Attentional Networks with Self-emerging Token Labeling
Figure 3 for Fully Attentional Networks with Self-emerging Token Labeling
Figure 4 for Fully Attentional Networks with Self-emerging Token Labeling
Viaarxiv icon