Picture for Zhiqi Li

Zhiqi Li

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Add code
Jun 11, 2024
Figure 1 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 2 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 3 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 4 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Viaarxiv icon

Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting

Add code
Mar 19, 2024
Figure 1 for Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
Figure 2 for Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
Figure 3 for Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
Figure 4 for Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
Viaarxiv icon

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding

Add code
Mar 14, 2024
Figure 1 for Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Figure 2 for Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Figure 3 for Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Figure 4 for Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Viaarxiv icon

Improving Group Connectivity for Generalization of Federated Deep Learning

Add code
Feb 29, 2024
Viaarxiv icon

Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion

Add code
Feb 02, 2024
Viaarxiv icon

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

Add code
Jan 11, 2024
Viaarxiv icon

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Add code
Dec 25, 2023
Figure 1 for DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Figure 2 for DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Figure 3 for DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Figure 4 for DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Viaarxiv icon

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

Add code
Dec 05, 2023
Viaarxiv icon

MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation

Add code
Nov 27, 2023
Viaarxiv icon

ET3D: Efficient Text-to-3D Generation via Multi-View Distillation

Add code
Nov 27, 2023
Viaarxiv icon