Zhongdao Wang

LiteFusion: Taming 3D Object Detectors from Vision-Based to Multi-Modal with Minimal Adaptation

Dec 23, 2025

Grounding Everything in Tokens for Multimodal Large Language Models

Dec 11, 2025

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

Jun 17, 2025

Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Apr 20, 2025

Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation

Mar 11, 2025

Effective LLM Knowledge Learning via Model Generalization

Mar 05, 2025

Edit as You See: Image-guided Video Editing via Masked Motion Modeling

Jan 08, 2025

ReNeg: Learning Negative Embedding with Reward Guidance

Dec 27, 2024

Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning

Nov 12, 2024

QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model

Oct 10, 2024