Picture for Zuxuan Wu

Zuxuan Wu

Fudan University

BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers

Add code
Mar 20, 2025
Viaarxiv icon

MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance

Add code
Mar 20, 2025
Viaarxiv icon

Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation

Add code
Mar 17, 2025
Viaarxiv icon

Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training

Add code
Mar 15, 2025
Viaarxiv icon

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning

Add code
Mar 11, 2025
Figure 1 for Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
Figure 2 for Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
Figure 3 for Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
Figure 4 for Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
Viaarxiv icon

Human2Robot: Learning Robot Actions from Paired Human-Robot Videos

Add code
Feb 23, 2025
Figure 1 for Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Figure 2 for Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Figure 3 for Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Figure 4 for Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Viaarxiv icon

Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning

Add code
Jan 23, 2025
Figure 1 for Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning
Figure 2 for Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning
Figure 3 for Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning
Figure 4 for Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning
Viaarxiv icon

FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients

Add code
Jan 21, 2025
Figure 1 for FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients
Figure 2 for FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients
Figure 3 for FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients
Figure 4 for FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients
Viaarxiv icon

FOCUS: Towards Universal Foreground Segmentation

Add code
Jan 09, 2025
Figure 1 for FOCUS: Towards Universal Foreground Segmentation
Figure 2 for FOCUS: Towards Universal Foreground Segmentation
Figure 3 for FOCUS: Towards Universal Foreground Segmentation
Figure 4 for FOCUS: Towards Universal Foreground Segmentation
Viaarxiv icon

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

Add code
Dec 24, 2024
Figure 1 for VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Figure 2 for VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Figure 3 for VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Figure 4 for VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
Viaarxiv icon