Zuxuan Wu

Generalized Trajectory Scoring for End-to-end Multimodal Planning

Jun 07, 2025
Via arXiv

DriveSuprim: Towards Precise Trajectory Selection for End-to-End Planning

Jun 07, 2025
Via arXiv

CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

May 25, 2025
Via arXiv

Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities

May 23, 2025
Via arXiv

ViaRL: Adaptive Temporal Grounding via Visual Iterated Amplification Reinforcement Learning

May 21, 2025
Via arXiv

UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation

May 20, 2025
Via arXiv

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Apr 26, 2025
Via arXiv

SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL

Apr 15, 2025
Via arXiv

Aligning Anime Video Generation with Human Feedback

Apr 14, 2025
Via arXiv

DynamiCtrl: Rethinking the Basic Structure and the Role of Text for High-quality Human Image Animation

Mar 27, 2025
Via arXiv