Zuxuan Wu

FreeLoRA: Enabling Training-Free LoRA Fusion for Autoregressive Multi-Subject Personalization

Jul 02, 2025
Via arXiv

Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis

Jul 02, 2025
Via arXiv

Generalized Trajectory Scoring for End-to-end Multimodal Planning

Jun 07, 2025
Via arXiv

DriveSuprim: Towards Precise Trajectory Selection for End-to-End Planning

Jun 07, 2025
Via arXiv

CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

May 25, 2025
Via arXiv

Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities

May 23, 2025
Via arXiv

ViaRL: Adaptive Temporal Grounding via Visual Iterated Amplification Reinforcement Learning

May 21, 2025
Via arXiv

UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation

May 20, 2025
Via arXiv

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Apr 26, 2025
Via arXiv

SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL

Apr 15, 2025
Via arXiv