Zuxuan Wu

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Sep 10, 2025
Via arXiv

Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives

Aug 20, 2025
Via arXiv

StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation

Aug 11, 2025
Via arXiv

Multimodal Referring Segmentation: A Survey

Aug 01, 2025
Via arXiv

Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis

Jul 02, 2025
Via arXiv

FreeLoRA: Enabling Training-Free LoRA Fusion for Autoregressive Multi-Subject Personalization

Jul 02, 2025
Via arXiv

DriveSuprim: Towards Precise Trajectory Selection for End-to-End Planning

Jun 07, 2025
Via arXiv

Generalized Trajectory Scoring for End-to-end Multimodal Planning

Jun 07, 2025
Via arXiv

CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

May 25, 2025
Via arXiv

Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities

May 23, 2025
Via arXiv