Picture for Yi Wu

Yi Wu

How Far Are We from Optimal Reasoning Efficiency?

Add code
Jun 08, 2025
Viaarxiv icon

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Add code
May 30, 2025
Viaarxiv icon

StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation

Add code
May 26, 2025
Viaarxiv icon

What Can RL Bring to VLA Generalization? An Empirical Study

Add code
May 26, 2025
Viaarxiv icon

Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams

Add code
May 20, 2025
Viaarxiv icon

Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps

Add code
May 15, 2025
Viaarxiv icon

PaRT: Enhancing Proactive Social Chatbots with Personalized Real-Time Retrieval

Add code
Apr 29, 2025
Viaarxiv icon

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Add code
Mar 14, 2025
Viaarxiv icon

Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation

Add code
Mar 13, 2025
Viaarxiv icon

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization

Add code
Feb 07, 2025
Figure 1 for Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Figure 2 for Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Figure 3 for Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Figure 4 for Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
Viaarxiv icon