Chao Du

FlowReasoner: Reinforcing Query-Level Meta-Agents

Apr 21, 2025

UFO2: The Desktop AgentOS

Apr 20, 2025

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Apr 17, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Mar 26, 2025

LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification

Feb 24, 2025

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Feb 18, 2025

Improving Your Model Ranking on Chatbot Arena by Vote Rigging

Jan 29, 2025

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

Dec 24, 2024

Real-time Identity Defenses against Malicious Personalization of Diffusion Models

Dec 13, 2024

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Nov 20, 2024