Picture for Tianyu Pang

Tianyu Pang

FlowReasoner: Reinforcing Query-Level Meta-Agents

Add code
Apr 21, 2025
Viaarxiv icon

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Add code
Apr 17, 2025
Viaarxiv icon

Efficient Process Reward Model Training via Active Learning

Add code
Apr 14, 2025
Viaarxiv icon

Understanding R1-Zero-Like Training: A Critical Perspective

Add code
Mar 26, 2025
Viaarxiv icon

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Add code
Mar 19, 2025
Viaarxiv icon

LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification

Add code
Feb 24, 2025
Viaarxiv icon

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Add code
Feb 18, 2025
Viaarxiv icon

Improving Your Model Ranking on Chatbot Arena by Vote Rigging

Add code
Jan 29, 2025
Viaarxiv icon

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

Add code
Dec 24, 2024
Viaarxiv icon

Real-time Identity Defenses against Malicious Personalization of Diffusion Models

Add code
Dec 13, 2024
Viaarxiv icon