Picture for Tianyu Pang

Tianyu Pang

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Add code
Jun 09, 2026
Viaarxiv icon

Rethinking the Divergence Regularization in LLM RL

Add code
Jun 08, 2026
Viaarxiv icon

Balancing Learning Rates Across Layers: Exact Two-Step Dynamics and Optimal Scaling in Linear Neural Networks

Add code
May 29, 2026
Viaarxiv icon

Reinforcing Few-step Generators via Reward-Tilted Distribution Matching

Add code
May 28, 2026
Viaarxiv icon

Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization

Add code
May 27, 2026
Viaarxiv icon

Scalable Token-Level Hallucination Detection in Large Language Models

Add code
May 12, 2026
Viaarxiv icon

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Add code
May 06, 2026
Viaarxiv icon

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Add code
Apr 06, 2026
Viaarxiv icon

RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization

Add code
Mar 20, 2026
Viaarxiv icon

HTMuon: Improving Muon via Heavy-Tailed Spectral Correction

Add code
Mar 10, 2026
Viaarxiv icon