Picture for Yongbin Li

Yongbin Li

P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling

Add code
Feb 12, 2026
Viaarxiv icon

Beyond Quantity: Trajectory Diversity Scaling for Code Agents

Add code
Feb 03, 2026
Viaarxiv icon

ExpSeek: Self-Triggered Experience Seeking for Web Agents

Add code
Jan 13, 2026
Viaarxiv icon

Reward Modeling from Natural Language Human Feedback

Add code
Jan 12, 2026
Viaarxiv icon

Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions

Add code
Jan 12, 2026
Viaarxiv icon

EvoRoute: Experience-Driven Self-Routing LLM Agent Systems

Add code
Jan 06, 2026
Viaarxiv icon

RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure

Add code
Dec 27, 2025
Viaarxiv icon

Understanding Generalization in Role-Playing Models via Information Theory

Add code
Dec 19, 2025
Viaarxiv icon

MOA: Multi-Objective Alignment for Role-Playing Agents

Add code
Dec 10, 2025
Viaarxiv icon

Selective Weak-to-Strong Generalization

Add code
Nov 18, 2025
Viaarxiv icon