Yuan Lu

Uncertainty-Aware Reward Modeling for Stable RLHF

Jun 18, 2026
Via arXiv

SafeRun: Enabling Determinism in LLM Planning for Running

Jun 08, 2026
Via arXiv

MINDGAMES: A Live Arena for Evaluating Social and Strategic Reasoning in Multi-Agent LLMs

May 28, 2026
Via arXiv

MMSkills: Towards Multimodal Skills for General Visual Agents

May 14, 2026
Via arXiv

AgentDisCo: Towards Disentanglement and Collaboration in Open-ended Deep Research Agents

May 12, 2026
Via arXiv

Optimal Transport for LLM Reward Modeling from Noisy Preference

May 07, 2026
Via arXiv

Foundation Models to Unlock Real-World Evidence from Nationwide Medical Claims

May 05, 2026
Via arXiv

MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences

Mar 29, 2026
Via arXiv

ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

Mar 24, 2026
Via arXiv

Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects

Mar 20, 2026
Via arXiv