Yuan Lu

Optimal Transport for LLM Reward Modeling from Noisy Preference

May 07, 2026 (via arXiv)

Foundation Models to Unlock Real-World Evidence from Nationwide Medical Claims

May 05, 2026 (via arXiv)

MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences

Mar 29, 2026 (via arXiv)

ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

Mar 24, 2026 (via arXiv)

Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects

Mar 20, 2026 (via arXiv)

CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks

Mar 19, 2026 (via arXiv)

GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows

Mar 12, 2026 (via arXiv)

Improving Diffusion Planners by Self-Supervised Action Gating with Energies

Mar 03, 2026 (via arXiv)

OmniGAIA: Towards Native Omni-Modal AI Agents

Feb 26, 2026 (via arXiv)

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Feb 09, 2026 (via arXiv)