Yuan Lu

ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

Mar 24, 2026
Via arXiv

Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects

Mar 20, 2026
Via arXiv

CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks

Mar 19, 2026
Via arXiv

GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows

Mar 12, 2026
Via arXiv

Improving Diffusion Planners by Self-Supervised Action Gating with Energies

Mar 03, 2026
Via arXiv

OmniGAIA: Towards Native Omni-Modal AI Agents

Feb 26, 2026
Via arXiv

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Feb 09, 2026
Via arXiv

Rewards as Labels: Revisiting RLVR from a Classification Perspective

Feb 05, 2026
Via arXiv

Deep Time-series Forecasting Needs Kernelized Moment Balancing

Jan 31, 2026
Via arXiv

UniPACT: A Multimodal Framework for Prognostic Question Answering on Raw ECG and Structured EHR

Jan 25, 2026
Via arXiv