Picture for Licheng Pan

Licheng Pan

Optimal Transport for LLM Reward Modeling from Noisy Preference

Add code
May 07, 2026
Viaarxiv icon

Robust Reward Modeling for Large Language Models via Causal Decomposition

Add code
Apr 16, 2026
Viaarxiv icon

ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

Add code
Mar 24, 2026
Viaarxiv icon

Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects

Add code
Mar 20, 2026
Viaarxiv icon

CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks

Add code
Mar 19, 2026
Viaarxiv icon

Analyzing and Improving Diffusion Models for Time-Series Data Imputation: A Proximal Recursion Perspective

Add code
Feb 01, 2026
Viaarxiv icon

A Causal Perspective for Enhancing Jailbreak Attack and Defense

Add code
Jan 31, 2026
Viaarxiv icon

Deep Time-series Forecasting Needs Kernelized Moment Balancing

Add code
Jan 31, 2026
Viaarxiv icon

Mixture of Low Rank Adaptation with Partial Parameter Sharing for Time Series Forecasting

Add code
May 23, 2025
Viaarxiv icon

Understanding and Mitigating Overrefusal in LLMs from an Unveiling Perspective of Safety Decision Boundary

Add code
May 23, 2025
Viaarxiv icon