Chenyao Lu

Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models

Apr 11, 2026
Via arXiv

Reason Only When Needed: Efficient Generative Reward Modeling via Model-Internal Uncertainty

Apr 11, 2026
Via arXiv