Picture for Ao Lu

Ao Lu

Large Language Models Explore by Latent Distilling

Add code
Apr 27, 2026
Viaarxiv icon

Grad2Reward: From Sparse Judgment to Dense Rewards for Improving Open-Ended LLM Reasoning

Add code
Feb 02, 2026
Viaarxiv icon