Picture for Hengtong Lu

Hengtong Lu

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Add code
Aug 23, 2025
Viaarxiv icon

A Task-oriented Dialog Model with Task-progressive and Policy-aware Pre-training

Add code
Oct 01, 2023
Viaarxiv icon