Picture for Hailun Lu

Hailun Lu

Advantage Collapse in Group Relative Policy Optimization: Diagnosis and Mitigation

Add code
May 20, 2026
Viaarxiv icon

Efficient Hallucination Detection: Adaptive Bayesian Estimation of Semantic Entropy with Guided Semantic Exploration

Add code
Mar 24, 2026
Viaarxiv icon