Picture for Hangcheng Zhu

Hangcheng Zhu

Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks

Add code
Apr 03, 2026
Viaarxiv icon

SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

Add code
Mar 23, 2026
Viaarxiv icon

SERL: Self-Examining Reinforcement Learning on Open-Domain

Add code
Nov 18, 2025
Viaarxiv icon