Picture for Hongyan Xie

Hongyan Xie

UniARM: Towards a Unified Autoregressive Reward Model for Multi-Objective Test-Time Alignment

Add code
Feb 10, 2026
Viaarxiv icon

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Add code
Feb 09, 2026
Viaarxiv icon

Real-Time Aligned Reward Model beyond Semantics

Add code
Jan 30, 2026
Viaarxiv icon