Picture for Xuanqi Lan

Xuanqi Lan

Latent Reward Steering: An Adaptive Inference-Time Framework that Implicitly Promotes Cognitive Behaviors in Reasoning LLMs

Add code
May 30, 2026
Viaarxiv icon

On the Role of Language Representations in Auto-Bidding: Findings and Implications

Add code
May 07, 2026
Viaarxiv icon