Picture for Xiaoxi Jiang

Xiaoxi Jiang

SiameseNorm: Breaking the Barrier to Reconciling Pre/Post-Norm

Add code
Feb 08, 2026
Viaarxiv icon

Eliminating Inductive Bias in Reward Models with Information-Theoretic Guidance

Add code
Dec 29, 2025
Viaarxiv icon

Search Self-play: Pushing the Frontier of Agent Capability without Supervision

Add code
Oct 21, 2025
Viaarxiv icon