Picture for Rihui Xin

Rihui Xin

Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers

Add code
May 26, 2025
Viaarxiv icon

Baichuan-M1: Pushing the Medical Capability of Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon