Zhen Shu

ExTra: Exploratory Trajectory Optimization for Language Model Reinforcement Learning

Jun 23, 2026
Via arXiv