Picture for Junxiang Jia

Junxiang Jia

ExTra: Exploratory Trajectory Optimization for Language Model Reinforcement Learning

Add code
Jun 23, 2026
Viaarxiv icon