
Powei Chang

SALT: When More Rollouts Don't Help in Group-Based Policy Optimization and How to Make Them Matter

Jun 04, 2026
Via arXiv

SPICE: Submodular Penalized Information-Conflict Selection for Efficient Large Language Model Training

Jan 30, 2026
Via arXiv