Picture for Mingze Wu

Mingze Wu

Efficient Post-training of LLMs for Code Generation With Offline Reinforcement Learning

Add code
May 27, 2026
Viaarxiv icon