Isaac Miller

Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs

Aug 06, 2025
Via arXiv