Eshwar S. R.

Reliable Critics: Monotonic Improvement and Convergence Guarantees for Reinforcement Learning

Jun 08, 2025
Via arXiv