Grégoire Ouerdane

Benchmarking Reinforcement Learning via Stochastic Converse Optimality: Generating Systems with Known Optimal Policies

Mar 18, 2026
Via arXiv