Picture for Harry Mead

Harry Mead

Improving Regret Approximation for Unsupervised Dynamic Environment Generation

Add code
Jan 21, 2026
Viaarxiv icon

Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation

Add code
Apr 29, 2025
Viaarxiv icon