Leonardo Cesani

How Log-Barrier Helps Exploration in Policy Optimization

Mar 16, 2026
Via arXiv

Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes

Jun 06, 2025
Via arXiv