Picture for Ulrich Armel Mbou Sob

Ulrich Armel Mbou Sob

Self-Supervised On-Policy Reinforcement Learning via Contrastive Proximal Policy Optimisation

Add code
May 13, 2026
Viaarxiv icon