Brahim Driss

Performative Policy Gradient: Optimality in Performative Reinforcement Learning

Dec 23, 2025
Via arXiv

PB$^2$: Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning

Jun 16, 2025
Via arXiv

Deep Reinforcement Learning for 5×5 Multiplayer Go

May 23, 2024
Via arXiv