Picture for Anas Barakat

Anas Barakat

S2A, IDS, LTCI

Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training

Add code
Feb 26, 2026
Viaarxiv icon

Convex Markov Games and Beyond: New Proof of Existence, Characterization and Learning Algorithms for Nash Equilibria

Add code
Feb 12, 2026
Viaarxiv icon

Multi-Agent Online Control with Adversarial Disturbances

Add code
Jun 23, 2025
Viaarxiv icon

Optimistic Online Learning in Symmetric Cone Games

Add code
Apr 04, 2025
Figure 1 for Optimistic Online Learning in Symmetric Cone Games
Figure 2 for Optimistic Online Learning in Symmetric Cone Games
Viaarxiv icon

On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

Add code
Oct 05, 2024
Figure 1 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 2 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 3 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 4 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Viaarxiv icon

Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning

Add code
Oct 03, 2024
Figure 1 for Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
Figure 2 for Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
Figure 3 for Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
Figure 4 for Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning
Viaarxiv icon

Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players

Add code
Aug 15, 2024
Figure 1 for Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players
Viaarxiv icon

Policy Mirror Descent with Lookahead

Add code
Mar 21, 2024
Figure 1 for Policy Mirror Descent with Lookahead
Figure 2 for Policy Mirror Descent with Lookahead
Figure 3 for Policy Mirror Descent with Lookahead
Viaarxiv icon

Independent Learning in Constrained Markov Potential Games

Add code
Feb 27, 2024
Figure 1 for Independent Learning in Constrained Markov Potential Games
Figure 2 for Independent Learning in Constrained Markov Potential Games
Figure 3 for Independent Learning in Constrained Markov Potential Games
Figure 4 for Independent Learning in Constrained Markov Potential Games
Viaarxiv icon

Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity

Add code
Sep 08, 2023
Figure 1 for Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity
Figure 2 for Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity
Viaarxiv icon