Asuman Ozdaglar

Population-Proportional Preference Learning from Human Feedback: An Axiomatic Approach

Jun 05, 2025

What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization

May 27, 2025

UFT: Unifying Supervised and Reinforcement Fine-Tuning

May 22, 2025

Differentially Private Equilibrium Finding in Polymatrix Games

Mar 12, 2025

MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning

Feb 25, 2025

Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Sep 02, 2024

A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence

Aug 01, 2024

Finite-Sample Guarantees for Best-Response Learning Dynamics in Zero-Sum Matrix Games

Jul 29, 2024

LiteEFG: An Efficient Python Library for Solving Extensive-form Games

Jul 29, 2024

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback

May 20, 2024