Picture for Asuman Ozdaglar

Asuman Ozdaglar

How AI Aggregation Affects Knowledge

Add code
Apr 06, 2026
Viaarxiv icon

Online Learning and Equilibrium Computation with Ranking Feedback

Add code
Mar 19, 2026
Viaarxiv icon

Collaborative and Efficient Fine-tuning: Leveraging Task Similarity

Add code
Feb 06, 2026
Viaarxiv icon

Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach

Add code
Nov 06, 2025
Viaarxiv icon

Population-Proportional Preference Learning from Human Feedback: An Axiomatic Approach

Add code
Jun 05, 2025
Figure 1 for Population-Proportional Preference Learning from Human Feedback: An Axiomatic Approach
Figure 2 for Population-Proportional Preference Learning from Human Feedback: An Axiomatic Approach
Figure 3 for Population-Proportional Preference Learning from Human Feedback: An Axiomatic Approach
Figure 4 for Population-Proportional Preference Learning from Human Feedback: An Axiomatic Approach
Viaarxiv icon

What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization

Add code
May 27, 2025
Figure 1 for What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization
Figure 2 for What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization
Viaarxiv icon

UFT: Unifying Supervised and Reinforcement Fine-Tuning

Add code
May 22, 2025
Figure 1 for UFT: Unifying Supervised and Reinforcement Fine-Tuning
Figure 2 for UFT: Unifying Supervised and Reinforcement Fine-Tuning
Figure 3 for UFT: Unifying Supervised and Reinforcement Fine-Tuning
Figure 4 for UFT: Unifying Supervised and Reinforcement Fine-Tuning
Viaarxiv icon

Differentially Private Equilibrium Finding in Polymatrix Games

Add code
Mar 12, 2025
Figure 1 for Differentially Private Equilibrium Finding in Polymatrix Games
Figure 2 for Differentially Private Equilibrium Finding in Polymatrix Games
Figure 3 for Differentially Private Equilibrium Finding in Polymatrix Games
Figure 4 for Differentially Private Equilibrium Finding in Polymatrix Games
Viaarxiv icon

MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning

Add code
Feb 25, 2025
Figure 1 for MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning
Figure 2 for MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning
Figure 3 for MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning
Figure 4 for MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning
Viaarxiv icon

Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Add code
Sep 02, 2024
Figure 1 for Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games
Viaarxiv icon