Picture for Michael I. Jordan

Michael I. Jordan

Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons

Add code
Jan 30, 2023
Figure 1 for Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons
Figure 2 for Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons
Viaarxiv icon

Online Learning in Stackelberg Games with an Omniscient Follower

Add code
Jan 27, 2023
Viaarxiv icon

Incentive-Aware Recommender Systems in Two-Sided Markets

Add code
Nov 23, 2022
Viaarxiv icon

The Sample Complexity of Online Contract Design

Add code
Nov 10, 2022
Viaarxiv icon

Nesterov Meets Optimism: Rate-Optimal Optimistic-Gradient-Based Method for Stochastic Bilinearly-Coupled Minimax Optimization

Add code
Oct 31, 2022
Figure 1 for Nesterov Meets Optimism: Rate-Optimal Optimistic-Gradient-Based Method for Stochastic Bilinearly-Coupled Minimax Optimization
Figure 2 for Nesterov Meets Optimism: Rate-Optimal Optimistic-Gradient-Based Method for Stochastic Bilinearly-Coupled Minimax Optimization
Figure 3 for Nesterov Meets Optimism: Rate-Optimal Optimistic-Gradient-Based Method for Stochastic Bilinearly-Coupled Minimax Optimization
Figure 4 for Nesterov Meets Optimism: Rate-Optimal Optimistic-Gradient-Based Method for Stochastic Bilinearly-Coupled Minimax Optimization
Viaarxiv icon

Revisiting the ACVI Method for Constrained Variational Inequalities

Add code
Oct 27, 2022
Figure 1 for Revisiting the ACVI Method for Constrained Variational Inequalities
Figure 2 for Revisiting the ACVI Method for Constrained Variational Inequalities
Figure 3 for Revisiting the ACVI Method for Constrained Variational Inequalities
Figure 4 for Revisiting the ACVI Method for Constrained Variational Inequalities
Viaarxiv icon

Explicit Second-Order Min-Max Optimization Methods with Optimal Convergence Guarantee

Add code
Oct 23, 2022
Viaarxiv icon

On-Demand Sampling: Learning Optimally from Multiple Distributions

Add code
Oct 22, 2022
Viaarxiv icon

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

Add code
Oct 19, 2022
Figure 1 for A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
Viaarxiv icon

QuTE: decentralized multiple testing on sensor networks with false discovery rate control

Add code
Oct 09, 2022
Figure 1 for QuTE: decentralized multiple testing on sensor networks with false discovery rate control
Figure 2 for QuTE: decentralized multiple testing on sensor networks with false discovery rate control
Figure 3 for QuTE: decentralized multiple testing on sensor networks with false discovery rate control
Figure 4 for QuTE: decentralized multiple testing on sensor networks with false discovery rate control
Viaarxiv icon