Alert button
Picture for Michael I. Jordan

Michael I. Jordan

Alert button

Polyak-Ruppert Averaged Q-Leaning is Statistically Efficient

Add code
Bookmark button
Alert button
Jan 23, 2022
Xiang Li, Wenhao Yang, Jiadong Liang, Zhihua Zhang, Michael I. Jordan

Figure 1 for Polyak-Ruppert Averaged Q-Leaning is Statistically Efficient
Viaarxiv icon

Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 21, 2022
Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan

Figure 1 for Instance-Dependent Confidence and Early Stopping for Reinforcement Learning
Figure 2 for Instance-Dependent Confidence and Early Stopping for Reinforcement Learning
Figure 3 for Instance-Dependent Confidence and Early Stopping for Reinforcement Learning
Figure 4 for Instance-Dependent Confidence and Early Stopping for Reinforcement Learning
Viaarxiv icon

Optimal variance-reduced stochastic approximation in Banach spaces

Add code
Bookmark button
Alert button
Jan 21, 2022
Wenlong Mou, Koulik Khamaru, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

Viaarxiv icon

Nonconvex Stochastic Scaled-Gradient Descent and Generalized Eigenvector Problems

Add code
Bookmark button
Alert button
Dec 29, 2021
Chris Junchi Li, Michael I. Jordan

Figure 1 for Nonconvex Stochastic Scaled-Gradient Descent and Generalized Eigenvector Problems
Figure 2 for Nonconvex Stochastic Scaled-Gradient Descent and Generalized Eigenvector Problems
Figure 3 for Nonconvex Stochastic Scaled-Gradient Descent and Generalized Eigenvector Problems
Viaarxiv icon

Last-Iterate Convergence of Saddle Point Optimizers via High-Resolution Differential Equations

Add code
Bookmark button
Alert button
Dec 27, 2021
Tatjana Chavdarova, Michael I. Jordan, Manolis Zampetakis

Figure 1 for Last-Iterate Convergence of Saddle Point Optimizers via High-Resolution Differential Equations
Figure 2 for Last-Iterate Convergence of Saddle Point Optimizers via High-Resolution Differential Equations
Figure 3 for Last-Iterate Convergence of Saddle Point Optimizers via High-Resolution Differential Equations
Figure 4 for Last-Iterate Convergence of Saddle Point Optimizers via High-Resolution Differential Equations
Viaarxiv icon

Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic

Add code
Bookmark button
Alert button
Dec 27, 2021
Yufeng Zhang, Siyu Chen, Zhuoran Yang, Michael I. Jordan, Zhaoran Wang

Viaarxiv icon

Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?

Add code
Bookmark button
Alert button
Dec 27, 2021
Han Zhong, Zhuoran Yang, Zhaoran Wang, Michael I. Jordan

Viaarxiv icon

Assessment of Treatment Effect Estimators for Heavy-Tailed Data

Add code
Bookmark button
Alert button
Dec 19, 2021
Nilesh Tripuraneni, Dhruv Madeka, Dean Foster, Dominique Perrault-Joncas, Michael I. Jordan

Figure 1 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 2 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 3 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 4 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Viaarxiv icon