Picture for Martin J. Wainwright

Martin J. Wainwright

Optimal policy evaluation using kernel-based temporal difference methods

Add code
Sep 24, 2021
Figure 1 for Optimal policy evaluation using kernel-based temporal difference methods
Figure 2 for Optimal policy evaluation using kernel-based temporal difference methods
Figure 3 for Optimal policy evaluation using kernel-based temporal difference methods
Figure 4 for Optimal policy evaluation using kernel-based temporal difference methods
Viaarxiv icon

Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning

Add code
Aug 19, 2021
Viaarxiv icon

Near-optimal inference in adaptive linear regression

Add code
Jul 14, 2021
Figure 1 for Near-optimal inference in adaptive linear regression
Figure 2 for Near-optimal inference in adaptive linear regression
Figure 3 for Near-optimal inference in adaptive linear regression
Figure 4 for Near-optimal inference in adaptive linear regression
Viaarxiv icon

Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

Add code
Jun 28, 2021
Figure 1 for Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning
Viaarxiv icon

Preference learning along multiple criteria: A game-theoretic perspective

Add code
May 05, 2021
Figure 1 for Preference learning along multiple criteria: A game-theoretic perspective
Figure 2 for Preference learning along multiple criteria: A game-theoretic perspective
Figure 3 for Preference learning along multiple criteria: A game-theoretic perspective
Figure 4 for Preference learning along multiple criteria: A game-theoretic perspective
Viaarxiv icon

Minimax Off-Policy Evaluation for Multi-Armed Bandits

Add code
Jan 19, 2021
Figure 1 for Minimax Off-Policy Evaluation for Multi-Armed Bandits
Figure 2 for Minimax Off-Policy Evaluation for Multi-Armed Bandits
Figure 3 for Minimax Off-Policy Evaluation for Multi-Armed Bandits
Figure 4 for Minimax Off-Policy Evaluation for Multi-Armed Bandits
Viaarxiv icon

Optimal oracle inequalities for solving projected fixed-point equations

Add code
Dec 09, 2020
Figure 1 for Optimal oracle inequalities for solving projected fixed-point equations
Figure 2 for Optimal oracle inequalities for solving projected fixed-point equations
Figure 3 for Optimal oracle inequalities for solving projected fixed-point equations
Viaarxiv icon

ROOT-SGD: Sharp Nonasymptotics and Asymptotic Efficiency in a Single Algorithm

Add code
Aug 28, 2020
Figure 1 for ROOT-SGD: Sharp Nonasymptotics and Asymptotic Efficiency in a Single Algorithm
Viaarxiv icon

Revisiting complexity and the bias-variance tradeoff

Add code
Jun 17, 2020
Figure 1 for Revisiting complexity and the bias-variance tradeoff
Figure 2 for Revisiting complexity and the bias-variance tradeoff
Figure 3 for Revisiting complexity and the bias-variance tradeoff
Figure 4 for Revisiting complexity and the bias-variance tradeoff
Viaarxiv icon

Instability, Computational Efficiency and Statistical Accuracy

Add code
May 22, 2020
Figure 1 for Instability, Computational Efficiency and Statistical Accuracy
Figure 2 for Instability, Computational Efficiency and Statistical Accuracy
Figure 3 for Instability, Computational Efficiency and Statistical Accuracy
Figure 4 for Instability, Computational Efficiency and Statistical Accuracy
Viaarxiv icon