Martin J. Wainwright

Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces

Oct 20, 2022

QuTE: decentralized multiple testing on sensor networks with false discovery rate control

Oct 09, 2022

Off-policy estimation of linear functionals: Non-asymptotic theory for semi-parametric efficiency

Sep 26, 2022

Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning

Jun 01, 2022

Optimally tackling covariate shift in RKHS-based nonparametric regression

May 06, 2022

Bellman Residual Orthogonalization for Offline Reinforcement Learning

Mar 24, 2022

A new similarity measure for covariate shift with applications to nonparametric regression

Feb 06, 2022

Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

Jan 21, 2022

Optimal variance-reduced stochastic approximation in Banach spaces

Jan 21, 2022

Optimal and instance-dependent guarantees for Markovian linear stochastic approximation

Dec 23, 2021