Nathan Kallus

Minimax Instrumental Variable Regression and $L_2$ Convergence Guarantees without Identification or Closedness

Feb 10, 2023
Andrew Bennett, Nathan Kallus, Xiaojie Mao, Whitney Newey, Vasilis Syrgkanis, Masatoshi Uehara

Via arXiv

Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR

Feb 07, 2023
Kaiwen Wang, Nathan Kallus, Wen Sun

Via arXiv

Refined Value-Based Offline RL under Realizability and Partial Coverage

Feb 05, 2023
Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun

Via arXiv

Smooth Non-Stationary Bandits

Jan 29, 2023
Su Jia, Qian Xie, Nathan Kallus, Peter I. Frazier

Via arXiv

Near-Optimal Non-Parametric Sequential Tests and Confidence Sequences with Possibly Dependent Observations

Dec 29, 2022
Aurelien Bibaut, Nathan Kallus, Michael Lindon

Via arXiv

A Review of Off-Policy Evaluation in Reinforcement Learning

Dec 13, 2022
Masatoshi Uehara, Chengchun Shi, Nathan Kallus

Via arXiv

The Implicit Delta Method

Nov 11, 2022
Nathan Kallus, James McInerney

Via arXiv

Provable Safe Reinforcement Learning with Binary Feedback

Oct 26, 2022
Andrew Bennett, Dipendra Misra, Nathan Kallus

Via arXiv

Debiased Inference on Identified Linear Functionals of Underidentified Nuisances via Penalized Minimax Estimation

Aug 17, 2022
Nathan Kallus, Xiaojie Mao

Via arXiv

Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

Jul 26, 2022
Masatoshi Uehara, Haruka Kiyohara, Andrew Bennett, Victor Chernozhukov, Nan Jiang, Nathan Kallus, Chengchun Shi, Wen Sun

Via arXiv