Picture for Nathan Kallus

Nathan Kallus

Near-Optimal Non-Parametric Sequential Tests and Confidence Sequences with Possibly Dependent Observations

Add code
Dec 29, 2022
Viaarxiv icon

A Review of Off-Policy Evaluation in Reinforcement Learning

Add code
Dec 13, 2022
Viaarxiv icon

The Implicit Delta Method

Add code
Nov 11, 2022
Viaarxiv icon

Provable Safe Reinforcement Learning with Binary Feedback

Add code
Oct 26, 2022
Figure 1 for Provable Safe Reinforcement Learning with Binary Feedback
Figure 2 for Provable Safe Reinforcement Learning with Binary Feedback
Figure 3 for Provable Safe Reinforcement Learning with Binary Feedback
Figure 4 for Provable Safe Reinforcement Learning with Binary Feedback
Viaarxiv icon

Debiased Inference on Identified Linear Functionals of Underidentified Nuisances via Penalized Minimax Estimation

Add code
Aug 17, 2022
Figure 1 for Debiased Inference on Identified Linear Functionals of Underidentified Nuisances via Penalized Minimax Estimation
Figure 2 for Debiased Inference on Identified Linear Functionals of Underidentified Nuisances via Penalized Minimax Estimation
Figure 3 for Debiased Inference on Identified Linear Functionals of Underidentified Nuisances via Penalized Minimax Estimation
Viaarxiv icon

Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

Add code
Jul 26, 2022
Figure 1 for Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Figure 2 for Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Figure 3 for Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Figure 4 for Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Viaarxiv icon

Learning Bellman Complete Representations for Offline Policy Evaluation

Add code
Jul 12, 2022
Figure 1 for Learning Bellman Complete Representations for Offline Policy Evaluation
Figure 2 for Learning Bellman Complete Representations for Offline Policy Evaluation
Figure 3 for Learning Bellman Complete Representations for Offline Policy Evaluation
Figure 4 for Learning Bellman Complete Representations for Offline Policy Evaluation
Viaarxiv icon

Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings

Add code
Jun 24, 2022
Figure 1 for Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
Figure 2 for Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
Viaarxiv icon

Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems

Add code
Jun 24, 2022
Figure 1 for Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
Figure 2 for Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
Viaarxiv icon

Robust and Agnostic Learning of Conditional Distributional Treatment Effects

Add code
May 23, 2022
Figure 1 for Robust and Agnostic Learning of Conditional Distributional Treatment Effects
Figure 2 for Robust and Agnostic Learning of Conditional Distributional Treatment Effects
Figure 3 for Robust and Agnostic Learning of Conditional Distributional Treatment Effects
Viaarxiv icon