Picture for Nathan Kallus

Nathan Kallus

Switching the Loss Reduces the Cost in Batch Reinforcement Learning

Add code
Mar 12, 2024
Figure 1 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 2 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 3 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 4 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Viaarxiv icon

Risk-Sensitive RL with Optimized Certainty Equivalents via Reduction to Standard RL

Add code
Mar 10, 2024
Viaarxiv icon

Is Cosine-Similarity of Embeddings Really About Similarity?

Add code
Mar 08, 2024
Figure 1 for Is Cosine-Similarity of Embeddings Really About Similarity?
Viaarxiv icon

Applied Causal Inference Powered by ML and AI

Add code
Mar 04, 2024
Figure 1 for Applied Causal Inference Powered by ML and AI
Figure 2 for Applied Causal Inference Powered by ML and AI
Figure 3 for Applied Causal Inference Powered by ML and AI
Figure 4 for Applied Causal Inference Powered by ML and AI
Viaarxiv icon

Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams

Add code
Feb 16, 2024
Figure 1 for Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams
Figure 2 for Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams
Figure 3 for Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams
Figure 4 for Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams
Viaarxiv icon

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning

Add code
Feb 11, 2024
Viaarxiv icon

Multi-Armed Bandits with Interference

Add code
Feb 02, 2024
Figure 1 for Multi-Armed Bandits with Interference
Viaarxiv icon

Faster Rates for Switchback Experiments

Add code
Dec 25, 2023
Figure 1 for Faster Rates for Switchback Experiments
Figure 2 for Faster Rates for Switchback Experiments
Figure 3 for Faster Rates for Switchback Experiments
Figure 4 for Faster Rates for Switchback Experiments
Viaarxiv icon

Low-Rank MDPs with Continuous Action Spaces

Add code
Nov 06, 2023
Viaarxiv icon

Off-Policy Evaluation for Large Action Spaces via Policy Convolution

Add code
Oct 24, 2023
Viaarxiv icon