Picture for Alekh Agarwal

Alekh Agarwal

Practical Contextual Bandits with Regression Oracles

Add code
Mar 03, 2018
Figure 1 for Practical Contextual Bandits with Regression Oracles
Figure 2 for Practical Contextual Bandits with Regression Oracles
Figure 3 for Practical Contextual Bandits with Regression Oracles
Viaarxiv icon

Active Learning for Cost-Sensitive Classification

Add code
Nov 13, 2017
Figure 1 for Active Learning for Cost-Sensitive Classification
Figure 2 for Active Learning for Cost-Sensitive Classification
Figure 3 for Active Learning for Cost-Sensitive Classification
Figure 4 for Active Learning for Cost-Sensitive Classification
Viaarxiv icon

Optimal and Adaptive Off-policy Evaluation in Contextual Bandits

Add code
Nov 11, 2017
Figure 1 for Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Figure 2 for Optimal and Adaptive Off-policy Evaluation in Contextual Bandits
Viaarxiv icon

Off-policy evaluation for slate recommendation

Add code
Nov 06, 2017
Figure 1 for Off-policy evaluation for slate recommendation
Figure 2 for Off-policy evaluation for slate recommendation
Figure 3 for Off-policy evaluation for slate recommendation
Figure 4 for Off-policy evaluation for slate recommendation
Viaarxiv icon

Efficient Second Order Online Learning by Sketching

Add code
Oct 17, 2017
Figure 1 for Efficient Second Order Online Learning by Sketching
Figure 2 for Efficient Second Order Online Learning by Sketching
Figure 3 for Efficient Second Order Online Learning by Sketching
Figure 4 for Efficient Second Order Online Learning by Sketching
Viaarxiv icon

Corralling a Band of Bandit Algorithms

Add code
Jun 06, 2017
Figure 1 for Corralling a Band of Bandit Algorithms
Viaarxiv icon

Making Contextual Decisions with Low Technical Debt

Add code
May 09, 2017
Figure 1 for Making Contextual Decisions with Low Technical Debt
Figure 2 for Making Contextual Decisions with Low Technical Debt
Figure 3 for Making Contextual Decisions with Low Technical Debt
Figure 4 for Making Contextual Decisions with Low Technical Debt
Viaarxiv icon

Contextual Decision Processes with Low Bellman Rank are PAC-Learnable

Add code
Dec 01, 2016
Figure 1 for Contextual Decision Processes with Low Bellman Rank are PAC-Learnable
Figure 2 for Contextual Decision Processes with Low Bellman Rank are PAC-Learnable
Viaarxiv icon

Contextual Semibandits via Supervised Learning Oracles

Add code
Nov 04, 2016
Figure 1 for Contextual Semibandits via Supervised Learning Oracles
Figure 2 for Contextual Semibandits via Supervised Learning Oracles
Viaarxiv icon

PAC Reinforcement Learning with Rich Observations

Add code
Oct 28, 2016
Figure 1 for PAC Reinforcement Learning with Rich Observations
Viaarxiv icon