Ambuj Tewari

University of Texas

Online Boosting for Multilabel Ranking with Top-k Feedback

Nov 06, 2019

Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles

Oct 23, 2019

Thompson Sampling in Non-Episodic Restless Bandits

Oct 12, 2019

What You See May Not Be What You Get: UCB Bandit Algorithms Robust to ε-Contamination

Oct 12, 2019

Not All are Made Equal: Consistency of Weighted Averaging Estimators Under Active Learning

Oct 11, 2019

Regret Analysis of Causal Bandit Problems

Oct 11, 2019

Regret Bounds for Thompson Sampling in Restless Bandit Problems

May 29, 2019

Generalization Bounds in the Predict-then-Optimize Framework

May 27, 2019

Randomized Algorithms for Data-Driven Stabilization of Stochastic Linear Systems

May 16, 2019

Contextual Markov Decision Processes using Generalized Linear Models

Mar 14, 2019