Picture for Kevin Jamieson

Kevin Jamieson

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

Add code
Jul 27, 2023
Viaarxiv icon

Logarithmic Regret for Matrix Games against an Adversary with Noisy Bandit Feedback

Add code
Jun 22, 2023
Viaarxiv icon

LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning

Add code
Jun 16, 2023
Figure 1 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 2 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 3 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 4 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Viaarxiv icon

Optimal Exploration for Model-Based RL in Nonlinear Systems

Add code
Jun 15, 2023
Figure 1 for Optimal Exploration for Model-Based RL in Nonlinear Systems
Figure 2 for Optimal Exploration for Model-Based RL in Nonlinear Systems
Figure 3 for Optimal Exploration for Model-Based RL in Nonlinear Systems
Figure 4 for Optimal Exploration for Model-Based RL in Nonlinear Systems
Viaarxiv icon

Active Representation Learning for General Task Space with Applications in Robotics

Add code
Jun 15, 2023
Viaarxiv icon

Improved Active Multi-Task Representation Learning via Lasso

Add code
Jun 05, 2023
Figure 1 for Improved Active Multi-Task Representation Learning via Lasso
Figure 2 for Improved Active Multi-Task Representation Learning via Lasso
Figure 3 for Improved Active Multi-Task Representation Learning via Lasso
Viaarxiv icon

Large-Scale Package Manipulation via Learned Metrics of Pick Success

Add code
May 17, 2023
Figure 1 for Large-Scale Package Manipulation via Learned Metrics of Pick Success
Figure 2 for Large-Scale Package Manipulation via Learned Metrics of Pick Success
Figure 3 for Large-Scale Package Manipulation via Learned Metrics of Pick Success
Figure 4 for Large-Scale Package Manipulation via Learned Metrics of Pick Success
Viaarxiv icon

Instance-dependent Sample Complexity Bounds for Zero-sum Matrix Games

Add code
Mar 19, 2023
Viaarxiv icon

Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design

Add code
Jul 06, 2022
Viaarxiv icon

Instance-optimal PAC Algorithms for Contextual Bandits

Add code
Jul 05, 2022
Figure 1 for Instance-optimal PAC Algorithms for Contextual Bandits
Viaarxiv icon