Alert button
Picture for Aleksandrs Slivkins

Aleksandrs Slivkins

Alert button

Sayer: Using Implicit Feedback to Optimize System Policies

Add code
Bookmark button
Alert button
Oct 28, 2021
Mathias Lécuyer, Sang Hoon Kim, Mihir Nanavati, Junchen Jiang, Siddhartha Sen, Amit Sharma, Aleksandrs Slivkins

Figure 1 for Sayer: Using Implicit Feedback to Optimize System Policies
Figure 2 for Sayer: Using Implicit Feedback to Optimize System Policies
Figure 3 for Sayer: Using Implicit Feedback to Optimize System Policies
Figure 4 for Sayer: Using Implicit Feedback to Optimize System Policies
Viaarxiv icon

Exploration and Incentives in Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 28, 2021
Max Simchowitz, Aleksandrs Slivkins

Viaarxiv icon

Competing Bandits: The Perils of Exploration Under Competition

Add code
Bookmark button
Alert button
Jul 20, 2020
Guy Aridor, Yishay Mansour, Aleksandrs Slivkins, Zhiwei Steven Wu

Figure 1 for Competing Bandits: The Perils of Exploration Under Competition
Figure 2 for Competing Bandits: The Perils of Exploration Under Competition
Figure 3 for Competing Bandits: The Perils of Exploration Under Competition
Figure 4 for Competing Bandits: The Perils of Exploration Under Competition
Viaarxiv icon

Adaptive Discretization for Adversarial Bandits with Continuous Action Spaces

Add code
Bookmark button
Alert button
Jun 22, 2020
Chara Podimata, Aleksandrs Slivkins

Viaarxiv icon

Efficient Contextual Bandits with Continuous Actions

Add code
Bookmark button
Alert button
Jun 10, 2020
Maryam Majzoubi, Chicheng Zhang, Rajan Chari, Akshay Krishnamurthy, John Langford, Aleksandrs Slivkins

Figure 1 for Efficient Contextual Bandits with Continuous Actions
Figure 2 for Efficient Contextual Bandits with Continuous Actions
Figure 3 for Efficient Contextual Bandits with Continuous Actions
Figure 4 for Efficient Contextual Bandits with Continuous Actions
Viaarxiv icon

Constrained episodic reinforcement learning in concave-convex and knapsack settings

Add code
Bookmark button
Alert button
Jun 09, 2020
Kianté Brantley, Miroslav Dudik, Thodoris Lykouris, Sobhan Miryoosefi, Max Simchowitz, Aleksandrs Slivkins, Wen Sun

Figure 1 for Constrained episodic reinforcement learning in concave-convex and knapsack settings
Figure 2 for Constrained episodic reinforcement learning in concave-convex and knapsack settings
Viaarxiv icon

Greedy Algorithm almost Dominates in Smoothed Contextual Bandits

Add code
Bookmark button
Alert button
May 19, 2020
Manish Raghavan, Aleksandrs Slivkins, Jennifer Wortman Vaughan, Zhiwei Steven Wu

Viaarxiv icon

Sample Complexity of Incentivized Exploration

Add code
Bookmark button
Alert button
Feb 03, 2020
Mark Sellke, Aleksandrs Slivkins

Viaarxiv icon

Advances in Bandits with Knapsacks

Add code
Bookmark button
Alert button
Feb 01, 2020
Karthik Abinav Sankararaman, Aleksandrs Slivkins

Viaarxiv icon

Corruption Robust Exploration in Episodic Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 20, 2019
Thodoris Lykouris, Max Simchowitz, Aleksandrs Slivkins, Wen Sun

Figure 1 for Corruption Robust Exploration in Episodic Reinforcement Learning
Viaarxiv icon