Ciara Pike-Burke

Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity

Oct 02, 2023
Emmeran Johnson, Ciara Pike-Burke, Patrick Rebeschini

Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts

Jul 03, 2023
Dirk van der Hoeven, Ciara Pike-Burke, Hao Qiu, Nicolo Cesa-Bianchi

Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes

Feb 22, 2023
Emmeran Johnson, Ciara Pike-Burke, Patrick Rebeschini

Delayed Feedback in Kernel Bandits

Feb 01, 2023
Sattar Vakili, Danyal Ahmed, Alberto Bernacchia, Ciara Pike-Burke

Delayed Feedback in Generalised Linear Bandits Revisited

Jul 25, 2022
Benjamin Howson, Ciara Pike-Burke, Sarah Filippi

Bandit problems with fidelity rewards

Nov 25, 2021
Gábor Lugosi, Ciara Pike-Burke, Pierre-André Savalle

Delayed Feedback in Episodic Reinforcement Learning

Nov 15, 2021
Benjamin Howson, Ciara Pike-Burke, Sarah Filippi

Local Differentially Private Regret Minimization in Reinforcement Learning

Oct 15, 2020
Evrard Garcelon, Vianney Perchet, Ciara Pike-Burke, Matteo Pirotta

A Unifying View of Optimism in Episodic Reinforcement Learning

Jul 03, 2020
Gergely Neu, Ciara Pike-Burke

Recovering Bandits

Oct 31, 2019
Ciara Pike-Burke, Steffen Grünewälder
