Kamil Ciosek

Automatic Music Playlist Generation via Simulation-based Reinforcement Learning

Oct 13, 2023
Federico Tomasi, Joseph Cauteruccio, Surya Kanoria, Kamil Ciosek, Matteo Rinaldi, Zhenwen Dai

Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay

Jul 20, 2023
Thomas M. McDonald, Lucas Maystre, Mounia Lalmas, Daniel Russo, Kamil Ciosek

A Strong Baseline for Batch Imitation Learning

Feb 06, 2023
Matthew Smith, Lucas Maystre, Zhenwen Dai, Kamil Ciosek

Imitation Learning by Reinforcement Learning

Aug 10, 2021
Kamil Ciosek

Information Directed Reward Learning for Reinforcement Learning

Feb 24, 2021
David Lindner, Matteo Turchetta, Sebastian Tschiatschek, Kamil Ciosek, Andreas Krause

Estimating $\alpha$-Rank by Maximizing Information Gain

Jan 22, 2021
Tabish Rashid, Cheng Zhang, Kamil Ciosek

Regularized Policies are Reward Robust

Jan 18, 2021
Hisham Husain, Kamil Ciosek, Ryota Tomioka

Evaluating the Robustness of Collaborative Agents

Jan 14, 2021
Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, A. D. Dragan, Rohin Shah

Deep Interactive Bayesian Reinforcement Learning via Meta-Learning

Jan 11, 2021
Luisa Zintgraf, Sam Devlin, Kamil Ciosek, Shimon Whiteson, Katja Hofmann

DRIFT: Deep Reinforcement Learning for Functional Software Testing

Jul 16, 2020
Luke Harries, Rebekah Storan Clarke, Timothy Chapman, Swamy V. P. L. N. Nallamalli, Levent Ozgur, Shuktika Jain, Alex Leung, Steve Lim, Aaron Dietrich, José Miguel Hernández-Lobato, Tom Ellis, Cheng Zhang, Kamil Ciosek
