Alert button
Picture for Csaba Szepesvari

Csaba Szepesvari

Alert button

Dj

Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures

Add code
Bookmark button
Alert button
Dec 04, 2018
Jonathan Uesato, Ananya Kumar, Csaba Szepesvari, Tom Erez, Avraham Ruderman, Keith Anderson, Krishmamurthy, Dvijotham, Nicolas Heess, Pushmeet Kohli

Figure 1 for Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Figure 2 for Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Figure 3 for Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Figure 4 for Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Viaarxiv icon

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits

Add code
Bookmark button
Alert button
Nov 13, 2018
Branislav Kveton, Csaba Szepesvari, Zheng Wen, Mohammad Ghavamzadeh, Tor Lattimore

Figure 1 for Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits
Figure 2 for Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits
Viaarxiv icon

Model-Free Linear Quadratic Control via Reduction to Expert Prediction

Add code
Bookmark button
Alert button
Oct 05, 2018
Yasin Abbasi-Yadkori, Nevena Lazic, Csaba Szepesvari

Figure 1 for Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Viaarxiv icon

PAC-Bayes bounds for stable algorithms with instance-dependent priors

Add code
Bookmark button
Alert button
Aug 30, 2018
Omar Rivasplata, Emilio Parrado-Hernandez, John Shawe-Taylor, Shiliang Sun, Csaba Szepesvari

Figure 1 for PAC-Bayes bounds for stable algorithms with instance-dependent priors
Figure 2 for PAC-Bayes bounds for stable algorithms with instance-dependent priors
Figure 3 for PAC-Bayes bounds for stable algorithms with instance-dependent priors
Figure 4 for PAC-Bayes bounds for stable algorithms with instance-dependent priors
Viaarxiv icon

BubbleRank: Safe Online Learning to Rerank

Add code
Bookmark button
Alert button
Jun 15, 2018
Branislav Kveton, Chang Li, Tor Lattimore, Ilya Markov, Maarten de Rijke, Csaba Szepesvari, Masrour Zoghi

Figure 1 for BubbleRank: Safe Online Learning to Rerank
Figure 2 for BubbleRank: Safe Online Learning to Rerank
Figure 3 for BubbleRank: Safe Online Learning to Rerank
Viaarxiv icon

Bandits with Delayed, Aggregated Anonymous Feedback

Add code
Bookmark button
Alert button
Jun 13, 2018
Ciara Pike-Burke, Shipra Agrawal, Csaba Szepesvari, Steffen Grunewalder

Figure 1 for Bandits with Delayed, Aggregated Anonymous Feedback
Figure 2 for Bandits with Delayed, Aggregated Anonymous Feedback
Figure 3 for Bandits with Delayed, Aggregated Anonymous Feedback
Figure 4 for Bandits with Delayed, Aggregated Anonymous Feedback
Viaarxiv icon

TopRank: A practical algorithm for online stochastic ranking

Add code
Bookmark button
Alert button
Jun 06, 2018
Tor Lattimore, Branislav Kveton, Shuai Li, Csaba Szepesvari

Figure 1 for TopRank: A practical algorithm for online stochastic ranking
Figure 2 for TopRank: A practical algorithm for online stochastic ranking
Viaarxiv icon

Cleaning up the neighborhood: A full classification for adversarial partial monitoring

Add code
Bookmark button
Alert button
May 23, 2018
Tor Lattimore, Csaba Szepesvari

Figure 1 for Cleaning up the neighborhood: A full classification for adversarial partial monitoring
Figure 2 for Cleaning up the neighborhood: A full classification for adversarial partial monitoring
Viaarxiv icon

Stochastic Low-Rank Bandits

Add code
Bookmark button
Alert button
Dec 13, 2017
Branislav Kveton, Csaba Szepesvari, Anup Rao, Zheng Wen, Yasin Abbasi-Yadkori, S. Muthukrishnan

Figure 1 for Stochastic Low-Rank Bandits
Viaarxiv icon

Crowdsourcing with Sparsely Interacting Workers

Add code
Bookmark button
Alert button
Jun 20, 2017
Yao Ma, Alex Olshevsky, Venkatesh Saligrama, Csaba Szepesvari

Figure 1 for Crowdsourcing with Sparsely Interacting Workers
Figure 2 for Crowdsourcing with Sparsely Interacting Workers
Figure 3 for Crowdsourcing with Sparsely Interacting Workers
Figure 4 for Crowdsourcing with Sparsely Interacting Workers
Viaarxiv icon