Alert button
Picture for András György

András György

Alert button

Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits

Add code
Bookmark button
Alert button
Feb 08, 2024
Nicolas Nguyen, Imad Aouali, András György, Claire Vernade

Viaarxiv icon

Online RL in Linearly $q^π$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore

Add code
Bookmark button
Alert button
Oct 11, 2023
Gellért Weisz, András György, Csaba Szepesvári

Viaarxiv icon

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

Add code
Bookmark button
Alert button
May 18, 2023
Qinghua Liu, Gellért Weisz, András György, Chi Jin, Csaba Szepesvári

Viaarxiv icon

A Second-Order Method for Stochastic Bandit Convex Optimisation

Add code
Bookmark button
Alert button
Feb 10, 2023
Tor Lattimore, András György

Figure 1 for A Second-Order Method for Stochastic Bandit Convex Optimisation
Figure 2 for A Second-Order Method for Stochastic Bandit Convex Optimisation
Viaarxiv icon

Optimistic Meta-Gradients

Add code
Bookmark button
Alert button
Jan 09, 2023
Sebastian Flennerhag, Tom Zahavy, Brendan O'Donoghue, Hado van Hasselt, András György, Satinder Singh

Figure 1 for Optimistic Meta-Gradients
Figure 2 for Optimistic Meta-Gradients
Figure 3 for Optimistic Meta-Gradients
Figure 4 for Optimistic Meta-Gradients
Viaarxiv icon

Generalization Bounds for Transfer Learning with Pretrained Classifiers

Add code
Bookmark button
Alert button
Dec 23, 2022
Tomer Galanti, András György, Marcus Hutter

Figure 1 for Generalization Bounds for Transfer Learning with Pretrained Classifiers
Figure 2 for Generalization Bounds for Transfer Learning with Pretrained Classifiers
Figure 3 for Generalization Bounds for Transfer Learning with Pretrained Classifiers
Figure 4 for Generalization Bounds for Transfer Learning with Pretrained Classifiers
Viaarxiv icon

Understanding Self-Predictive Learning for Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 06, 2022
Yunhao Tang, Zhaohan Daniel Guo, Pierre Harvey Richemond, Bernardo Ávila Pires, Yash Chandak, Rémi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, András György, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko

Figure 1 for Understanding Self-Predictive Learning for Reinforcement Learning
Figure 2 for Understanding Self-Predictive Learning for Reinforcement Learning
Figure 3 for Understanding Self-Predictive Learning for Reinforcement Learning
Figure 4 for Understanding Self-Predictive Learning for Reinforcement Learning
Viaarxiv icon

Confident Approximate Policy Iteration for Efficient Local Planning in $q^π$-realizable MDPs

Add code
Bookmark button
Alert button
Oct 27, 2022
Gellért Weisz, András György, Tadashi Kozuno, Csaba Szepesvári

Figure 1 for Confident Approximate Policy Iteration for Efficient Local Planning in $q^π$-realizable MDPs
Viaarxiv icon

Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

Add code
Bookmark button
Alert button
May 26, 2022
Sanae Amani, Tor Lattimore, András György, Lin F. Yang

Figure 1 for Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost
Figure 2 for Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost
Viaarxiv icon

Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

Add code
Bookmark button
Alert button
Mar 13, 2022
MohammadJavad Azizi, Thang Duong, Yasin Abbasi-Yadkori, András György, Claire Vernade, Mohammad Ghavamzadeh

Figure 1 for Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Figure 2 for Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Figure 3 for Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Figure 4 for Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Viaarxiv icon