Brendan D. Tracey

Towards practical reinforcement learning for tokamak magnetic control

Jul 21, 2023
Brendan D. Tracey, Andrea Michi, Yuri Chervonyi, Ian Davies, Cosmin Paduraru, Nevena Lazic, Federico Felici, Timo Ewalds, Craig Donner, Cristian Galperti, Jonas Buchli, Michael Neunert, Andrea Huber, Jonathan Evens, Paula Kurylowicz, Daniel J. Mankowitz, Martin Riedmiller, The TCV Team

Via arXiv

From Motor Control to Team Play in Simulated Humanoid Football

May 25, 2021
Siqi Liu, Guy Lever, Zhe Wang, Josh Merel, S. M. Ali Eslami, Daniel Hennes, Wojciech M. Czarnecki, Yuval Tassa, Shayegan Omidshafiei, Abbas Abdolmaleki, Noah Y. Siegel, Leonard Hasenclever, Luke Marris, Saran Tunyasuvunakool, H. Francis Song, Markus Wulfmeier, Paul Muller, Tuomas Haarnoja, Brendan D. Tracey, Karl Tuyls, Thore Graepel, Nicolas Heess

Via arXiv

Nonlinear Information Bottleneck

Sep 04, 2018
Artemy Kolchinsky, Brendan D. Tracey, David H. Wolpert

Via arXiv

Pathologies in information bottleneck for deterministic supervised learning

Aug 23, 2018
Artemy Kolchinsky, Brendan D. Tracey, Steven Van Kuyk

Via arXiv

Estimating Mixture Entropy with Pairwise Distances

Aug 22, 2018
Artemy Kolchinsky, Brendan D. Tracey

Via arXiv

Upgrading from Gaussian Processes to Student's-T Processes

Jan 18, 2018
Brendan D. Tracey, David H. Wolpert

Via arXiv

Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes

Sep 19, 2017
Kunal Menda, Yi-Chun Chen, Justin Grana, James W. Bono, Brendan D. Tracey, Mykel J. Kochenderfer, David Wolpert

Via arXiv

Reducing the error of Monte Carlo Algorithms by Learning Control Variates

Jun 07, 2016
Brendan D. Tracey, David H. Wolpert

Via arXiv