Alert button
Picture for Tom Stepleton

Tom Stepleton

Alert button

Ethical and social risks of harm from Language Models

Add code
Bookmark button
Alert button
Dec 08, 2021
Laura Weidinger, John Mellor, Maribeth Rauh, Conor Griffin, Jonathan Uesato, Po-Sen Huang, Myra Cheng, Mia Glaese, Borja Balle, Atoosa Kasirzadeh, Zac Kenton, Sasha Brown, Will Hawkins, Tom Stepleton, Courtney Biles, Abeba Birhane, Julia Haas, Laura Rimell, Lisa Anne Hendricks, William Isaac, Sean Legassick, Geoffrey Irving, Iason Gabriel

Figure 1 for Ethical and social risks of harm from Language Models
Figure 2 for Ethical and social risks of harm from Language Models
Viaarxiv icon

Counterfactual Credit Assignment in Model-Free Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 18, 2020
Thomas Mesnard, Théophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Tom Stepleton, Nicolas Heess, Arthur Guez, Marcus Hutter, Lars Buesing, Rémi Munos

Figure 1 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 2 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 3 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 4 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Viaarxiv icon

Wasserstein Fair Classification

Add code
Bookmark button
Alert button
Jul 28, 2019
Ray Jiang, Aldo Pacchiano, Tom Stepleton, Heinrich Jiang, Silvia Chiappa

Figure 1 for Wasserstein Fair Classification
Figure 2 for Wasserstein Fair Classification
Figure 3 for Wasserstein Fair Classification
Figure 4 for Wasserstein Fair Classification
Viaarxiv icon

Safe and Efficient Off-Policy Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 07, 2016
Rémi Munos, Tom Stepleton, Anna Harutyunyan, Marc G. Bellemare

Figure 1 for Safe and Efficient Off-Policy Reinforcement Learning
Figure 2 for Safe and Efficient Off-Policy Reinforcement Learning
Viaarxiv icon

Q($λ$) with Off-Policy Corrections

Add code
Bookmark button
Alert button
Aug 11, 2016
Anna Harutyunyan, Marc G. Bellemare, Tom Stepleton, Remi Munos

Figure 1 for Q($λ$) with Off-Policy Corrections
Viaarxiv icon