Alert button
Picture for Ray Jiang

Ray Jiang

Alert button

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 07, 2023
Michaël Mathieu, Sherjil Ozair, Srivatsan Srinivasan, Caglar Gulcehre, Shangtong Zhang, Ray Jiang, Tom Le Paine, Richard Powell, Konrad Żołna, Julian Schrittwieser, David Choi, Petko Georgiev, Daniel Toyama, Aja Huang, Roman Ring, Igor Babuschkin, Timo Ewalds, Mahyar Bordbar, Sarah Henderson, Sergio Gómez Colmenarejo, Aäron van den Oord, Wojciech Marian Czarnecki, Nando de Freitas, Oriol Vinyals

Figure 1 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 2 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 3 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 4 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Viaarxiv icon

Scaling Goal-based Exploration via Pruning Proto-goals

Add code
Bookmark button
Alert button
Feb 09, 2023
Akhil Bagaria, Ray Jiang, Ramana Kumar, Tom Schaul

Figure 1 for Scaling Goal-based Exploration via Pruning Proto-goals
Figure 2 for Scaling Goal-based Exploration via Pruning Proto-goals
Figure 3 for Scaling Goal-based Exploration via Pruning Proto-goals
Figure 4 for Scaling Goal-based Exploration via Pruning Proto-goals
Viaarxiv icon

Human-level Atari 200x faster

Add code
Bookmark button
Alert button
Sep 15, 2022
Steven Kapturowski, Víctor Campos, Ray Jiang, Nemanja Rakićević, Hado van Hasselt, Charles Blundell, Adrià Puigdomènech Badia

Figure 1 for Human-level Atari 200x faster
Figure 2 for Human-level Atari 200x faster
Figure 3 for Human-level Atari 200x faster
Figure 4 for Human-level Atari 200x faster
Viaarxiv icon

Learning Expected Emphatic Traces for Deep RL

Add code
Bookmark button
Alert button
Jul 12, 2021
Ray Jiang, Shangtong Zhang, Veronica Chelu, Adam White, Hado van Hasselt

Figure 1 for Learning Expected Emphatic Traces for Deep RL
Figure 2 for Learning Expected Emphatic Traces for Deep RL
Figure 3 for Learning Expected Emphatic Traces for Deep RL
Figure 4 for Learning Expected Emphatic Traces for Deep RL
Viaarxiv icon

Emphatic Algorithms for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 21, 2021
Ray Jiang, Tom Zahavy, Zhongwen Xu, Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt

Figure 1 for Emphatic Algorithms for Deep Reinforcement Learning
Figure 2 for Emphatic Algorithms for Deep Reinforcement Learning
Figure 3 for Emphatic Algorithms for Deep Reinforcement Learning
Figure 4 for Emphatic Algorithms for Deep Reinforcement Learning
Viaarxiv icon

Causally Correct Partial Models for Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 07, 2020
Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

Figure 1 for Causally Correct Partial Models for Reinforcement Learning
Figure 2 for Causally Correct Partial Models for Reinforcement Learning
Figure 3 for Causally Correct Partial Models for Reinforcement Learning
Figure 4 for Causally Correct Partial Models for Reinforcement Learning
Viaarxiv icon

Reducing Sentiment Bias in Language Models via Counterfactual Evaluation

Add code
Bookmark button
Alert button
Nov 08, 2019
Po-Sen Huang, Huan Zhang, Ray Jiang, Robert Stanforth, Johannes Welbl, Jack Rae, Vishal Maini, Dani Yogatama, Pushmeet Kohli

Figure 1 for Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Figure 2 for Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Figure 3 for Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Figure 4 for Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Viaarxiv icon

Wasserstein Fair Classification

Add code
Bookmark button
Alert button
Jul 28, 2019
Ray Jiang, Aldo Pacchiano, Tom Stepleton, Heinrich Jiang, Silvia Chiappa

Figure 1 for Wasserstein Fair Classification
Figure 2 for Wasserstein Fair Classification
Figure 3 for Wasserstein Fair Classification
Figure 4 for Wasserstein Fair Classification
Viaarxiv icon

Degenerate Feedback Loops in Recommender Systems

Add code
Bookmark button
Alert button
Mar 27, 2019
Ray Jiang, Silvia Chiappa, Tor Lattimore, András György, Pushmeet Kohli

Figure 1 for Degenerate Feedback Loops in Recommender Systems
Figure 2 for Degenerate Feedback Loops in Recommender Systems
Figure 3 for Degenerate Feedback Loops in Recommender Systems
Figure 4 for Degenerate Feedback Loops in Recommender Systems
Viaarxiv icon

Learning from Delayed Outcomes with Intermediate Observations

Add code
Bookmark button
Alert button
Jul 24, 2018
Timothy A. Mann, Sven Gowal, Ray Jiang, Huiyi Hu, Balaji Lakshminarayanan, Andras Gyorgy

Figure 1 for Learning from Delayed Outcomes with Intermediate Observations
Figure 2 for Learning from Delayed Outcomes with Intermediate Observations
Figure 3 for Learning from Delayed Outcomes with Intermediate Observations
Figure 4 for Learning from Delayed Outcomes with Intermediate Observations
Viaarxiv icon