Picture for Aidan Clark

Aidan Clark

Training Compute-Optimal Large Language Models

Add code
Mar 29, 2022
Figure 1 for Training Compute-Optimal Large Language Models
Figure 2 for Training Compute-Optimal Large Language Models
Figure 3 for Training Compute-Optimal Large Language Models
Figure 4 for Training Compute-Optimal Large Language Models
Viaarxiv icon

Unified Scaling Laws for Routed Language Models

Add code
Feb 09, 2022
Figure 1 for Unified Scaling Laws for Routed Language Models
Figure 2 for Unified Scaling Laws for Routed Language Models
Figure 3 for Unified Scaling Laws for Routed Language Models
Figure 4 for Unified Scaling Laws for Routed Language Models
Viaarxiv icon

Improving language models by retrieving from trillions of tokens

Add code
Jan 11, 2022
Figure 1 for Improving language models by retrieving from trillions of tokens
Figure 2 for Improving language models by retrieving from trillions of tokens
Figure 3 for Improving language models by retrieving from trillions of tokens
Figure 4 for Improving language models by retrieving from trillions of tokens
Viaarxiv icon

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Add code
Dec 08, 2021
Figure 1 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 2 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 3 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 4 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Viaarxiv icon

Podracer architectures for scalable Reinforcement Learning

Add code
Apr 13, 2021
Figure 1 for Podracer architectures for scalable Reinforcement Learning
Figure 2 for Podracer architectures for scalable Reinforcement Learning
Figure 3 for Podracer architectures for scalable Reinforcement Learning
Figure 4 for Podracer architectures for scalable Reinforcement Learning
Viaarxiv icon

Skillful Precipitation Nowcasting using Deep Generative Models of Radar

Add code
Apr 02, 2021
Figure 1 for Skillful Precipitation Nowcasting using Deep Generative Models of Radar
Figure 2 for Skillful Precipitation Nowcasting using Deep Generative Models of Radar
Figure 3 for Skillful Precipitation Nowcasting using Deep Generative Models of Radar
Viaarxiv icon

Transformation-based Adversarial Video Prediction on Large-Scale Data

Add code
Mar 09, 2020
Figure 1 for Transformation-based Adversarial Video Prediction on Large-Scale Data
Figure 2 for Transformation-based Adversarial Video Prediction on Large-Scale Data
Figure 3 for Transformation-based Adversarial Video Prediction on Large-Scale Data
Figure 4 for Transformation-based Adversarial Video Prediction on Large-Scale Data
Viaarxiv icon

Stabilizing Transformers for Reinforcement Learning

Add code
Oct 13, 2019
Figure 1 for Stabilizing Transformers for Reinforcement Learning
Figure 2 for Stabilizing Transformers for Reinforcement Learning
Figure 3 for Stabilizing Transformers for Reinforcement Learning
Figure 4 for Stabilizing Transformers for Reinforcement Learning
Viaarxiv icon

High Fidelity Speech Synthesis with Adversarial Networks

Add code
Sep 26, 2019
Figure 1 for High Fidelity Speech Synthesis with Adversarial Networks
Figure 2 for High Fidelity Speech Synthesis with Adversarial Networks
Figure 3 for High Fidelity Speech Synthesis with Adversarial Networks
Figure 4 for High Fidelity Speech Synthesis with Adversarial Networks
Viaarxiv icon

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Add code
Sep 26, 2019
Figure 1 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 2 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 3 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 4 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Viaarxiv icon