Picture for David Budden

David Budden

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Add code
Apr 11, 2024
Figure 1 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 2 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 3 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 4 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Viaarxiv icon

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Add code
Feb 29, 2024
Viaarxiv icon

The CLRS Algorithmic Reasoning Benchmark

Add code
Jun 04, 2022
Figure 1 for The CLRS Algorithmic Reasoning Benchmark
Figure 2 for The CLRS Algorithmic Reasoning Benchmark
Figure 3 for The CLRS Algorithmic Reasoning Benchmark
Figure 4 for The CLRS Algorithmic Reasoning Benchmark
Viaarxiv icon

Unified Scaling Laws for Routed Language Models

Add code
Feb 09, 2022
Figure 1 for Unified Scaling Laws for Routed Language Models
Figure 2 for Unified Scaling Laws for Routed Language Models
Figure 3 for Unified Scaling Laws for Routed Language Models
Figure 4 for Unified Scaling Laws for Routed Language Models
Viaarxiv icon

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Add code
Dec 08, 2021
Figure 1 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 2 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 3 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 4 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Viaarxiv icon

Large-scale graph representation learning with very deep GNNs and self-supervision

Add code
Jul 20, 2021
Figure 1 for Large-scale graph representation learning with very deep GNNs and self-supervision
Figure 2 for Large-scale graph representation learning with very deep GNNs and self-supervision
Figure 3 for Large-scale graph representation learning with very deep GNNs and self-supervision
Viaarxiv icon

A Combinatorial Perspective on Transfer Learning

Add code
Oct 23, 2020
Figure 1 for A Combinatorial Perspective on Transfer Learning
Figure 2 for A Combinatorial Perspective on Transfer Learning
Figure 3 for A Combinatorial Perspective on Transfer Learning
Figure 4 for A Combinatorial Perspective on Transfer Learning
Viaarxiv icon

Gaussian Gated Linear Networks

Add code
Jun 10, 2020
Figure 1 for Gaussian Gated Linear Networks
Figure 2 for Gaussian Gated Linear Networks
Figure 3 for Gaussian Gated Linear Networks
Figure 4 for Gaussian Gated Linear Networks
Viaarxiv icon

Online Learning in Contextual Bandits using Gated Linear Networks

Add code
Feb 21, 2020
Figure 1 for Online Learning in Contextual Bandits using Gated Linear Networks
Figure 2 for Online Learning in Contextual Bandits using Gated Linear Networks
Figure 3 for Online Learning in Contextual Bandits using Gated Linear Networks
Figure 4 for Online Learning in Contextual Bandits using Gated Linear Networks
Viaarxiv icon

Task-Relevant Adversarial Imitation Learning

Add code
Oct 02, 2019
Figure 1 for Task-Relevant Adversarial Imitation Learning
Figure 2 for Task-Relevant Adversarial Imitation Learning
Figure 3 for Task-Relevant Adversarial Imitation Learning
Figure 4 for Task-Relevant Adversarial Imitation Learning
Viaarxiv icon