Machel Reid

Gemma: Open Models Based on Gemini Research and Technology

Mar 13, 2024

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer

May 24, 2023

mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations

May 23, 2023

On the Role of Parallel Data in Cross-lingual Transfer Learning

Dec 20, 2022

DiffusER: Discrete Diffusion via Edit-based Reconstruction

Oct 30, 2022

M2D2: A Massively Multi-domain Language Modeling Dataset

Oct 13, 2022

Learning to Model Editing Processes

May 24, 2022

Large Language Models are Zero-Shot Reasoners

May 24, 2022