Yi Tay

Transformer Memory as a Differentiable Search Index

Feb 16, 2022
Via arXiv

PolyViT: Co-training Vision Transformers on Images, Videos and Audio

Nov 25, 2021
Via arXiv

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning

Nov 22, 2021
Via arXiv

The Efficiency Misnomer

Oct 25, 2021
Via arXiv

SCENIC: A JAX Library for Computer Vision Research and Beyond

Oct 18, 2021
Via arXiv

Sharpness-Aware Minimization Improves Language Model Generalization

Oct 16, 2021
Via arXiv

Improving Compositional Generalization with Self-Training for Data-to-Text Generation

Oct 16, 2021
Via arXiv

Born Again Neural Rankers

Sep 30, 2021
Via arXiv

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers

Sep 22, 2021
Via arXiv

The Benchmark Lottery

Jul 14, 2021
Via arXiv