Picture for Barret Zoph

Barret Zoph

Tony

Designing Effective Sparse Expert Models

Add code
Feb 17, 2022
Figure 1 for Designing Effective Sparse Expert Models
Figure 2 for Designing Effective Sparse Expert Models
Figure 3 for Designing Effective Sparse Expert Models
Figure 4 for Designing Effective Sparse Expert Models
Viaarxiv icon

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Add code
Dec 13, 2021
Figure 1 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Figure 2 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Figure 3 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Figure 4 for GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Viaarxiv icon

Multi-Task Self-Training for Learning General Representations

Add code
Aug 25, 2021
Figure 1 for Multi-Task Self-Training for Learning General Representations
Figure 2 for Multi-Task Self-Training for Learning General Representations
Figure 3 for Multi-Task Self-Training for Learning General Representations
Figure 4 for Multi-Task Self-Training for Learning General Representations
Viaarxiv icon

Simple Training Strategies and Model Scaling for Object Detection

Add code
Jun 30, 2021
Figure 1 for Simple Training Strategies and Model Scaling for Object Detection
Figure 2 for Simple Training Strategies and Model Scaling for Object Detection
Figure 3 for Simple Training Strategies and Model Scaling for Object Detection
Figure 4 for Simple Training Strategies and Model Scaling for Object Detection
Viaarxiv icon

Revisiting ResNets: Improved Training and Scaling Strategies

Add code
Mar 13, 2021
Figure 1 for Revisiting ResNets: Improved Training and Scaling Strategies
Figure 2 for Revisiting ResNets: Improved Training and Scaling Strategies
Figure 3 for Revisiting ResNets: Improved Training and Scaling Strategies
Figure 4 for Revisiting ResNets: Improved Training and Scaling Strategies
Viaarxiv icon

Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity

Add code
Jan 11, 2021
Figure 1 for Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Figure 2 for Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Figure 3 for Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Figure 4 for Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Viaarxiv icon

Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

Add code
Dec 13, 2020
Figure 1 for Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Figure 2 for Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Figure 3 for Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Figure 4 for Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Viaarxiv icon

Does Data Augmentation Benefit from Split BatchNorms

Add code
Oct 15, 2020
Figure 1 for Does Data Augmentation Benefit from Split BatchNorms
Figure 2 for Does Data Augmentation Benefit from Split BatchNorms
Figure 3 for Does Data Augmentation Benefit from Split BatchNorms
Figure 4 for Does Data Augmentation Benefit from Split BatchNorms
Viaarxiv icon

Rethinking Pre-training and Self-training

Add code
Jun 11, 2020
Figure 1 for Rethinking Pre-training and Self-training
Figure 2 for Rethinking Pre-training and Self-training
Figure 3 for Rethinking Pre-training and Self-training
Figure 4 for Rethinking Pre-training and Self-training
Viaarxiv icon

Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation

Add code
May 22, 2020
Figure 1 for Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation
Figure 2 for Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation
Figure 3 for Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation
Figure 4 for Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation
Viaarxiv icon