Picture for Vinh Q. Tran

Vinh Q. Tran

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?

Add code
Jul 21, 2022
Figure 1 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 2 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 3 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 4 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Viaarxiv icon

Confident Adaptive Language Modeling

Add code
Jul 14, 2022
Figure 1 for Confident Adaptive Language Modeling
Figure 2 for Confident Adaptive Language Modeling
Figure 3 for Confident Adaptive Language Modeling
Figure 4 for Confident Adaptive Language Modeling
Viaarxiv icon

Unifying Language Learning Paradigms

Add code
May 10, 2022
Figure 1 for Unifying Language Learning Paradigms
Figure 2 for Unifying Language Learning Paradigms
Figure 3 for Unifying Language Learning Paradigms
Figure 4 for Unifying Language Learning Paradigms
Viaarxiv icon

A New Generation of Perspective API: Efficient Multilingual Character-level Transformers

Add code
Feb 22, 2022
Figure 1 for A New Generation of Perspective API: Efficient Multilingual Character-level Transformers
Figure 2 for A New Generation of Perspective API: Efficient Multilingual Character-level Transformers
Figure 3 for A New Generation of Perspective API: Efficient Multilingual Character-level Transformers
Figure 4 for A New Generation of Perspective API: Efficient Multilingual Character-level Transformers
Viaarxiv icon

Transformer Memory as a Differentiable Search Index

Add code
Feb 16, 2022
Figure 1 for Transformer Memory as a Differentiable Search Index
Figure 2 for Transformer Memory as a Differentiable Search Index
Figure 3 for Transformer Memory as a Differentiable Search Index
Figure 4 for Transformer Memory as a Differentiable Search Index
Viaarxiv icon

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning

Add code
Nov 22, 2021
Figure 1 for ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
Figure 2 for ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
Figure 3 for ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
Figure 4 for ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
Viaarxiv icon

Charformer: Fast Character Transformers via Gradient-based Subword Tokenization

Add code
Jul 02, 2021
Figure 1 for Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
Figure 2 for Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
Figure 3 for Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
Figure 4 for Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
Viaarxiv icon

AgreeSum: Agreement-Oriented Multi-Document Summarization

Add code
Jun 04, 2021
Figure 1 for AgreeSum: Agreement-Oriented Multi-Document Summarization
Figure 2 for AgreeSum: Agreement-Oriented Multi-Document Summarization
Figure 3 for AgreeSum: Agreement-Oriented Multi-Document Summarization
Figure 4 for AgreeSum: Agreement-Oriented Multi-Document Summarization
Viaarxiv icon

Quiz-Style Question Generation for News Stories

Add code
Feb 18, 2021
Figure 1 for Quiz-Style Question Generation for News Stories
Figure 2 for Quiz-Style Question Generation for News Stories
Figure 3 for Quiz-Style Question Generation for News Stories
Figure 4 for Quiz-Style Question Generation for News Stories
Viaarxiv icon