Picture for Dani Yogatama

Dani Yogatama

ABC: Attention with Bounded-memory Control

Add code
Oct 06, 2021
Figure 1 for ABC: Attention with Bounded-memory Control
Figure 2 for ABC: Attention with Bounded-memory Control
Figure 3 for ABC: Attention with Bounded-memory Control
Figure 4 for ABC: Attention with Bounded-memory Control
Viaarxiv icon

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers

Add code
Sep 22, 2021
Figure 1 for Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Figure 2 for Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Figure 3 for Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Figure 4 for Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Viaarxiv icon

End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering

Add code
Jun 09, 2021
Figure 1 for End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering
Figure 2 for End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering
Figure 3 for End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering
Figure 4 for End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering
Viaarxiv icon

Finetuning Pretrained Transformers into RNNs

Add code
Mar 24, 2021
Figure 1 for Finetuning Pretrained Transformers into RNNs
Figure 2 for Finetuning Pretrained Transformers into RNNs
Figure 3 for Finetuning Pretrained Transformers into RNNs
Figure 4 for Finetuning Pretrained Transformers into RNNs
Viaarxiv icon

Random Feature Attention

Add code
Mar 19, 2021
Figure 1 for Random Feature Attention
Figure 2 for Random Feature Attention
Figure 3 for Random Feature Attention
Figure 4 for Random Feature Attention
Viaarxiv icon

Adaptive Semiparametric Language Models

Add code
Feb 04, 2021
Figure 1 for Adaptive Semiparametric Language Models
Figure 2 for Adaptive Semiparametric Language Models
Figure 3 for Adaptive Semiparametric Language Models
Figure 4 for Adaptive Semiparametric Language Models
Viaarxiv icon

Pitfalls of Static Language Modelling

Add code
Feb 03, 2021
Figure 1 for Pitfalls of Static Language Modelling
Figure 2 for Pitfalls of Static Language Modelling
Figure 3 for Pitfalls of Static Language Modelling
Figure 4 for Pitfalls of Static Language Modelling
Viaarxiv icon

Syntactic Structure Distillation Pretraining For Bidirectional Encoders

Add code
May 27, 2020
Figure 1 for Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Figure 2 for Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Figure 3 for Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Figure 4 for Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Viaarxiv icon

A Call for More Rigor in Unsupervised Cross-lingual Learning

Add code
Apr 30, 2020
Figure 1 for A Call for More Rigor in Unsupervised Cross-lingual Learning
Figure 2 for A Call for More Rigor in Unsupervised Cross-lingual Learning
Viaarxiv icon

Modelling Latent Skills for Multitask Language Generation

Add code
Feb 21, 2020
Figure 1 for Modelling Latent Skills for Multitask Language Generation
Figure 2 for Modelling Latent Skills for Multitask Language Generation
Figure 3 for Modelling Latent Skills for Multitask Language Generation
Figure 4 for Modelling Latent Skills for Multitask Language Generation
Viaarxiv icon