Picture for Tie-Yan Liu

Tie-Yan Liu

LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning

Add code
Apr 27, 2020
Figure 1 for LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning
Figure 2 for LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning
Figure 3 for LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning
Figure 4 for LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning
Viaarxiv icon

MPNet: Masked and Permuted Pre-training for Language Understanding

Add code
Apr 20, 2020
Figure 1 for MPNet: Masked and Permuted Pre-training for Language Understanding
Figure 2 for MPNet: Masked and Permuted Pre-training for Language Understanding
Figure 3 for MPNet: Masked and Permuted Pre-training for Language Understanding
Figure 4 for MPNet: Masked and Permuted Pre-training for Language Understanding
Viaarxiv icon

Discriminator Contrastive Divergence: Semi-Amortized Generative Modeling by Exploring Energy of the Discriminator

Add code
Apr 05, 2020
Figure 1 for Discriminator Contrastive Divergence: Semi-Amortized Generative Modeling by Exploring Energy of the Discriminator
Figure 2 for Discriminator Contrastive Divergence: Semi-Amortized Generative Modeling by Exploring Energy of the Discriminator
Figure 3 for Discriminator Contrastive Divergence: Semi-Amortized Generative Modeling by Exploring Energy of the Discriminator
Figure 4 for Discriminator Contrastive Divergence: Semi-Amortized Generative Modeling by Exploring Energy of the Discriminator
Viaarxiv icon

Suphx: Mastering Mahjong with Deep Reinforcement Learning

Add code
Apr 01, 2020
Figure 1 for Suphx: Mastering Mahjong with Deep Reinforcement Learning
Figure 2 for Suphx: Mastering Mahjong with Deep Reinforcement Learning
Figure 3 for Suphx: Mastering Mahjong with Deep Reinforcement Learning
Figure 4 for Suphx: Mastering Mahjong with Deep Reinforcement Learning
Viaarxiv icon

Semi-Supervised Neural Architecture Search

Add code
Mar 09, 2020
Figure 1 for Semi-Supervised Neural Architecture Search
Figure 2 for Semi-Supervised Neural Architecture Search
Figure 3 for Semi-Supervised Neural Architecture Search
Figure 4 for Semi-Supervised Neural Architecture Search
Viaarxiv icon

Incorporating BERT into Neural Machine Translation

Add code
Feb 17, 2020
Figure 1 for Incorporating BERT into Neural Machine Translation
Figure 2 for Incorporating BERT into Neural Machine Translation
Figure 3 for Incorporating BERT into Neural Machine Translation
Figure 4 for Incorporating BERT into Neural Machine Translation
Viaarxiv icon

On Layer Normalization in the Transformer Architecture

Add code
Feb 12, 2020
Figure 1 for On Layer Normalization in the Transformer Architecture
Figure 2 for On Layer Normalization in the Transformer Architecture
Figure 3 for On Layer Normalization in the Transformer Architecture
Figure 4 for On Layer Normalization in the Transformer Architecture
Viaarxiv icon

A Study of Multilingual Neural Machine Translation

Add code
Dec 25, 2019
Figure 1 for A Study of Multilingual Neural Machine Translation
Figure 2 for A Study of Multilingual Neural Machine Translation
Figure 3 for A Study of Multilingual Neural Machine Translation
Figure 4 for A Study of Multilingual Neural Machine Translation
Viaarxiv icon

Gradient Perturbation is Underrated for Differentially Private Convex Optimization

Add code
Nov 26, 2019
Figure 1 for Gradient Perturbation is Underrated for Differentially Private Convex Optimization
Figure 2 for Gradient Perturbation is Underrated for Differentially Private Convex Optimization
Figure 3 for Gradient Perturbation is Underrated for Differentially Private Convex Optimization
Figure 4 for Gradient Perturbation is Underrated for Differentially Private Convex Optimization
Viaarxiv icon

Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation

Add code
Nov 21, 2019
Figure 1 for Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Figure 2 for Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Figure 3 for Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Figure 4 for Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Viaarxiv icon