Qun Liu

TernaryBERT: Distillation-aware Ultra-low Bit BERT

Oct 10, 2020
Wei Zhang, Lu Hou, Yichun Yin, Lifeng Shang, Xiao Chen, Xin Jiang, Qun Liu

Via arXiv

Why Skip If You Can Combine: A Simple Knowledge Distillation Technique for Intermediate Layers

Oct 06, 2020
Yimeng Wu, Peyman Passban, Mehdi Rezagholizade, Qun Liu

Via arXiv

TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling

Aug 12, 2020
Shuai Zhang, Peng Zhang, Xindian Ma, Junqiu Wei, Ningning Wang, Qun Liu

Via arXiv

Learning to Detect Unacceptable Machine Translations for Downstream Tasks

May 08, 2020
Meng Zhang, Xin Jiang, Yang Liu, Qun Liu

Via arXiv

Accurate Word Alignment Induction from Neural Machine Translation

Apr 30, 2020
Yun Chen, Yang Liu, Guanhua Chen, Xin Jiang, Qun Liu

Via arXiv

Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT

Apr 30, 2020
Zhiyong Wu, Yun Chen, Ben Kao, Qun Liu

Via arXiv

Probabilistically Masked Language Model Capable of Autoregressive Generation in Arbitrary Word Order

Apr 24, 2020
Yi Liao, Xin Jiang, Qun Liu

Via arXiv

DynaBERT: Dynamic BERT with Adaptive Width and Depth

Apr 08, 2020
Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu

Via arXiv