Alert button
Picture for Di He

Di He

Alert button

Rethinking the Positional Encoding in Language Pre-training

Add code
Bookmark button
Alert button
Jun 28, 2020
Guolin Ke, Di He, Tie-Yan Liu

Figure 1 for Rethinking the Positional Encoding in Language Pre-training
Figure 2 for Rethinking the Positional Encoding in Language Pre-training
Figure 3 for Rethinking the Positional Encoding in Language Pre-training
Figure 4 for Rethinking the Positional Encoding in Language Pre-training
Viaarxiv icon

MC-BERT: Efficient Language Pre-Training via a Meta Controller

Add code
Bookmark button
Alert button
Jun 16, 2020
Zhenhui Xu, Linyuan Gong, Guolin Ke, Di He, Shuxin Zheng, Liwei Wang, Jiang Bian, Tie-Yan Liu

Figure 1 for MC-BERT: Efficient Language Pre-Training via a Meta Controller
Figure 2 for MC-BERT: Efficient Language Pre-Training via a Meta Controller
Figure 3 for MC-BERT: Efficient Language Pre-Training via a Meta Controller
Figure 4 for MC-BERT: Efficient Language Pre-Training via a Meta Controller
Viaarxiv icon

Invertible Image Rescaling

Add code
Bookmark button
Alert button
May 12, 2020
Mingqing Xiao, Shuxin Zheng, Chang Liu, Yaolong Wang, Di He, Guolin Ke, Jiang Bian, Zhouchen Lin, Tie-Yan Liu

Figure 1 for Invertible Image Rescaling
Figure 2 for Invertible Image Rescaling
Figure 3 for Invertible Image Rescaling
Figure 4 for Invertible Image Rescaling
Viaarxiv icon

Incorporating BERT into Neural Machine Translation

Add code
Bookmark button
Alert button
Feb 17, 2020
Jinhua Zhu, Yingce Xia, Lijun Wu, Di He, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

Figure 1 for Incorporating BERT into Neural Machine Translation
Figure 2 for Incorporating BERT into Neural Machine Translation
Figure 3 for Incorporating BERT into Neural Machine Translation
Figure 4 for Incorporating BERT into Neural Machine Translation
Viaarxiv icon

MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius

Add code
Bookmark button
Alert button
Feb 15, 2020
Runtian Zhai, Chen Dan, Di He, Huan Zhang, Boqing Gong, Pradeep Ravikumar, Cho-Jui Hsieh, Liwei Wang

Figure 1 for MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius
Figure 2 for MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius
Figure 3 for MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius
Figure 4 for MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius
Viaarxiv icon

On Layer Normalization in the Transformer Architecture

Add code
Bookmark button
Alert button
Feb 12, 2020
Ruibin Xiong, Yunchang Yang, Di He, Kai Zheng, Shuxin Zheng, Chen Xing, Huishuai Zhang, Yanyan Lan, Liwei Wang, Tie-Yan Liu

Figure 1 for On Layer Normalization in the Transformer Architecture
Figure 2 for On Layer Normalization in the Transformer Architecture
Figure 3 for On Layer Normalization in the Transformer Architecture
Figure 4 for On Layer Normalization in the Transformer Architecture
Viaarxiv icon

Defective Convolutional Layers Learn Robust CNNs

Add code
Bookmark button
Alert button
Nov 19, 2019
Tiange Luo, Tianle Cai, Mengxiao Zhang, Siyu Chen, Di He, Liwei Wang

Figure 1 for Defective Convolutional Layers Learn Robust CNNs
Figure 2 for Defective Convolutional Layers Learn Robust CNNs
Figure 3 for Defective Convolutional Layers Learn Robust CNNs
Figure 4 for Defective Convolutional Layers Learn Robust CNNs
Viaarxiv icon