Alert button
Picture for Ang Wang

Ang Wang

Alert button

ROAM: memory-efficient large DNN training via optimized operator ordering and memory layout

Add code
Bookmark button
Alert button
Oct 30, 2023
Huiyao Shu, Ang Wang, Ziji Shi, Hanyu Zhao, Yong Li, Lu Lu

Viaarxiv icon

TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation

Add code
Bookmark button
Alert button
Feb 01, 2023
Ziji Shi, Le Jiang, Ang Wang, Jie Zhang, Xianyan Jia, Yong Li, Chencan Wu, Jialin Li, Wei Lin

Figure 1 for TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation
Figure 2 for TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation
Figure 3 for TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation
Figure 4 for TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation
Viaarxiv icon

Revisiting and Advancing Chinese Natural Language Understanding with Accelerated Heterogeneous Knowledge Pre-training

Add code
Bookmark button
Alert button
Oct 12, 2022
Taolin Zhang, Junwei Dong, Jianing Wang, Chengyu Wang, Ang Wang, Yinghui Liu, Jun Huang, Yong Li, Xiaofeng He

Figure 1 for Revisiting and Advancing Chinese Natural Language Understanding with Accelerated Heterogeneous Knowledge Pre-training
Figure 2 for Revisiting and Advancing Chinese Natural Language Understanding with Accelerated Heterogeneous Knowledge Pre-training
Figure 3 for Revisiting and Advancing Chinese Natural Language Understanding with Accelerated Heterogeneous Knowledge Pre-training
Figure 4 for Revisiting and Advancing Chinese Natural Language Understanding with Accelerated Heterogeneous Knowledge Pre-training
Viaarxiv icon

M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining

Add code
Bookmark button
Alert button
Oct 25, 2021
Junyang Lin, An Yang, Jinze Bai, Chang Zhou, Le Jiang, Xianyan Jia, Ang Wang, Jie Zhang, Yong Li, Wei Lin, Jingren Zhou, Hongxia Yang

Figure 1 for M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Figure 2 for M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Figure 3 for M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Figure 4 for M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining
Viaarxiv icon

Exploring Sparse Expert Models and Beyond

Add code
Bookmark button
Alert button
Jun 14, 2021
An Yang, Junyang Lin, Rui Men, Chang Zhou, Le Jiang, Xianyan Jia, Ang Wang, Jie Zhang, Jiamang Wang, Yong Li, Di Zhang, Wei Lin, Lin Qu, Jingren Zhou, Hongxia Yang

Figure 1 for Exploring Sparse Expert Models and Beyond
Figure 2 for Exploring Sparse Expert Models and Beyond
Figure 3 for Exploring Sparse Expert Models and Beyond
Figure 4 for Exploring Sparse Expert Models and Beyond
Viaarxiv icon

M6: A Chinese Multimodal Pretrainer

Add code
Bookmark button
Alert button
Mar 02, 2021
Junyang Lin, Rui Men, An Yang, Chang Zhou, Ming Ding, Yichang Zhang, Peng Wang, Ang Wang, Le Jiang, Xianyan Jia, Jie Zhang, Jianwei Zhang, Xu Zou, Zhikang Li, Xiaodong Deng, Jie Liu, Jinbao Xue, Huiling Zhou, Jianxin Ma, Jin Yu, Yong Li, Wei Lin, Jingren Zhou, Jie Tang, Hongxia Yang

Figure 1 for M6: A Chinese Multimodal Pretrainer
Figure 2 for M6: A Chinese Multimodal Pretrainer
Figure 3 for M6: A Chinese Multimodal Pretrainer
Figure 4 for M6: A Chinese Multimodal Pretrainer
Viaarxiv icon

EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications

Add code
Bookmark button
Alert button
Nov 23, 2020
Minghui Qiu, Peng Li, Hanjie Pan, Chengyu Wang, Ang Wang, Cen Chen, Yaliang Li, Dehong Gao, Jun Huang, Yong Li, Jun Yang, Deng Cai, Wei Lin

Figure 1 for EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications
Figure 2 for EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications
Figure 3 for EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications
Viaarxiv icon