Alert button
Picture for Zhihua Wu

Zhihua Wu

Alert button

Code Comparison Tuning for Code Large Language Models

Add code
Bookmark button
Alert button
Mar 28, 2024
Yufan Jiang, Qiaozhi He, Xiaomin Zhuang, Zhihua Wu

Viaarxiv icon

RecycleGPT: An Autoregressive Language Model with Recyclable Module

Add code
Bookmark button
Alert button
Aug 08, 2023
Yufan Jiang, Qiaozhi He, Xiaomin Zhuang, Zhihua Wu, Kunpeng Wang, Wenlai Zhao, Guangwen Yang

Figure 1 for RecycleGPT: An Autoregressive Language Model with Recyclable Module
Figure 2 for RecycleGPT: An Autoregressive Language Model with Recyclable Module
Figure 3 for RecycleGPT: An Autoregressive Language Model with Recyclable Module
Figure 4 for RecycleGPT: An Autoregressive Language Model with Recyclable Module
Viaarxiv icon

TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training

Add code
Bookmark button
Alert button
Feb 20, 2023
Chang Chen, Min Li, Zhihua Wu, Dianhai Yu, Chao Yang

Figure 1 for TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training
Figure 2 for TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training
Figure 3 for TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training
Figure 4 for TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training
Viaarxiv icon

HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

Add code
Bookmark button
Alert button
Jul 13, 2022
Guoxia Wang, Xiaomin Fang, Zhihua Wu, Yiqun Liu, Yang Xue, Yingfei Xiang, Dianhai Yu, Fan Wang, Yanjun Ma

Figure 1 for HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle
Figure 2 for HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle
Figure 3 for HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle
Figure 4 for HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle
Viaarxiv icon

SE-MoE: A Scalable and Efficient Mixture-of-Experts Distributed Training and Inference System

Add code
Bookmark button
Alert button
May 20, 2022
Liang Shen, Zhihua Wu, WeiBao Gong, Hongxiang Hao, Yangfan Bai, HuaChao Wu, Xinxuan Wu, Haoyi Xiong, Dianhai Yu, Yanjun Ma

Viaarxiv icon

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

Add code
Bookmark button
Alert button
May 19, 2022
Yang Xiang, Zhihua Wu, Weibao Gong, Siyu Ding, Xianjie Mo, Yuang Liu, Shuohuan Wang, Peng Liu, Yongshuai Hou, Long Li, Bin Wang, Shaohuai Shi, Yaqian Han, Yue Yu, Ge Li, Yu Sun, Yanjun Ma, Dianhai Yu

Figure 1 for Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Figure 2 for Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Figure 3 for Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Figure 4 for Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Viaarxiv icon

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Add code
Bookmark button
Alert button
Dec 31, 2021
Han Zhang, Weichong Yin, Yewei Fang, Lanxin Li, Boqiang Duan, Zhihua Wu, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Figure 1 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 2 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 3 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 4 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Viaarxiv icon

ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

Add code
Bookmark button
Alert button
Dec 23, 2021
Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang

Figure 1 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 2 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 3 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 4 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Viaarxiv icon

End-to-end Adaptive Distributed Training on PaddlePaddle

Add code
Bookmark button
Alert button
Dec 06, 2021
Yulong Ao, Zhihua Wu, Dianhai Yu, Weibao Gong, Zhiqing Kui, Minxu Zhang, Zilingfeng Ye, Liang Shen, Yanjun Ma, Tian Wu, Haifeng Wang, Wei Zeng, Chao Yang

Figure 1 for End-to-end Adaptive Distributed Training on PaddlePaddle
Figure 2 for End-to-end Adaptive Distributed Training on PaddlePaddle
Figure 3 for End-to-end Adaptive Distributed Training on PaddlePaddle
Figure 4 for End-to-end Adaptive Distributed Training on PaddlePaddle
Viaarxiv icon