Alert button
Picture for Shenggan Cheng

Shenggan Cheng

Alert button

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers

Add code
Bookmark button
Alert button
Mar 15, 2024
Xuanlei Zhao, Shenggan Cheng, Zangwei Zheng, Zheming Yang, Ziming Liu, Yang You

Figure 1 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 2 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 3 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 4 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Viaarxiv icon

AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference

Add code
Bookmark button
Alert button
Jan 19, 2024
Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Jiarui Fang, Haotian Zhou, Bin Jia, Ziming Liu, Yang You

Viaarxiv icon

FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Add code
Bookmark button
Alert button
Mar 04, 2022
Shenggan Cheng, Ruidong Wu, Zhongming Yu, Binrui Li, Xiwen Zhang, Jian Peng, Yang You

Figure 1 for FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours
Figure 2 for FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours
Figure 3 for FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours
Figure 4 for FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours
Viaarxiv icon