Shuai Zheng

DualFluidNet: an Attention-based Dual-pipeline Network for Accurate and Generalizable Fluid-solid Coupled Simulation

Dec 28, 2023
Yu Chen, Shuai Zheng, Menglong Jin, Yan Chang, Nianyi Wang

Contractive error feedback for gradient compression

Dec 13, 2023
Bingcong Li, Shuai Zheng, Parameswaran Raman, Anshumali Shrivastava, Georgios B. Giannakis

DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines

Nov 17, 2023
Chenyu Jiang, Zhen Jia, Shuai Zheng, Yida Wang, Chuan Wu

Unleashing the potential of GNNs via Bi-directional Knowledge Transfer

Oct 26, 2023
Shuai Zheng, Zhizhe Liu, Zhenfeng Zhu, Xingxing Zhang, Jianxin Li, Yao Zhao

Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens

May 07, 2023
Zhanpeng Zeng, Cole Hawkins, Mingyi Hong, Aston Zhang, Nikolaos Pappas, Vikas Singh, Shuai Zheng

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition

Apr 10, 2023
Shuhuai Ren, Aston Zhang, Yi Zhu, Shuai Zhang, Shuai Zheng, Mu Li, Alex Smola, Xu Sun

Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation with Cross-Scale Distortion Awareness

Mar 04, 2023
Zhijie Shen, Zishuo Zheng, Chunyu Lin, Lang Nie, Kang Liao, Shuai Zheng, Yao Zhao

Decoupled Model Schedule for Deep Learning Training

Feb 16, 2023
Hongzheng Chen, Cody Hao Yu, Shuai Zheng, Zhen Zhang, Zhiru Zhang, Yida Wang

SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning

Dec 21, 2022
M Saiful Bari, Aston Zhang, Shuai Zheng, Xingjian Shi, Yi Zhu, Shafiq Joty, Mu Li

SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing

Dec 10, 2022
Chaoyang He, Shuai Zheng, Aston Zhang, George Karypis, Trishul Chilimbi, Mahdi Soltanolkotabi, Salman Avestimehr
