An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning

Nov 20, 2020
Chengming Zhang, Geng Yuan, Wei Niu, Jiannan Tian, Sian Jin, Donglin Zhuang, Zhe Jiang, Yanzhi Wang, Bin Ren, Shuaiwen Leon Song, Dingwen Tao

* 11 pages, 13 figures, 2 tables 

A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression

Nov 18, 2020
Sian Jin, Guanpeng Li, Shuaiwen Leon Song, Dingwen Tao

* 11 pages, 11 figures, 1 table, accepted by PPoPP '21 as a poster 

Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design

Nov 07, 2019
Xingyao Zhang, Shuaiwen Leon Song, Chenhao Xie, Jing Wang, Weigong Zhang, Xin Fu

* To appear in the 2020 26th International Symposium on High-Performance Computer Architecture (HPCA 2020) 

SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks

Jan 13, 2018
Linnan Wang, Jinmian Ye, Yiyang Zhao, Wei Wu, Ang Li, Shuaiwen Leon Song, Zenglin Xu, Tim Kraska

* PPoPP '18: 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming

