Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Transkimmer: Transformer Learns to Layer-wise Skim



Yue Guan , Zhengyi Li , Jingwen Leng , Zhouhan Lin , Minyi Guo

* Published as a conference paper at ACL 2022 

   Access Paper or Ask Questions

SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation



Cong Guo , Yuxian Qiu , Jingwen Leng , Xiaotian Gao , Chen Zhang , Yunxin Liu , Fan Yang , Yuhao Zhu , Minyi Guo

* 18 pages, 2 figures, ICLR 2022 

   Access Paper or Ask Questions

Block-Skim: Efficient Question Answering for Transformer



Yue Guan , Zhengyi Li , Jingwen Leng , Zhouhan Lin , Minyi Guo , Yuhao Zhu

* Published as a conference paper at AAAI 2022 

   Access Paper or Ask Questions

Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection



Shulai Zhang , Zirui Li , Quan Chen , Wenli Zheng , Jingwen Leng , Minyi Guo

* 10 pages 

   Access Paper or Ask Questions

Dual-side Sparse Tensor Core



Yang Wang , Chen Zhang , Zhiqiang Xie , Cong Guo , Yunxin Liu , Jingwen Leng


   Access Paper or Ask Questions

How Far Does BERT Look At:Distance-based Clustering and Analysis of BERT$'$s Attention



Yue Guan , Jingwen Leng , Chao Li , Quan Chen , Minyi Guo


   Access Paper or Ask Questions

Architectural Implications of Graph Neural Networks



Zhihui Zhang , Jingwen Leng , Lingxiao Ma , Youshan Miao , Chao Li , Minyi Guo

* in IEEE Computer Architecture Letters, vol. 19, no. 1, pp. 59-62, 1 Jan.-June 2020 
* 4 pages, published in IEEE Computer Architecture Letters (CAL) 2020 

   Access Paper or Ask Questions

Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity



Cong Guo , Bo Yang Hsueh , Jingwen Leng , Yuxian Qiu , Yue Guan , Zehuan Wang , Xiaoying Jia , Xipeng Li , Minyi Guo , Yuhao Zhu

* 12pages, ACM/IEEE Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC20) 

   Access Paper or Ask Questions

Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration



Cong Guo , Yangjie Zhou , Jingwen Leng , Yuhao Zhu , Zidong Du , Quan Chen , Chao Li , Minyi Guo , Bin Yao

* Accepted by DAC2020 

   Access Paper or Ask Questions

Adversarial Defense Through Network Profiling Based Path Extraction



Yuxian Qiu , Jingwen Leng , Cong Guo , Quan Chen , Chao Li , Minyi Guo , Yuhao Zhu


   Access Paper or Ask Questions

1
2
>>