
What's Hidden in a One-layer Randomly Weighted Transformer?


Sep 08, 2021
Sheng Shen, Zhewei Yao, Douwe Kiela, Kurt Keutzer, Michael W. Mahoney

* EMNLP 2021 (short) 


How Much Can CLIP Benefit Vision-and-Language Tasks?


Jul 13, 2021
Sheng Shen, Liunian Harold Li, Hao Tan, Mohit Bansal, Anna Rohrbach, Kai-Wei Chang, Zhewei Yao, Kurt Keutzer

* 14 pages 


Learned Token Pruning for Transformers


Jul 02, 2021
Sehoon Kim, Sheng Shen, David Thorsley, Amir Gholami, Joseph Hassoun, Kurt Keutzer



MLPruning: A Multilevel Structured Pruning Framework for Transformer-based Models


May 30, 2021
Zhewei Yao, Linjian Ma, Sheng Shen, Kurt Keutzer, Michael W. Mahoney

* 20 pages, 4 figures, 9 tables 


Reservoir Transformer


Dec 30, 2020
Sheng Shen, Alexei Baevski, Ari S. Morcos, Kurt Keutzer, Michael Auli, Douwe Kiela



MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding


Oct 12, 2020
Qinxin Wang, Hao Tan, Sheng Shen, Michael W. Mahoney, Zhewei Yao



Noisy Self-Knowledge Distillation for Text Summarization


Sep 15, 2020
Yang Liu, Sheng Shen, Mirella Lapata



Differentially Private Multi-Agent Planning for Logistic-like Problems


Aug 16, 2020
Dayong Ye, Tianqing Zhu, Sheng Shen, Wanlei Zhou, Philip S. Yu



ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning


Jun 01, 2020
Zhewei Yao, Amir Gholami, Sheng Shen, Kurt Keutzer, Michael W. Mahoney



Rethinking Batch Normalization in Transformers


Mar 17, 2020
Sheng Shen, Zhewei Yao, Amir Gholami, Michael Mahoney, Kurt Keutzer



Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers


Feb 26, 2020
Zhuohan Li, Eric Wallace, Sheng Shen, Kevin Lin, Kurt Keutzer, Dan Klein, Joseph E. Gonzalez



Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT


Sep 25, 2019
Sheng Shen, Zhen Dong, Jiayu Ye, Linjian Ma, Zhewei Yao, Amir Gholami, Michael W. Mahoney, Kurt Keutzer



Pragmatically Informative Text Generation


Apr 04, 2019
Sheng Shen, Daniel Fried, Jacob Andreas, Dan Klein

* 8 pages. Accepted as a conference paper at NAACL 2019 (short paper)


On the Generation of Medical Question-Answer Pairs


Nov 01, 2018
Sheng Shen, Yaliang Li, Nan Du, Xian Wu, Yusheng Xie, Shen Ge, Tao Yang, Kai Wang, Xingzheng Liang, Wei Fan

