A Delay Metric for Video Object Detection: What Average Precision Fails to Tell

Aug 18, 2019
Huizi Mao, Xiaodong Yang, William J. Dally

* ICCV 2019 

CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video

Sep 30, 2018
Huizi Mao, Taeyoung Kong, William J. Dally


Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training

Feb 05, 2018
Yujun Lin, Song Han, Huizi Mao, Yu Wang, William J. Dally

* ICLR 2018 
* We find that 99.9% of the gradient exchange in distributed SGD is redundant, and we reduce the communication bandwidth by two orders of magnitude without losing accuracy

Exploring the Regularity of Sparse Structure in Convolutional Neural Networks

Jun 05, 2017
Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, William J. Dally

* Submitted to NIPS 2017

Trained Ternary Quantization

Feb 23, 2017
Chenzhuo Zhu, Song Han, Huizi Mao, William J. Dally

* Accepted for poster presentation at ICLR 2017

DSD: Dense-Sparse-Dense Training for Deep Neural Networks

Feb 21, 2017
Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Enhao Gong, Shijian Tang, Erich Elsen, Peter Vajda, Manohar Paluri, John Tran, Bryan Catanzaro, William J. Dally

* Published as a conference paper at ICLR 2017 

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

Feb 20, 2017
Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

* Accepted as a full paper at FPGA'17, Monterey, CA; also appeared at the 1st International Workshop on Efficient Methods for Deep Neural Networks at NIPS 2016, Barcelona, Spain

EIE: Efficient Inference Engine on Compressed Deep Neural Network

May 03, 2016
Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally

* External Links: TheNextPlatform: http://goo.gl/f7qX0L ; O'Reilly: https://goo.gl/Id1HNT ; Hacker News: https://goo.gl/KM72SV ; Embedded-vision: http://goo.gl/joQNg8 ; Talk at NVIDIA GTC'16: http://goo.gl/6wJYvn ; Talk at Embedded Vision Summit: https://goo.gl/7abFNe ; Talk at Stanford University: https://goo.gl/6lwuer. Published as a conference paper in ISCA 2016 

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Feb 15, 2016
Song Han, Huizi Mao, William J. Dally

* Published as a conference paper at ICLR 2016 (oral) 
