Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Huizi Mao

PatchNet -- Short-range Template Matching for Efficient Video Processing


Mar 10, 2021
Huizi Mao, Sibo Zhu, Song Han, William J. Dally


  Access Paper or Ask Questions

A Delay Metric for Video Object Detection: What Average Precision Fails to Tell


Aug 18, 2019
Huizi Mao, Xiaodong Yang, William J. Dally

* ICCV 2019 

  Access Paper or Ask Questions

CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video


Sep 30, 2018
Huizi Mao, Taeyoung Kong, William J. Dally


  Access Paper or Ask Questions

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training


Feb 05, 2018
Yujun Lin, Song Han, Huizi Mao, Yu Wang, William J. Dally

* ICLR 2018 
* we find 99.9% of the gradient exchange in distributed SGD is redundant; we reduce the communication bandwidth by two orders of magnitude without losing accuracy 

  Access Paper or Ask Questions

Exploring the Regularity of Sparse Structure in Convolutional Neural Networks


Jun 05, 2017
Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, William J. Dally

* submitted to NIPS 2017 

  Access Paper or Ask Questions

Trained Ternary Quantization


Feb 23, 2017
Chenzhuo Zhu, Song Han, Huizi Mao, William J. Dally

* Accepted for Poster Presentation on ICLR 2017 

  Access Paper or Ask Questions

DSD: Dense-Sparse-Dense Training for Deep Neural Networks


Feb 21, 2017
Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Enhao Gong, Shijian Tang, Erich Elsen, Peter Vajda, Manohar Paluri, John Tran, Bryan Catanzaro, William J. Dally

* Published as a conference paper at ICLR 2017 

  Access Paper or Ask Questions

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA


Feb 20, 2017
Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

* Accepted as full paper in FPGA'17, Monterey, CA; Also appeared at 1st International Workshop on Efficient Methods for Deep Neural Networks at NIPS 2016, Barcelona, Spain 

  Access Paper or Ask Questions

EIE: Efficient Inference Engine on Compressed Deep Neural Network


May 03, 2016
Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally

* External Links: TheNextPlatform: http://goo.gl/f7qX0L ; O'Reilly: https://goo.gl/Id1HNT ; Hacker News: https://goo.gl/KM72SV ; Embedded-vision: http://goo.gl/joQNg8 ; Talk at NVIDIA GTC'16: http://goo.gl/6wJYvn ; Talk at Embedded Vision Summit: https://goo.gl/7abFNe ; Talk at Stanford University: https://goo.gl/6lwuer. Published as a conference paper in ISCA 2016 

  Access Paper or Ask Questions

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding


Feb 15, 2016
Song Han, Huizi Mao, William J. Dally

* Published as a conference paper at ICLR 2016 (oral) 

  Access Paper or Ask Questions