
William J. Dally


Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training

Jun 13, 2022
Charbel Sakr, Steve Dai, Rangharajan Venkatesan, Brian Zimmer, William J. Dally, Brucek Khailany

Via arXiv

PatchNet -- Short-range Template Matching for Efficient Video Processing

Mar 10, 2021
Huizi Mao, Sibo Zhu, Song Han, William J. Dally

Via arXiv

VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference

Feb 08, 2021
Steve Dai, Rangharajan Venkatesan, Haoxing Ren, Brian Zimmer, William J. Dally, Brucek Khailany

Via arXiv

A Delay Metric for Video Object Detection: What Average Precision Fails to Tell

Aug 18, 2019
Huizi Mao, Xiaodong Yang, William J. Dally

Via arXiv

CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video

Sep 30, 2018
Huizi Mao, Taeyoung Kong, William J. Dally

Via arXiv

Efficient Sparse-Winograd Convolutional Neural Networks

Feb 18, 2018
Xingyu Liu, Jeff Pool, Song Han, William J. Dally

Via arXiv

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training

Feb 05, 2018
Yujun Lin, Song Han, Huizi Mao, Yu Wang, William J. Dally

Via arXiv

Exploring the Regularity of Sparse Structure in Convolutional Neural Networks

Jun 05, 2017
Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, William J. Dally

Via arXiv

SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks

May 23, 2017
Angshuman Parashar, Minsoo Rhu, Anurag Mukkara, Antonio Puglielli, Rangharajan Venkatesan, Brucek Khailany, Joel Emer, Stephen W. Keckler, William J. Dally

Via arXiv

Trained Ternary Quantization

Feb 23, 2017
Chenzhuo Zhu, Song Han, Huizi Mao, William J. Dally

Via arXiv