Alert button
Picture for Huizi Mao

Huizi Mao

Alert button

VILA: On Pre-training for Visual Language Models

Add code
Bookmark button
Alert button
Dec 14, 2023
Ji Lin, Hongxu Yin, Wei Ping, Yao Lu, Pavlo Molchanov, Andrew Tao, Huizi Mao, Jan Kautz, Mohammad Shoeybi, Song Han

Figure 1 for VILA: On Pre-training for Visual Language Models
Figure 2 for VILA: On Pre-training for Visual Language Models
Figure 3 for VILA: On Pre-training for Visual Language Models
Figure 4 for VILA: On Pre-training for Visual Language Models
Viaarxiv icon

BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Add code
Bookmark button
Alert button
May 26, 2022
Zhijian Liu, Haotian Tang, Alexander Amini, Xinyu Yang, Huizi Mao, Daniela Rus, Song Han

Figure 1 for BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Figure 2 for BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Figure 3 for BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Figure 4 for BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Viaarxiv icon

PatchNet -- Short-range Template Matching for Efficient Video Processing

Add code
Bookmark button
Alert button
Mar 10, 2021
Huizi Mao, Sibo Zhu, Song Han, William J. Dally

Figure 1 for PatchNet -- Short-range Template Matching for Efficient Video Processing
Figure 2 for PatchNet -- Short-range Template Matching for Efficient Video Processing
Figure 3 for PatchNet -- Short-range Template Matching for Efficient Video Processing
Figure 4 for PatchNet -- Short-range Template Matching for Efficient Video Processing
Viaarxiv icon

A Delay Metric for Video Object Detection: What Average Precision Fails to Tell

Add code
Bookmark button
Alert button
Aug 18, 2019
Huizi Mao, Xiaodong Yang, William J. Dally

Figure 1 for A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Figure 2 for A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Figure 3 for A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Figure 4 for A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Viaarxiv icon

CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video

Add code
Bookmark button
Alert button
Sep 30, 2018
Huizi Mao, Taeyoung Kong, William J. Dally

Figure 1 for CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video
Figure 2 for CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video
Figure 3 for CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video
Figure 4 for CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video
Viaarxiv icon

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training

Add code
Bookmark button
Alert button
Feb 05, 2018
Yujun Lin, Song Han, Huizi Mao, Yu Wang, William J. Dally

Figure 1 for Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Figure 2 for Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Figure 3 for Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Figure 4 for Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Viaarxiv icon

Exploring the Regularity of Sparse Structure in Convolutional Neural Networks

Add code
Bookmark button
Alert button
Jun 05, 2017
Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, William J. Dally

Figure 1 for Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Figure 2 for Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Figure 3 for Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Figure 4 for Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Viaarxiv icon

Trained Ternary Quantization

Add code
Bookmark button
Alert button
Feb 23, 2017
Chenzhuo Zhu, Song Han, Huizi Mao, William J. Dally

Figure 1 for Trained Ternary Quantization
Figure 2 for Trained Ternary Quantization
Figure 3 for Trained Ternary Quantization
Figure 4 for Trained Ternary Quantization
Viaarxiv icon

DSD: Dense-Sparse-Dense Training for Deep Neural Networks

Add code
Bookmark button
Alert button
Feb 21, 2017
Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Enhao Gong, Shijian Tang, Erich Elsen, Peter Vajda, Manohar Paluri, John Tran, Bryan Catanzaro, William J. Dally

Figure 1 for DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Figure 2 for DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Figure 3 for DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Figure 4 for DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Viaarxiv icon

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

Add code
Bookmark button
Alert button
Feb 20, 2017
Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

Figure 1 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 2 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 3 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 4 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Viaarxiv icon