Picture for Huizi Mao

Huizi Mao

VILA: On Pre-training for Visual Language Models

Add code
Dec 14, 2023
Figure 1 for VILA: On Pre-training for Visual Language Models
Figure 2 for VILA: On Pre-training for Visual Language Models
Figure 3 for VILA: On Pre-training for Visual Language Models
Figure 4 for VILA: On Pre-training for Visual Language Models
Viaarxiv icon

BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Add code
May 26, 2022
Figure 1 for BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Figure 2 for BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Figure 3 for BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Figure 4 for BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Viaarxiv icon

PatchNet -- Short-range Template Matching for Efficient Video Processing

Add code
Mar 10, 2021
Figure 1 for PatchNet -- Short-range Template Matching for Efficient Video Processing
Figure 2 for PatchNet -- Short-range Template Matching for Efficient Video Processing
Figure 3 for PatchNet -- Short-range Template Matching for Efficient Video Processing
Figure 4 for PatchNet -- Short-range Template Matching for Efficient Video Processing
Viaarxiv icon

A Delay Metric for Video Object Detection: What Average Precision Fails to Tell

Add code
Aug 18, 2019
Figure 1 for A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Figure 2 for A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Figure 3 for A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Figure 4 for A Delay Metric for Video Object Detection: What Average Precision Fails to Tell
Viaarxiv icon

CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video

Add code
Sep 30, 2018
Figure 1 for CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video
Figure 2 for CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video
Figure 3 for CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video
Figure 4 for CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video
Viaarxiv icon

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training

Add code
Feb 05, 2018
Figure 1 for Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Figure 2 for Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Figure 3 for Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Figure 4 for Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Viaarxiv icon

Exploring the Regularity of Sparse Structure in Convolutional Neural Networks

Add code
Jun 05, 2017
Figure 1 for Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Figure 2 for Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Figure 3 for Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Figure 4 for Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Viaarxiv icon

Trained Ternary Quantization

Add code
Feb 23, 2017
Figure 1 for Trained Ternary Quantization
Figure 2 for Trained Ternary Quantization
Figure 3 for Trained Ternary Quantization
Figure 4 for Trained Ternary Quantization
Viaarxiv icon

DSD: Dense-Sparse-Dense Training for Deep Neural Networks

Add code
Feb 21, 2017
Figure 1 for DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Figure 2 for DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Figure 3 for DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Figure 4 for DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Viaarxiv icon

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

Add code
Feb 20, 2017
Figure 1 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 2 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 3 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 4 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Viaarxiv icon