Picture for Yanzhi Wang

Yanzhi Wang

6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration

Add code
Dec 01, 2020
Figure 1 for 6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Figure 2 for 6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Figure 3 for 6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Figure 4 for 6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Viaarxiv icon

An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning

Add code
Nov 20, 2020
Figure 1 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 2 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 3 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 4 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Viaarxiv icon

DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search

Add code
Nov 04, 2020
Figure 1 for DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search
Figure 2 for DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search
Figure 3 for DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search
Figure 4 for DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search
Viaarxiv icon

Simultaneous Relevance and Diversity: A New Recommendation Inference Approach

Add code
Sep 27, 2020
Figure 1 for Simultaneous Relevance and Diversity: A New Recommendation Inference Approach
Figure 2 for Simultaneous Relevance and Diversity: A New Recommendation Inference Approach
Figure 3 for Simultaneous Relevance and Diversity: A New Recommendation Inference Approach
Figure 4 for Simultaneous Relevance and Diversity: A New Recommendation Inference Approach
Viaarxiv icon

MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework

Add code
Sep 16, 2020
Figure 1 for MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Figure 2 for MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Figure 3 for MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Figure 4 for MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Viaarxiv icon

Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization

Add code
Sep 15, 2020
Figure 1 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 2 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 3 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 4 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Viaarxiv icon

YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design

Add code
Sep 12, 2020
Figure 1 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 2 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 3 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 4 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Viaarxiv icon

ESMFL: Efficient and Secure Models for Federated Learning

Add code
Sep 03, 2020
Figure 1 for ESMFL: Efficient and Secure Models for Federated Learning
Figure 2 for ESMFL: Efficient and Secure Models for Federated Learning
Figure 3 for ESMFL: Efficient and Secure Models for Federated Learning
Figure 4 for ESMFL: Efficient and Secure Models for Federated Learning
Viaarxiv icon

AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency

Add code
Aug 14, 2020
Figure 1 for AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Figure 2 for AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Figure 3 for AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Figure 4 for AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Viaarxiv icon

One for Many: Transfer Learning for Building HVAC Control

Add code
Aug 09, 2020
Figure 1 for One for Many: Transfer Learning for Building HVAC Control
Figure 2 for One for Many: Transfer Learning for Building HVAC Control
Figure 3 for One for Many: Transfer Learning for Building HVAC Control
Figure 4 for One for Many: Transfer Learning for Building HVAC Control
Viaarxiv icon