Alert button
Picture for Yanzhi Wang

Yanzhi Wang

Alert button

6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration

Add code
Bookmark button
Alert button
Dec 01, 2020
Zhengang Li, Geng Yuan, Wei Niu, Yanyu Li, Pu Zhao, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin

Figure 1 for 6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Figure 2 for 6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Figure 3 for 6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Figure 4 for 6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Viaarxiv icon

An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning

Add code
Bookmark button
Alert button
Nov 20, 2020
Chengming Zhang, Geng Yuan, Wei Niu, Jiannan Tian, Sian Jin, Donglin Zhuang, Zhe Jiang, Yanzhi Wang, Bin Ren, Shuaiwen Leon Song, Dingwen Tao

Figure 1 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 2 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 3 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Figure 4 for An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning
Viaarxiv icon

DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search

Add code
Bookmark button
Alert button
Nov 04, 2020
Yushuo Guan, Ning Liu, Pengyu Zhao, Zhengping Che, Kaigui Bian, Yanzhi Wang, Jian Tang

Figure 1 for DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search
Figure 2 for DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search
Figure 3 for DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search
Figure 4 for DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search
Viaarxiv icon

Simultaneous Relevance and Diversity: A New Recommendation Inference Approach

Add code
Bookmark button
Alert button
Sep 27, 2020
Yifang Liu, Zhentao Xu, Qiyuan An, Yang Yi, Yanzhi Wang, Trevor Hastie

Figure 1 for Simultaneous Relevance and Diversity: A New Recommendation Inference Approach
Figure 2 for Simultaneous Relevance and Diversity: A New Recommendation Inference Approach
Figure 3 for Simultaneous Relevance and Diversity: A New Recommendation Inference Approach
Figure 4 for Simultaneous Relevance and Diversity: A New Recommendation Inference Approach
Viaarxiv icon

MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework

Add code
Bookmark button
Alert button
Sep 16, 2020
Sung-En Chang, Yanyu Li, Mengshu Sun, Weiwen Jiang, Runbin Shi, Xue Lin, Yanzhi Wang

Figure 1 for MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Figure 2 for MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Figure 3 for MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Figure 4 for MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework
Viaarxiv icon

Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization

Add code
Bookmark button
Alert button
Sep 15, 2020
Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang

Figure 1 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 2 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 3 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Figure 4 for Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Viaarxiv icon

YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design

Add code
Bookmark button
Alert button
Sep 12, 2020
Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang

Figure 1 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 2 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 3 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Figure 4 for YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Viaarxiv icon

ESMFL: Efficient and Secure Models for Federated Learning

Add code
Bookmark button
Alert button
Sep 03, 2020
Sheng Lin, Chenghong Wang, Hongjia Li, Jieren Deng, Yanzhi Wang, Caiwen Ding

Figure 1 for ESMFL: Efficient and Secure Models for Federated Learning
Figure 2 for ESMFL: Efficient and Secure Models for Federated Learning
Figure 3 for ESMFL: Efficient and Secure Models for Federated Learning
Figure 4 for ESMFL: Efficient and Secure Models for Federated Learning
Viaarxiv icon

AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency

Add code
Bookmark button
Alert button
Aug 14, 2020
Fuxun Yu, Chenchen Liu, Di Wang, Yanzhi Wang, Xiang Chen

Figure 1 for AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Figure 2 for AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Figure 3 for AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Figure 4 for AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Viaarxiv icon

One for Many: Transfer Learning for Building HVAC Control

Add code
Bookmark button
Alert button
Aug 09, 2020
Shichao Xu, Yixuan Wang, Yanzhi Wang, Zheng O'Neill, Qi Zhu

Figure 1 for One for Many: Transfer Learning for Building HVAC Control
Figure 2 for One for Many: Transfer Learning for Building HVAC Control
Figure 3 for One for Many: Transfer Learning for Building HVAC Control
Figure 4 for One for Many: Transfer Learning for Building HVAC Control
Viaarxiv icon