Picture for Xuehai Qian

Xuehai Qian

PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning

Add code
Jan 22, 2020
Figure 1 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 2 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 3 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 4 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Viaarxiv icon

Heterogeneity-Aware Asynchronous Decentralized Training

Add code
Sep 17, 2019
Figure 1 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 2 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 3 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 4 for Heterogeneity-Aware Asynchronous Decentralized Training
Viaarxiv icon

A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology

Add code
Jul 22, 2019
Figure 1 for A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology
Figure 2 for A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology
Figure 3 for A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology
Figure 4 for A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology
Viaarxiv icon

Non-structured DNN Weight Pruning Considered Harmful

Add code
Jul 03, 2019
Figure 1 for Non-structured DNN Weight Pruning Considered Harmful
Figure 2 for Non-structured DNN Weight Pruning Considered Harmful
Figure 3 for Non-structured DNN Weight Pruning Considered Harmful
Figure 4 for Non-structured DNN Weight Pruning Considered Harmful
Viaarxiv icon

Hop: Heterogeneity-Aware Decentralized Training

Add code
Feb 07, 2019
Figure 1 for Hop: Heterogeneity-Aware Decentralized Training
Figure 2 for Hop: Heterogeneity-Aware Decentralized Training
Figure 3 for Hop: Heterogeneity-Aware Decentralized Training
Figure 4 for Hop: Heterogeneity-Aware Decentralized Training
Viaarxiv icon

HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array

Add code
Jan 07, 2019
Figure 1 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 2 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 3 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 4 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Viaarxiv icon

ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers

Add code
Dec 31, 2018
Figure 1 for ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Figure 2 for ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Figure 3 for ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Figure 4 for ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Viaarxiv icon

E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs

Add code
Dec 12, 2018
Figure 1 for E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Figure 2 for E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Figure 3 for E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Figure 4 for E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Viaarxiv icon

Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework

Add code
Feb 18, 2018
Figure 1 for Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework
Figure 2 for Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework
Figure 3 for Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework
Figure 4 for Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework
Viaarxiv icon

VIBNN: Hardware Acceleration of Bayesian Neural Networks

Add code
Feb 02, 2018
Figure 1 for VIBNN: Hardware Acceleration of Bayesian Neural Networks
Figure 2 for VIBNN: Hardware Acceleration of Bayesian Neural Networks
Figure 3 for VIBNN: Hardware Acceleration of Bayesian Neural Networks
Figure 4 for VIBNN: Hardware Acceleration of Bayesian Neural Networks
Viaarxiv icon