Picture for Xuehai Qian

Xuehai Qian

HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation

Add code
May 04, 2021
Figure 1 for HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation
Figure 2 for HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation
Figure 3 for HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation
Figure 4 for HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation
Viaarxiv icon

Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework

Add code
Dec 12, 2020
Figure 1 for Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Figure 2 for Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Figure 3 for Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Figure 4 for Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Viaarxiv icon

PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices

Add code
Apr 23, 2020
Figure 1 for PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices
Figure 2 for PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices
Figure 3 for PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices
Figure 4 for PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices
Viaarxiv icon

PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning

Add code
Jan 22, 2020
Figure 1 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 2 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 3 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 4 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Viaarxiv icon

Heterogeneity-Aware Asynchronous Decentralized Training

Add code
Sep 17, 2019
Figure 1 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 2 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 3 for Heterogeneity-Aware Asynchronous Decentralized Training
Figure 4 for Heterogeneity-Aware Asynchronous Decentralized Training
Viaarxiv icon

A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology

Add code
Jul 22, 2019
Figure 1 for A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology
Figure 2 for A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology
Figure 3 for A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology
Figure 4 for A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology
Viaarxiv icon

Non-structured DNN Weight Pruning Considered Harmful

Add code
Jul 03, 2019
Figure 1 for Non-structured DNN Weight Pruning Considered Harmful
Figure 2 for Non-structured DNN Weight Pruning Considered Harmful
Figure 3 for Non-structured DNN Weight Pruning Considered Harmful
Figure 4 for Non-structured DNN Weight Pruning Considered Harmful
Viaarxiv icon

Hop: Heterogeneity-Aware Decentralized Training

Add code
Feb 07, 2019
Figure 1 for Hop: Heterogeneity-Aware Decentralized Training
Figure 2 for Hop: Heterogeneity-Aware Decentralized Training
Figure 3 for Hop: Heterogeneity-Aware Decentralized Training
Figure 4 for Hop: Heterogeneity-Aware Decentralized Training
Viaarxiv icon

HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array

Add code
Jan 07, 2019
Figure 1 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 2 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 3 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Figure 4 for HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Viaarxiv icon

ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers

Add code
Dec 31, 2018
Figure 1 for ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Figure 2 for ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Figure 3 for ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Figure 4 for ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Viaarxiv icon