Deming Chen

Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture

Mar 04, 2021
Seung Won Min, Kun Wu, Sitao Huang, Mert Hidayetoğlu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, Wen-mei Hwu

PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses

Jan 20, 2021
Seung Won Min, Kun Wu, Sitao Huang, Mert Hidayetoğlu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, Wen-mei Hwu

FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations

Dec 22, 2020
Yichi Zhang, Junhao Pan, Xinheng Liu, Hongzheng Chen, Deming Chen, Zhiru Zhang

Effective Algorithm-Accelerator Co-design for AI Solutions on Edge Devices

Oct 15, 2020
Cong Hao, Yao Chen, Xiaofan Zhang, Yuhong Li, Jinjun Xiong, Wen-mei Hwu, Deming Chen

VecQ: Minimal Loss DNN Model Compression With Vectorized Weight Quantization

Jun 10, 2020
Cheng Gong, Yao Chen, Ye Lu, Tao Li, Cong Hao, Deming Chen

EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions

May 06, 2020
Yuhong Li, Cong Hao, Xiaofan Zhang, Xinheng Liu, Yao Chen, Jinjun Xiong, Wen-mei Hwu, Deming Chen

HybridDNN: A Framework for High-Performance Hybrid DNN Accelerator Design and Implementation

Apr 08, 2020
Hanchen Ye, Xiaofan Zhang, Zhize Huang, Gengsheng Chen, Deming Chen

Compressing Large-Scale Transformer-Based Models: A Case Study on BERT

Feb 27, 2020
Prakhar Ganesh, Yao Chen, Xin Lou, Mohammad Ali Khan, Yin Yang, Deming Chen, Marianne Winslett, Hassan Sajjad, Preslav Nakov

AutoDNNchip: An Automated DNN Chip Predictor and Builder for Both FPGAs and ASICs

Jan 06, 2020
Pengfei Xu, Xiaofan Zhang, Cong Hao, Yang Zhao, Yongan Zhang, Yue Wang, Chaojian Li, Zetong Guan, Deming Chen, Yingyan Lin
