Picture for Deming Chen

Deming Chen

Celine

HiKonv: High Throughput Quantized Convolution With Novel Bit-wise Management and Computation

Add code
Dec 28, 2021
Figure 1 for HiKonv: High Throughput Quantized Convolution With Novel Bit-wise Management and Computation
Figure 2 for HiKonv: High Throughput Quantized Convolution With Novel Bit-wise Management and Computation
Figure 3 for HiKonv: High Throughput Quantized Convolution With Novel Bit-wise Management and Computation
Figure 4 for HiKonv: High Throughput Quantized Convolution With Novel Bit-wise Management and Computation
Viaarxiv icon

EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search

Add code
Nov 24, 2021
Figure 1 for EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search
Figure 2 for EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search
Figure 3 for EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search
Figure 4 for EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search
Viaarxiv icon

YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs

Add code
Oct 26, 2021
Figure 1 for YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs
Figure 2 for YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs
Figure 3 for YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs
Figure 4 for YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs
Viaarxiv icon

Generic Neural Architecture Search via Regression

Add code
Aug 04, 2021
Figure 1 for Generic Neural Architecture Search via Regression
Figure 2 for Generic Neural Architecture Search via Regression
Figure 3 for Generic Neural Architecture Search via Regression
Figure 4 for Generic Neural Architecture Search via Regression
Viaarxiv icon

WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAs

Add code
Jul 09, 2021
Figure 1 for WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAs
Figure 2 for WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAs
Figure 3 for WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAs
Figure 4 for WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAs
Viaarxiv icon

Software/Hardware Co-design for Multi-modal Multi-task Learning in Autonomous Systems

Add code
Apr 08, 2021
Figure 1 for Software/Hardware Co-design for Multi-modal Multi-task Learning in Autonomous Systems
Figure 2 for Software/Hardware Co-design for Multi-modal Multi-task Learning in Autonomous Systems
Figure 3 for Software/Hardware Co-design for Multi-modal Multi-task Learning in Autonomous Systems
Viaarxiv icon

Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design

Add code
Mar 30, 2021
Figure 1 for Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design
Figure 2 for Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design
Figure 3 for Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design
Figure 4 for Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design
Viaarxiv icon

F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding

Add code
Mar 08, 2021
Figure 1 for F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding
Figure 2 for F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding
Figure 3 for F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding
Figure 4 for F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding
Viaarxiv icon

Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture

Add code
Mar 04, 2021
Figure 1 for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture
Figure 2 for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture
Figure 3 for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture
Figure 4 for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture
Viaarxiv icon

PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses

Add code
Jan 20, 2021
Figure 1 for PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses
Figure 2 for PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses
Figure 3 for PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses
Figure 4 for PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses
Viaarxiv icon