Yangjie Zhou

Accelerating Generic Graph Neural Networks via Architecture, Compiler, Partition Method Co-Design

Aug 16, 2023
Via arXiv

AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs

May 27, 2023
Via arXiv

Efficient Activation Quantization via Adaptive Rounding Border for Post-Training Quantization

Aug 25, 2022
Via arXiv

Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration

Feb 18, 2020
Via arXiv