Picture for Hongzheng Chen

Hongzheng Chen

ByteDance, Cornell University

Allo: A Programming Model for Composable Accelerator Design

Add code
Apr 07, 2024
Figure 1 for Allo: A Programming Model for Composable Accelerator Design
Figure 2 for Allo: A Programming Model for Composable Accelerator Design
Figure 3 for Allo: A Programming Model for Composable Accelerator Design
Figure 4 for Allo: A Programming Model for Composable Accelerator Design
Viaarxiv icon

Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference

Add code
Dec 23, 2023
Figure 1 for Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference
Figure 2 for Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference
Figure 3 for Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference
Figure 4 for Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference
Viaarxiv icon

Decoupled Model Schedule for Deep Learning Training

Add code
Feb 16, 2023
Figure 1 for Decoupled Model Schedule for Deep Learning Training
Figure 2 for Decoupled Model Schedule for Deep Learning Training
Figure 3 for Decoupled Model Schedule for Deep Learning Training
Figure 4 for Decoupled Model Schedule for Deep Learning Training
Viaarxiv icon

Structured Pruning is All You Need for Pruning CNNs at Initialization

Add code
Mar 04, 2022
Figure 1 for Structured Pruning is All You Need for Pruning CNNs at Initialization
Figure 2 for Structured Pruning is All You Need for Pruning CNNs at Initialization
Figure 3 for Structured Pruning is All You Need for Pruning CNNs at Initialization
Figure 4 for Structured Pruning is All You Need for Pruning CNNs at Initialization
Viaarxiv icon

BGL: GPU-Efficient GNN Training by Optimizing Graph Data I/O and Preprocessing

Add code
Dec 16, 2021
Figure 1 for BGL: GPU-Efficient GNN Training by Optimizing Graph Data I/O and Preprocessing
Figure 2 for BGL: GPU-Efficient GNN Training by Optimizing Graph Data I/O and Preprocessing
Figure 3 for BGL: GPU-Efficient GNN Training by Optimizing Graph Data I/O and Preprocessing
Figure 4 for BGL: GPU-Efficient GNN Training by Optimizing Graph Data I/O and Preprocessing
Viaarxiv icon

FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations

Add code
Dec 22, 2020
Figure 1 for FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
Figure 2 for FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
Figure 3 for FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
Figure 4 for FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
Viaarxiv icon