Alert button
Picture for Qijing Huang

Qijing Huang

Alert button

Full Stack Optimization of Transformer Inference: a Survey

Add code
Bookmark button
Alert button
Feb 27, 2023
Sehoon Kim, Coleman Hooper, Thanakul Wattanawong, Minwoo Kang, Ruohan Yan, Hasan Genc, Grace Dinh, Qijing Huang, Kurt Keutzer, Michael W. Mahoney, Yakun Sophia Shao, Amir Gholami

Figure 1 for Full Stack Optimization of Transformer Inference: a Survey
Figure 2 for Full Stack Optimization of Transformer Inference: a Survey
Figure 3 for Full Stack Optimization of Transformer Inference: a Survey
Figure 4 for Full Stack Optimization of Transformer Inference: a Survey
Viaarxiv icon

CoSA: Scheduling by Constrained Optimization for Spatial Accelerators

Add code
Bookmark button
Alert button
May 05, 2021
Qijing Huang, Minwoo Kang, Grace Dinh, Thomas Norell, Aravind Kalaiah, James Demmel, John Wawrzynek, Yakun Sophia Shao

Figure 1 for CoSA: Scheduling by Constrained Optimization for Spatial Accelerators
Figure 2 for CoSA: Scheduling by Constrained Optimization for Spatial Accelerators
Figure 3 for CoSA: Scheduling by Constrained Optimization for Spatial Accelerators
Figure 4 for CoSA: Scheduling by Constrained Optimization for Spatial Accelerators
Viaarxiv icon

HAO: Hardware-aware neural Architecture Optimization for Efficient Inference

Add code
Bookmark button
Alert button
Apr 26, 2021
Zhen Dong, Yizhao Gao, Qijing Huang, John Wawrzynek, Hayden K. H. So, Kurt Keutzer

Figure 1 for HAO: Hardware-aware neural Architecture Optimization for Efficient Inference
Figure 2 for HAO: Hardware-aware neural Architecture Optimization for Efficient Inference
Figure 3 for HAO: Hardware-aware neural Architecture Optimization for Efficient Inference
Figure 4 for HAO: Hardware-aware neural Architecture Optimization for Efficient Inference
Viaarxiv icon

HAWQV3: Dyadic Neural Network Quantization

Add code
Bookmark button
Alert button
Nov 20, 2020
Zhewei Yao, Zhen Dong, Zhangcheng Zheng, Amir Gholami, Jiali Yu, Eric Tan, Leyuan Wang, Qijing Huang, Yida Wang, Michael W. Mahoney, Kurt Keutzer

Figure 1 for HAWQV3: Dyadic Neural Network Quantization
Figure 2 for HAWQV3: Dyadic Neural Network Quantization
Figure 3 for HAWQV3: Dyadic Neural Network Quantization
Figure 4 for HAWQV3: Dyadic Neural Network Quantization
Viaarxiv icon

CoDeNet: Algorithm-hardware Co-design for Deformable Convolution

Add code
Bookmark button
Alert button
Jun 12, 2020
Zhen Dong, Dequan Wang, Qijing Huang, Yizhao Gao, Yaohui Cai, Bichen Wu, Kurt Keutzer, John Wawrzynek

Figure 1 for CoDeNet: Algorithm-hardware Co-design for Deformable Convolution
Figure 2 for CoDeNet: Algorithm-hardware Co-design for Deformable Convolution
Figure 3 for CoDeNet: Algorithm-hardware Co-design for Deformable Convolution
Figure 4 for CoDeNet: Algorithm-hardware Co-design for Deformable Convolution
Viaarxiv icon

ProTuner: Tuning Programs with Monte Carlo Tree Search

Add code
Bookmark button
Alert button
May 27, 2020
Ameer Haj-Ali, Hasan Genc, Qijing Huang, William Moses, John Wawrzynek, Krste Asanović, Ion Stoica

Figure 1 for ProTuner: Tuning Programs with Monte Carlo Tree Search
Figure 2 for ProTuner: Tuning Programs with Monte Carlo Tree Search
Figure 3 for ProTuner: Tuning Programs with Monte Carlo Tree Search
Figure 4 for ProTuner: Tuning Programs with Monte Carlo Tree Search
Viaarxiv icon

AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 04, 2020
Qijing Huang, Ameer Haj-Ali, William Moses, John Xiang, Ion Stoica, Krste Asanovic, John Wawrzynek

Figure 1 for AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning
Figure 2 for AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning
Figure 3 for AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning
Figure 4 for AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning
Viaarxiv icon

Algorithm-hardware Co-design for Deformable Convolution

Add code
Bookmark button
Alert button
Feb 19, 2020
Qijing Huang, Dequan Wang, Yizhao Gao, Yaohui Cai, Zhen Dong, Bichen Wu, Kurt Keutzer, John Wawrzynek

Figure 1 for Algorithm-hardware Co-design for Deformable Convolution
Figure 2 for Algorithm-hardware Co-design for Deformable Convolution
Viaarxiv icon

Integrating NVIDIA Deep Learning Accelerator (NVDLA) with RISC-V SoC on FireSim

Add code
Bookmark button
Alert button
Mar 05, 2019
Farzad Farshchi, Qijing Huang, Heechul Yun

Figure 1 for Integrating NVIDIA Deep Learning Accelerator (NVDLA) with RISC-V SoC on FireSim
Figure 2 for Integrating NVIDIA Deep Learning Accelerator (NVDLA) with RISC-V SoC on FireSim
Figure 3 for Integrating NVIDIA Deep Learning Accelerator (NVDLA) with RISC-V SoC on FireSim
Figure 4 for Integrating NVIDIA Deep Learning Accelerator (NVDLA) with RISC-V SoC on FireSim
Viaarxiv icon