Alert button
Picture for Gennady Pekhimenko

Gennady Pekhimenko

Alert button

RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads

Add code
Bookmark button
Alert button
Feb 08, 2021
James Gleeson, Srivatsan Krishnan, Moshe Gabel, Vijay Janapa Reddi, Eyal de Lara, Gennady Pekhimenko

Figure 1 for RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
Figure 2 for RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
Figure 3 for RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
Figure 4 for RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
Viaarxiv icon

Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models

Add code
Bookmark button
Alert button
Feb 07, 2021
Shang Wang, Peiming Yang, Yuxuan Zheng, Xin Li, Gennady Pekhimenko

Figure 1 for Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models
Figure 2 for Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models
Figure 3 for Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models
Figure 4 for Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models
Viaarxiv icon

Computational Performance Predictions for Deep Neural Network Training: A Runtime-Based Approach

Add code
Bookmark button
Alert button
Jan 31, 2021
Geoffrey X. Yu, Yubo Gao, Pavel Golikov, Gennady Pekhimenko

Figure 1 for Computational Performance Predictions for Deep Neural Network Training: A Runtime-Based Approach
Figure 2 for Computational Performance Predictions for Deep Neural Network Training: A Runtime-Based Approach
Figure 3 for Computational Performance Predictions for Deep Neural Network Training: A Runtime-Based Approach
Figure 4 for Computational Performance Predictions for Deep Neural Network Training: A Runtime-Based Approach
Viaarxiv icon

IOS: Inter-Operator Scheduler for CNN Acceleration

Add code
Bookmark button
Alert button
Nov 02, 2020
Yaoyao Ding, Ligeng Zhu, Zhihao Jia, Gennady Pekhimenko, Song Han

Figure 1 for IOS: Inter-Operator Scheduler for CNN Acceleration
Figure 2 for IOS: Inter-Operator Scheduler for CNN Acceleration
Figure 3 for IOS: Inter-Operator Scheduler for CNN Acceleration
Figure 4 for IOS: Inter-Operator Scheduler for CNN Acceleration
Viaarxiv icon

FPRaker: A Processing Element For Accelerating Neural Network Training

Add code
Bookmark button
Alert button
Oct 15, 2020
Omar Mohamed Awad, Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Ciaran Bannon, Anand Jayarajan, Gennady Pekhimenko, Andreas Moshovos

Figure 1 for FPRaker: A Processing Element For Accelerating Neural Network Training
Figure 2 for FPRaker: A Processing Element For Accelerating Neural Network Training
Figure 3 for FPRaker: A Processing Element For Accelerating Neural Network Training
Figure 4 for FPRaker: A Processing Element For Accelerating Neural Network Training
Viaarxiv icon

TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference

Add code
Bookmark button
Alert button
Sep 01, 2020
Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Omar Mohamed Awad, Gennady Pekhimenko, Jorge Albericio, Andreas Moshovos

Figure 1 for TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference
Figure 2 for TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference
Figure 3 for TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference
Figure 4 for TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference
Viaarxiv icon

Skyline: Interactive In-Editor Computational Performance Profiling for Deep Neural Network Training

Add code
Bookmark button
Alert button
Aug 20, 2020
Geoffrey X. Yu, Tovi Grossman, Gennady Pekhimenko

Figure 1 for Skyline: Interactive In-Editor Computational Performance Profiling for Deep Neural Network Training
Figure 2 for Skyline: Interactive In-Editor Computational Performance Profiling for Deep Neural Network Training
Figure 3 for Skyline: Interactive In-Editor Computational Performance Profiling for Deep Neural Network Training
Figure 4 for Skyline: Interactive In-Editor Computational Performance Profiling for Deep Neural Network Training
Viaarxiv icon

Multi-node Bert-pretraining: Cost-efficient Approach

Add code
Bookmark button
Alert button
Aug 01, 2020
Jiahuang Lin, Xin Li, Gennady Pekhimenko

Figure 1 for Multi-node Bert-pretraining: Cost-efficient Approach
Figure 2 for Multi-node Bert-pretraining: Cost-efficient Approach
Figure 3 for Multi-node Bert-pretraining: Cost-efficient Approach
Figure 4 for Multi-node Bert-pretraining: Cost-efficient Approach
Viaarxiv icon