Picture for John D. Owens

John D. Owens

Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms

Add code
Apr 19, 2024
Figure 1 for Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
Figure 2 for Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
Figure 3 for Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
Figure 4 for Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
Viaarxiv icon

The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks

Sep 30, 2023
Figure 1 for The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks
Figure 2 for The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks
Figure 3 for The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks
Figure 4 for The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks
Viaarxiv icon

Building a Performance Model for Deep Learning Recommendation Model Training on GPUs

Add code
Jan 19, 2022
Figure 1 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 2 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 3 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 4 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Viaarxiv icon

Energy-based Out-of-distribution Detection

Add code
Oct 13, 2020
Figure 1 for Energy-based Out-of-distribution Detection
Figure 2 for Energy-based Out-of-distribution Detection
Figure 3 for Energy-based Out-of-distribution Detection
Figure 4 for Energy-based Out-of-distribution Detection
Viaarxiv icon

Unsupervised Object Segmentation with Explicit Localization Module

Nov 21, 2019
Figure 1 for Unsupervised Object Segmentation with Explicit Localization Module
Figure 2 for Unsupervised Object Segmentation with Explicit Localization Module
Figure 3 for Unsupervised Object Segmentation with Explicit Localization Module
Figure 4 for Unsupervised Object Segmentation with Explicit Localization Module
Viaarxiv icon

Object Localization and Motion Transfer learning with Capsules

Add code
May 20, 2018
Figure 1 for Object Localization and Motion Transfer learning with Capsules
Figure 2 for Object Localization and Motion Transfer learning with Capsules
Figure 3 for Object Localization and Motion Transfer learning with Capsules
Figure 4 for Object Localization and Motion Transfer learning with Capsules
Viaarxiv icon