Picture for Misha Smelyanskiy

Misha Smelyanskiy

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

Differentiable NAS Framework and Application to Ads CTR Prediction

Add code
Oct 25, 2021
Figure 1 for Differentiable NAS Framework and Application to Ads CTR Prediction
Figure 2 for Differentiable NAS Framework and Application to Ads CTR Prediction
Figure 3 for Differentiable NAS Framework and Application to Ads CTR Prediction
Figure 4 for Differentiable NAS Framework and Application to Ads CTR Prediction
Viaarxiv icon

Check-N-Run: A Checkpointing System for Training Recommendation Models

Add code
Oct 17, 2020
Figure 1 for Check-N-Run: A Checkpointing System for Training Recommendation Models
Figure 2 for Check-N-Run: A Checkpointing System for Training Recommendation Models
Figure 3 for Check-N-Run: A Checkpointing System for Training Recommendation Models
Figure 4 for Check-N-Run: A Checkpointing System for Training Recommendation Models
Viaarxiv icon

A Study of BFLOAT16 for Deep Learning Training

Add code
Jun 13, 2019
Figure 1 for A Study of BFLOAT16 for Deep Learning Training
Figure 2 for A Study of BFLOAT16 for Deep Learning Training
Figure 3 for A Study of BFLOAT16 for Deep Learning Training
Figure 4 for A Study of BFLOAT16 for Deep Learning Training
Viaarxiv icon

Deep Learning Recommendation Model for Personalization and Recommendation Systems

Add code
May 31, 2019
Figure 1 for Deep Learning Recommendation Model for Personalization and Recommendation Systems
Figure 2 for Deep Learning Recommendation Model for Personalization and Recommendation Systems
Figure 3 for Deep Learning Recommendation Model for Personalization and Recommendation Systems
Figure 4 for Deep Learning Recommendation Model for Personalization and Recommendation Systems
Viaarxiv icon

Bandana: Using Non-volatile Memory for Storing Deep Learning Models

Add code
Nov 15, 2018
Figure 1 for Bandana: Using Non-volatile Memory for Storing Deep Learning Models
Figure 2 for Bandana: Using Non-volatile Memory for Storing Deep Learning Models
Figure 3 for Bandana: Using Non-volatile Memory for Storing Deep Learning Models
Figure 4 for Bandana: Using Non-volatile Memory for Storing Deep Learning Models
Viaarxiv icon