Picture for Cliff Young

Cliff Young

TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings

Add code
Apr 20, 2023
Figure 1 for TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Figure 2 for TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Figure 3 for TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Figure 4 for TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Viaarxiv icon

MegaBlocks: Efficient Sparse Training with Mixture-of-Experts

Add code
Nov 29, 2022
Figure 1 for MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
Figure 2 for MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
Figure 3 for MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
Figure 4 for MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
Viaarxiv icon

Exploring the limits of Concurrency in ML Training on Google TPUs

Add code
Nov 07, 2020
Figure 1 for Exploring the limits of Concurrency in ML Training on Google TPUs
Figure 2 for Exploring the limits of Concurrency in ML Training on Google TPUs
Figure 3 for Exploring the limits of Concurrency in ML Training on Google TPUs
Figure 4 for Exploring the limits of Concurrency in ML Training on Google TPUs
Viaarxiv icon

Sparse GPU Kernels for Deep Learning

Add code
Jun 18, 2020
Figure 1 for Sparse GPU Kernels for Deep Learning
Figure 2 for Sparse GPU Kernels for Deep Learning
Figure 3 for Sparse GPU Kernels for Deep Learning
Figure 4 for Sparse GPU Kernels for Deep Learning
Viaarxiv icon

Bit-Parallel Vector Composability for Neural Acceleration

Add code
Apr 11, 2020
Figure 1 for Bit-Parallel Vector Composability for Neural Acceleration
Figure 2 for Bit-Parallel Vector Composability for Neural Acceleration
Figure 3 for Bit-Parallel Vector Composability for Neural Acceleration
Figure 4 for Bit-Parallel Vector Composability for Neural Acceleration
Viaarxiv icon

MLPerf Training Benchmark

Add code
Oct 30, 2019
Figure 1 for MLPerf Training Benchmark
Figure 2 for MLPerf Training Benchmark
Figure 3 for MLPerf Training Benchmark
Figure 4 for MLPerf Training Benchmark
Viaarxiv icon

Mesh-TensorFlow: Deep Learning for Supercomputers

Add code
Nov 05, 2018
Figure 1 for Mesh-TensorFlow: Deep Learning for Supercomputers
Figure 2 for Mesh-TensorFlow: Deep Learning for Supercomputers
Figure 3 for Mesh-TensorFlow: Deep Learning for Supercomputers
Viaarxiv icon

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

Add code
Oct 08, 2016
Figure 1 for Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Figure 2 for Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Figure 3 for Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Figure 4 for Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Viaarxiv icon