Adam Paszke

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Apr 11, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

PartIR: Composing SPMD Partitioning Strategies for Machine Learning

Jan 23, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Automap: Towards Ergonomic Automated Parallelism for ML Models

Dec 06, 2021
Via arXiv

Memory-efficient array redistribution through portable collective communication

Dec 02, 2021
Via arXiv

Decomposing reverse-mode automatic differentiation

May 20, 2021
Via arXiv

Tensors Fitting Perfectly

Feb 26, 2021
Via arXiv

PyTorch Distributed: Experiences on Accelerating Data Parallel Training

Jun 28, 2020
Via arXiv

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Dec 03, 2019
Via arXiv