Srinivas Sridharan

STAGE: A Symbolic Tensor grAph GEnerator for distributed AI system co-design

Nov 14, 2025
Via arXiv

Maya: Optimizing Deep Learning Training Workloads using Emulated Virtual Accelerators

Mar 26, 2025
Via arXiv

LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation

Nov 04, 2024
Via arXiv

Chakra: Advancing Performance Benchmarking and Co-design using Standardized Execution Traces

May 26, 2023
Via arXiv

ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

Mar 24, 2023
Via arXiv

Mystique: Accurate and Scalable Production AI Benchmarks Generation

Dec 16, 2022
Via arXiv

Impact of RoCE Congestion Control Policies on Distributed Training of DNNs

Jul 22, 2022
Via arXiv

Themis: A Network Bandwidth-Aware Collective Scheduling Policy for Distributed Training of DL Models

Oct 09, 2021
Via arXiv

High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models

Apr 15, 2021
Via arXiv

Automatic Model Parallelism for Deep Neural Networks with Compiler and Hardware Support

Jun 11, 2019
Via arXiv