
Jayashree Mohan


Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve

Mar 04, 2024
Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee

Via arXiv

SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills

Aug 31, 2023
Amey Agrawal, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Ramachandran Ramjee

Via arXiv

Synergy: Resource Sensitive DNN Scheduling in Multi-Tenant Clusters

Oct 12, 2021
Jayashree Mohan, Amar Phanishayee, Janardhan Kulkarni, Vijay Chidambaram

Via arXiv

Memory Optimization for Deep Networks

Oct 29, 2020
Aashaka Shah, Chao-Yuan Wu, Jayashree Mohan, Vijay Chidambaram, Philipp Krähenbühl

Via arXiv

Analyzing and Mitigating Data Stalls in DNN Training

Jul 14, 2020
Jayashree Mohan, Amar Phanishayee, Ashish Raniwala, Vijay Chidambaram

Via arXiv