Alert button
Picture for Murali Emani

Murali Emani

Alert button

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

Add code
Bookmark button
Alert button
Oct 11, 2023
Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri, Rao Kotamarthi, Venkatram Vishwanath, Arvind Ramanathan, Sam Foreman, Kyle Hippe, Troy Arcomano, Romit Maulik, Maxim Zvyagin, Alexander Brace, Bin Zhang, Cindy Orozco Bohorquez, Austin Clyde, Bharat Kale, Danilo Perez-Rivera, Heng Ma, Carla M. Mann, Michael Irvin, J. Gregory Pauloski, Logan Ward, Valerie Hayot, Murali Emani, Zhen Xie, Diangen Lin, Maulik Shukla, Ian Foster, James J. Davis, Michael E. Papka, Thomas Brettin, Prasanna Balaprakash, Gina Tourassi, John Gounley, Heidi Hanson, Thomas E Potok, Massimiliano Lupo Pasini, Kate Evans, Dan Lu, Dalton Lunga, Junqi Yin, Sajal Dash, Feiyi Wang, Mallikarjun Shankar, Isaac Lyngaas, Xiao Wang, Guojing Cong, Pei Zhang, Ming Fan, Siyan Liu, Adolfy Hoisie, Shinjae Yoo, Yihui Ren, William Tang, Kyle Felker, Alexey Svyatkovskiy, Hang Liu, Ashwin Aji, Angela Dalton, Michael Schulte, Karl Schulz, Yuntian Deng, Weili Nie, Josh Romero, Christian Dallago, Arash Vahdat, Chaowei Xiao, Thomas Gibbs, Anima Anandkumar, Rick Stevens

Figure 1 for DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
Figure 2 for DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
Figure 3 for DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
Figure 4 for DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
Viaarxiv icon

A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators

Add code
Bookmark button
Alert button
Oct 06, 2023
Murali Emani, Sam Foreman, Varuni Sastry, Zhen Xie, Siddhisanket Raskar, William Arnold, Rajeev Thakur, Venkatram Vishwanath, Michael E. Papka

Viaarxiv icon

Data Race Detection Using Large Language Models

Add code
Bookmark button
Alert button
Aug 15, 2023
Le Chen, Xianzhong Ding, Murali Emani, Tristan Vanderbruggen, Pei-hung Lin, Chuanhua Liao

Figure 1 for Data Race Detection Using Large Language Models
Figure 2 for Data Race Detection Using Large Language Models
Figure 3 for Data Race Detection Using Large Language Models
Figure 4 for Data Race Detection Using Large Language Models
Viaarxiv icon

A Survey of Techniques for Optimizing Transformer Inference

Add code
Bookmark button
Alert button
Jul 16, 2023
Krishna Teja Chitty-Venkata, Sparsh Mittal, Murali Emani, Venkatram Vishwanath, Arun K. Somani

Figure 1 for A Survey of Techniques for Optimizing Transformer Inference
Figure 2 for A Survey of Techniques for Optimizing Transformer Inference
Figure 3 for A Survey of Techniques for Optimizing Transformer Inference
Figure 4 for A Survey of Techniques for Optimizing Transformer Inference
Viaarxiv icon

LM4HPC: Towards Effective Language Model Application in High-Performance Computing

Add code
Bookmark button
Alert button
Jun 26, 2023
Le Chen, Pei-Hung Lin, Tristan Vanderbruggen, Chunhua Liao, Murali Emani, Bronis de Supinski

Viaarxiv icon

A Multi-Level, Multi-Scale Visual Analytics Approach to Assessment of Multifidelity HPC Systems

Add code
Bookmark button
Alert button
Jun 15, 2023
Shilpika, Bethany Lusch, Murali Emani, Filippo Simini, Venkatram Vishwanath, Michael E. Papka, Kwan-Liu Ma

Figure 1 for A Multi-Level, Multi-Scale Visual Analytics Approach to Assessment of Multifidelity HPC Systems
Figure 2 for A Multi-Level, Multi-Scale Visual Analytics Approach to Assessment of Multifidelity HPC Systems
Figure 3 for A Multi-Level, Multi-Scale Visual Analytics Approach to Assessment of Multifidelity HPC Systems
Figure 4 for A Multi-Level, Multi-Scale Visual Analytics Approach to Assessment of Multifidelity HPC Systems
Viaarxiv icon

Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation

Add code
Bookmark button
Alert button
Apr 11, 2023
Gaurav Verma, Siddhisanket Raskar, Zhen Xie, Abid M Malik, Murali Emani, Barbara Chapman

Figure 1 for Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation
Figure 2 for Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation
Figure 3 for Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation
Figure 4 for Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation
Viaarxiv icon

Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study

Add code
Bookmark button
Alert button
Nov 03, 2022
Pei-Hung Lin, Chunhua Liao, Winson Chen, Tristan Vanderbruggen, Murali Emani, Hailu Xu

Figure 1 for Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study
Figure 2 for Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study
Figure 3 for Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study
Figure 4 for Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study
Viaarxiv icon

Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines

Add code
Bookmark button
Alert button
Aug 11, 2022
Patrick Flynn, Tristan Vanderbruggen, Chunhua Liao, Pei-Hung Lin, Murali Emani, Xipeng Shen

Figure 1 for Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Figure 2 for Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Figure 3 for Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Figure 4 for Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
Viaarxiv icon

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems

Add code
Bookmark button
Alert button
Oct 26, 2021
Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, Toshiyuki Imamura, Akihiko Kasagi, Kentaro Kawakami, Shuhei Kudo, Akiyoshi Kuroda, Maxime Martinasso, Satoshi Matsuoka, Henrique Mendonça, Kazuki Minami, Prabhat Ram, Takashi Sawada, Mallikarjun Shankar, Tom St. John, Akihiro Tabuchi, Venkatram Vishwanath, Mohamed Wahib, Masafumi Yamazaki, Junqi Yin

Figure 1 for MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems
Figure 2 for MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems
Figure 3 for MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems
Figure 4 for MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems
Viaarxiv icon