Satoshi Matsuoka

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems


Oct 26, 2021
Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, Toshiyuki Imamura, Akihiko Kasagi, Kentaro Kawakami, Shuhei Kudo, Akiyoshi Kuroda, Maxime Martinasso, Satoshi Matsuoka, Henrique Mendonça, Kazuki Minami, Prabhat Ram, Takashi Sawada, Mallikarjun Shankar, Tom St. John, Akihiro Tabuchi, Venkatram Vishwanath, Mohamed Wahib, Masafumi Yamazaki, Junqi Yin



Scaling Distributed Deep Learning Workloads beyond the Memory Capacity with KARMA


Aug 26, 2020
Mohamed Wahib, Haoyu Zhang, Truong Thao Nguyen, Aleksandr Drozd, Jens Domke, Lingqi Zhang, Ryousei Takano, Satoshi Matsuoka

* ACM/IEEE Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'20) 


The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism


Jul 25, 2020
Yosuke Oyama, Naoya Maruyama, Nikoli Dryden, Erin McCarthy, Peter Harrington, Jan Balewski, Satoshi Matsuoka, Peter Nugent, Brian Van Essen

* 12 pages, 10 figures 


Second-order Optimization Method for Large Mini-batch: Training ResNet-50 on ImageNet in 35 Epochs


Dec 05, 2018
Kazuki Osawa, Yohei Tsuji, Yuichiro Ueno, Akira Naruse, Rio Yokota, Satoshi Matsuoka

* 10 pages, 7 figures 


μ-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batching


Apr 13, 2018
Yosuke Oyama, Tal Ben-Nun, Torsten Hoefler, Satoshi Matsuoka

* 11 pages, 14 figures. Part of the content has been published in IPSJ SIG Technical Report, Vol. 2017-HPC-162, No. 22, pp. 1-9, 2017. (DOI: http://id.nii.ac.jp/1001/00184814)
