Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference

Oct 25, 2020
Yujeong Choi, Yunseong Kim, Minsoo Rhu


  Access Paper or Ask Questions

Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training

Oct 25, 2020
Youngeun Kwon, Yunjae Lee, Minsoo Rhu


  Access Paper or Ask Questions

Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations

May 12, 2020
Ranggi Hwang, Taehun Kim, Youngeun Kwon, Minsoo Rhu

* Accepted for publication at the 47th IEEE/ACM International Symposium on Computer Architecture (ISCA-47), 2020 

  Access Paper or Ask Questions

NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units

Nov 15, 2019
Bongjoon Hyun, Youngeun Kwon, Yujeong Choi, John Kim, Minsoo Rhu


  Access Paper or Ask Questions

PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units

Sep 06, 2019
Yujeong Choi, Minsoo Rhu


  Access Paper or Ask Questions

TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning

Aug 25, 2019
Youngeun Kwon, Yunjae Lee, Minsoo Rhu

* Accepted for publication at the 52nd IEEE/ACM International Symposium on Microarchitecture (MICRO-52), 2019 

  Access Paper or Ask Questions

Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning

Feb 18, 2019
Youngeun Kwon, Minsoo Rhu

* Published as a conference paper at the 51st IEEE/ACM International Symposium on Microarchitecture (MICRO-51), 2018 

  Access Paper or Ask Questions

Structurally Sparsified Backward Propagation for Faster Long Short-Term Memory Training

Jun 01, 2018
Maohua Zhu, Jason Clemons, Jeff Pool, Minsoo Rhu, Stephen W. Keckler, Yuan Xie


  Access Paper or Ask Questions

SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks

May 23, 2017
Angshuman Parashar, Minsoo Rhu, Anurag Mukkara, Antonio Puglielli, Rangharajan Venkatesan, Brucek Khailany, Joel Emer, Stephen W. Keckler, William J. Dally


  Access Paper or Ask Questions

Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks

May 03, 2017
Minsoo Rhu, Mike O'Connor, Niladrish Chatterjee, Jeff Pool, Stephen W. Keckler


  Access Paper or Ask Questions

vDNN: Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design

Jul 28, 2016
Minsoo Rhu, Natalia Gimelshein, Jason Clemons, Arslan Zulfiqar, Stephen W. Keckler

* Published as a conference paper at the 49th IEEE/ACM International Symposium on Microarchitecture (MICRO-49), 2016 

  Access Paper or Ask Questions