Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Hera: A Heterogeneity-Aware Multi-Tenant Inference Server for Personalized Recommendations


Feb 23, 2023
Yujeong Choi, John Kim, Minsoo Rhu

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

GPU-based Private Information Retrieval for On-Device Machine Learning Inference


Jan 27, 2023
Maximilian Lam, Jeff Johnson, Wenjie Xiong, Kiwan Maeng, Udit Gupta, Minsoo Rhu, Hsien-Hsin S. Lee, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks, Edward Suh

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

DiVa: An Accelerator for Differentially Private Machine Learning


Aug 26, 2022
Beomsik Park, Ranggi Hwang, Dongho Yoon, Yoonhyuk Choi, Minsoo Rhu

Add code

* Accepted for publication at the 55th IEEE/ACM International Symposium on Microarchitecture (MICRO-55), 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage Processing Architectures


May 10, 2022
Yunjae Lee, Jinha Chung, Minsoo Rhu

Add code

* Accepted for publication at the 49th IEEE/ACM International Symposium on Computer Architecture (ISCA-49), 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards


May 10, 2022
Youngeun Kwon, Minsoo Rhu

Add code

* Accepted for publication at the 49th IEEE/ACM International Symposium on Computer Architecture (ISCA-49), 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks


Mar 02, 2022
Minhoo Kang, Ranggi Hwang, Jiwon Lee, Dongyun Kam, Youngjoo Lee, Minsoo Rhu

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

PARIS and ELSA: An Elastic Scheduling Algorithm for Reconfigurable Multi-GPU Inference Servers


Feb 27, 2022
Yunseong Kim, Yujeong Choi, Minsoo Rhu

Add code

* This is an extended version of our work, which is accepted for publication at the 59th ACM/ESDA/IEEE Design Automation Conference (DAC), 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference


Oct 25, 2020
Yujeong Choi, Yunseong Kim, Minsoo Rhu

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training


Oct 25, 2020
Youngeun Kwon, Yunjae Lee, Minsoo Rhu

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>