Learning from distinctive candidates to optimize reduced-precision convolution program on tensor cores


Feb 24, 2022
Junkyeong Choi, Hyucksung Kwon, Woongkyu Lee, Jungwook Choi, Jieun Lim

* 10 pages, 16 figures, preliminary work 

NN-LUT: Neural Approximation of Non-Linear Operations for Efficient Transformer Inference


Dec 03, 2021
Joonsang Yu, Junki Park, Seongmin Park, Minsoo Kim, Sihwa Lee, Dong Hyun Lee, Jungwook Choi

* 7 pages, 3 figures 

Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling


Oct 07, 2021
Kyuhong Shim, Iksoo Choi, Wonyong Sung, Jungwook Choi


Robust Machine Learning Systems: Challenges, Current Trends, Perspectives, and the Road Ahead


Jan 04, 2021
Muhammad Shafique, Mahum Naseer, Theocharis Theocharides, Christos Kyrkou, Onur Mutlu, Lois Orosa, Jungwook Choi

* IEEE Design and Test (Volume: 37, Issue: 2, April 2020): 30-57 
* Final version appears in https://ieeexplore.ieee.org/document/8979377 

Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks


Sep 30, 2020
Yoonho Boo, Sungho Shin, Jungwook Choi, Wonyong Sung


Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks


Jan 19, 2019
Charbel Sakr, Naigang Wang, Chia-Yu Chen, Jungwook Choi, Ankur Agrawal, Naresh Shanbhag, Kailash Gopalakrishnan

* Published as a conference paper in ICLR 2019 

Training Deep Neural Networks with 8-bit Floating Point Numbers


Dec 19, 2018
Naigang Wang, Jungwook Choi, Daniel Brand, Chia-Yu Chen, Kailash Gopalakrishnan

* NeurIPS 2018 (12 pages) 

Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)


Jul 17, 2018
Jungwook Choi, Pierce I-Jen Chuang, Zhuo Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan

* arXiv admin note: substantial text overlap with arXiv:1805.06085 

PACT: Parameterized Clipping Activation for Quantized Neural Networks


Jul 17, 2018
Jungwook Choi, Zhuo Wang, Swagath Venkataramani, Pierce I-Jen Chuang, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan

