Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Iksoo Choi

Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling

Oct 07, 2021
Kyuhong Shim, Iksoo Choi, Wonyong Sung, Jungwook Choi

  Access Paper or Ask Questions

S-SGD: Symmetrical Stochastic Gradient Descent with Weight Noise Injection for Reaching Flat Minima

Sep 05, 2020
Wonyong Sung, Iksoo Choi, Jinhwan Park, Seokhyun Choi, Sungho Shin

  Access Paper or Ask Questions