Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Iksoo Choi

Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling


Oct 07, 2021
Kyuhong Shim, Iksoo Choi, Wonyong Sung, Jungwook Choi


  Access Paper or Ask Questions

S-SGD: Symmetrical Stochastic Gradient Descent with Weight Noise Injection for Reaching Flat Minima


Sep 05, 2020
Wonyong Sung, Iksoo Choi, Jinhwan Park, Seokhyun Choi, Sungho Shin


  Access Paper or Ask Questions