Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages



Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi

* Code available at https://github.com/asappresearch/wav2seq 


SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition



Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Han, Shinji Watanabe



Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition



Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Han, Kilian Q. Weinberger, Yoav Artzi

* Code available at https://github.com/asappresearch/sew 


Multi-mode Transformer Transducer with Stochastic Future Context



Kwangyoun Kim, Felix Wu, Prashant Sridhar, Kyu J. Han, Shinji Watanabe

* Accepted to Interspeech 2021 


Sequential Routing Framework: Fully Capsule Network-based Speech Recognition



Kyungmin Lee, Hyunwhan Joe, Hyeontaek Lim, Kwangyoun Kim, Sungsoo Kim, Chang Woo Han, Hong-Gee Kim

* 40 pages, 7 figures (10 figures in total), submitted to Computer Speech and Language (only line numbers were removed from the submitted version) 


Small energy masking for improved neural network training for end-to-end speech recognition



Chanwoo Kim, Kwangyoun Kim, Sathish Reddy Indurthi

* Accepted at ICASSP 2020 


Attention based on-device streaming speech recognition with large speech corpus



Kwangyoun Kim, Kyungmin Lee, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim

* Accepted and presented at the ASRU 2019 conference 


Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models



Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim

* Accepted and presented at the ASRU 2019 conference 


Power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition



Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda

* Accepted and presented at the ASRU 2019 conference 


End-to-end training of a large vocabulary end-to-end speech recognition system



Chanwoo Kim, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda

* Accepted and presented at the ASRU 2019 conference 
