Alert button
Picture for Dongseong Hwang

Dongseong Hwang

Alert button

TransformerFAM: Feedback attention is working memory

Add code
Bookmark button
Alert button
Apr 14, 2024
Dongseong Hwang, Weiran Wang, Zhuoyuan Huo, Khe Chai Sim, Pedro Moreno Mengibar

Viaarxiv icon

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Add code
Bookmark button
Alert button
Feb 27, 2024
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno

Viaarxiv icon

Revisiting the Entropy Semiring for Neural Speech Recognition

Add code
Bookmark button
Alert button
Dec 19, 2023
Oscar Chang, Dongseong Hwang, Olivier Siohan

Figure 1 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 2 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 3 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 4 for Revisiting the Entropy Semiring for Neural Speech Recognition
Viaarxiv icon

Massive End-to-end Models for Short Search Queries

Add code
Bookmark button
Alert button
Sep 22, 2023
Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara Sainath, Pedro Moreno Mengibar

Figure 1 for Massive End-to-end Models for Short Search Queries
Figure 2 for Massive End-to-end Models for Short Search Queries
Figure 3 for Massive End-to-end Models for Short Search Queries
Figure 4 for Massive End-to-end Models for Short Search Queries
Viaarxiv icon

Improving Speech Recognition for African American English With Audio Classification

Add code
Bookmark button
Alert button
Sep 16, 2023
Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara Sainath, Françoise Beaufays, Pedro Moreno Mengibar

Figure 1 for Improving Speech Recognition for African American English With Audio Classification
Figure 2 for Improving Speech Recognition for African American English With Audio Classification
Figure 3 for Improving Speech Recognition for African American English With Audio Classification
Figure 4 for Improving Speech Recognition for African American English With Audio Classification
Viaarxiv icon

Edit Distance based RL for RNNT decoding

Add code
Bookmark button
Alert button
May 31, 2023
Dongseong Hwang, Changwan Ryu, Khe Chai Sim

Figure 1 for Edit Distance based RL for RNNT decoding
Figure 2 for Edit Distance based RL for RNNT decoding
Figure 3 for Edit Distance based RL for RNNT decoding
Figure 4 for Edit Distance based RL for RNNT decoding
Viaarxiv icon

Modular Domain Adaptation for Conformer-Based Streaming ASR

Add code
Bookmark button
Alert button
May 22, 2023
Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro M. Mengibar

Figure 1 for Modular Domain Adaptation for Conformer-Based Streaming ASR
Figure 2 for Modular Domain Adaptation for Conformer-Based Streaming ASR
Figure 3 for Modular Domain Adaptation for Conformer-Based Streaming ASR
Figure 4 for Modular Domain Adaptation for Conformer-Based Streaming ASR
Viaarxiv icon

Efficient Domain Adaptation for Speech Foundation Models

Add code
Bookmark button
Alert button
Feb 03, 2023
Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Francoise Beaufays

Figure 1 for Efficient Domain Adaptation for Speech Foundation Models
Figure 2 for Efficient Domain Adaptation for Speech Foundation Models
Figure 3 for Efficient Domain Adaptation for Speech Foundation Models
Figure 4 for Efficient Domain Adaptation for Speech Foundation Models
Viaarxiv icon

Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion

Add code
Bookmark button
Alert button
Nov 04, 2022
Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman

Figure 1 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 2 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 3 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 4 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Viaarxiv icon

Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR

Add code
Bookmark button
Alert button
Oct 11, 2022
Dongseong Hwang, Khe Chai Sim, Yu Zhang, Trevor Strohman

Figure 1 for Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR
Figure 2 for Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR
Figure 3 for Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR
Figure 4 for Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR
Viaarxiv icon