Picture for Dongseong Hwang

Dongseong Hwang

FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information

Add code
May 23, 2024
Viaarxiv icon

TransformerFAM: Feedback attention is working memory

Add code
Apr 14, 2024
Figure 1 for TransformerFAM: Feedback attention is working memory
Figure 2 for TransformerFAM: Feedback attention is working memory
Figure 3 for TransformerFAM: Feedback attention is working memory
Figure 4 for TransformerFAM: Feedback attention is working memory
Viaarxiv icon

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Add code
Feb 27, 2024
Viaarxiv icon

Revisiting the Entropy Semiring for Neural Speech Recognition

Add code
Dec 19, 2023
Figure 1 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 2 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 3 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 4 for Revisiting the Entropy Semiring for Neural Speech Recognition
Viaarxiv icon

Massive End-to-end Models for Short Search Queries

Add code
Sep 22, 2023
Figure 1 for Massive End-to-end Models for Short Search Queries
Figure 2 for Massive End-to-end Models for Short Search Queries
Figure 3 for Massive End-to-end Models for Short Search Queries
Figure 4 for Massive End-to-end Models for Short Search Queries
Viaarxiv icon

Improving Speech Recognition for African American English With Audio Classification

Add code
Sep 16, 2023
Figure 1 for Improving Speech Recognition for African American English With Audio Classification
Figure 2 for Improving Speech Recognition for African American English With Audio Classification
Figure 3 for Improving Speech Recognition for African American English With Audio Classification
Figure 4 for Improving Speech Recognition for African American English With Audio Classification
Viaarxiv icon

Edit Distance based RL for RNNT decoding

Add code
May 31, 2023
Figure 1 for Edit Distance based RL for RNNT decoding
Figure 2 for Edit Distance based RL for RNNT decoding
Figure 3 for Edit Distance based RL for RNNT decoding
Figure 4 for Edit Distance based RL for RNNT decoding
Viaarxiv icon

Modular Domain Adaptation for Conformer-Based Streaming ASR

Add code
May 22, 2023
Viaarxiv icon

Efficient Domain Adaptation for Speech Foundation Models

Add code
Feb 03, 2023
Figure 1 for Efficient Domain Adaptation for Speech Foundation Models
Figure 2 for Efficient Domain Adaptation for Speech Foundation Models
Figure 3 for Efficient Domain Adaptation for Speech Foundation Models
Figure 4 for Efficient Domain Adaptation for Speech Foundation Models
Viaarxiv icon

Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion

Add code
Nov 04, 2022
Figure 1 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 2 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 3 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Figure 4 for Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Viaarxiv icon