Boris Ginsburg

A Chat About Boring Problems: Studying GPT-based text normalization

Sep 23, 2023
Yang Zhang, Travis M. Bartley, Mariana Graterol-Fuenmayor, Vitaly Lavrukhin, Evelina Bakhturina, Boris Ginsburg

Investigating End-to-End ASR Architectures for Long Form Audio Transcription

Sep 20, 2023
Nithin Rao Koluguri, Samuel Kriman, Georgy Zelenfroind, Somshubra Majumdar, Dima Rekesh, Vahid Noroozi, Jagadeesh Balam, Boris Ginsburg

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition

Sep 19, 2023
Krishna C. Puvvada, Nithin Rao Koluguri, Kunal Dhawan, Jagadeesh Balam, Boris Ginsburg

Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio

Aug 09, 2023
Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg

Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling

Jul 13, 2023
He Huang, Jagadeesh Balam, Boris Ginsburg

Confidence-based Ensembles of End-to-End Speech Recognition Models

Jun 27, 2023
Igor Gitman, Vitaly Lavrukhin, Aleksandr Laptev, Boris Ginsburg

Towards training Bilingual and Code-Switched Speech Recognition models from Monolingual data sources

Jun 14, 2023
Kunal Dhawan, Dima Rekesh, Boris Ginsburg

SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram mappings

Jun 04, 2023
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg

Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition

May 19, 2023
Dima Rekesh, Samuel Kriman, Somshubra Majumdar, Vahid Noroozi, He Huang, Oleksii Hrinchuk, Ankur Kumar, Boris Ginsburg
