"speech recognition": models, code, and papers

SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram mappings

Jun 04, 2023
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg

Via arXiv

Acoustic absement in detail: Quantifying acoustic differences across time-series representations of speech data

Apr 14, 2023
Matthew C. Kelley

Via arXiv

DiscoverPath: A Knowledge Refinement and Retrieval System for Interdisciplinarity on Biomedical Research

Sep 04, 2023
Yu-Neng Chuang, Guanchu Wang, Chia-Yuan Chang, Kwei-Herng Lai, Daochen Zha, Ruixiang Tang, Fan Yang, Alfredo Costilla Reyes, Kaixiong Zhou, Xiaoqian Jiang, Xia Hu

Via arXiv

Improving Language Model Integration for Neural Machine Translation

Jun 08, 2023
Christian Herold, Yingbo Gao, Mohammad Zeineldeen, Hermann Ney

Via arXiv

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition

Dec 02, 2022
Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu

Via arXiv

Quilt-1M: One Million Image-Text Pairs for Histopathology

Jun 22, 2023
Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Dylan Stefan Chan Geva, Fatwir Sheikh Mohammed, Pavan Kumar Anand, Ranjay Krishna, Linda Shapiro

Via arXiv

DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model

Jun 02, 2023
Haoyu Wang, Siyuan Wang, Wei-Qiang Zhang, Jinfeng Bai

Via arXiv

Regularizing Contrastive Predictive Coding for Speech Applications

Apr 26, 2023
Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko, Laureano Moro-Velazquez, Najim Dehak

Via arXiv

Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition

Apr 08, 2022
Zehai Tu, Jack Deadman, Ning Ma, Jon Barker

Via arXiv

Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization

Jun 07, 2023
Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Tomohiro Tanaka, Takatomo Kano, Atsunori Ogawa, Marc Delcroix

Via arXiv