"speech recognition": models, code, and papers

Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition

Jul 02, 2021
Niko Moritz, Takaaki Hori, Jonathan Le Roux

Defending against Adversarial Audio via Diffusion Model

Mar 02, 2023
Shutong Wu, Jiongxiao Wang, Wei Ping, Weili Nie, Chaowei Xiao

Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire

Nov 17, 2022
Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang, Zejun Ma, Bo Xu

Sparsification via Compressed Sensing for Automatic Speech Recognition

Feb 09, 2021
Kai Zhen, Hieu Duy Nguyen, Feng-Ju Chang, Athanasios Mouchtaris, Ariya Rastrow, .

Essence Knowledge Distillation for Speech Recognition

Jun 26, 2019
Zhenchuan Yang, Chun Zhang, Weibin Zhang, Jianxiu Jin, Dongpeng Chen

Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021

Jul 01, 2021
Yuriy Arabskyy, Aashish Agarwal, Subhadeep Dey, Oscar Koller

Evolutionary optimization of contexts for phonetic correction in speech recognition systems

Feb 23, 2021
Rafael Viana-Cámara, Diego Campos-Sobrino, Mario Campos-Soberanis

DiaCorrect: End-to-end error correction for speaker diarization

Oct 31, 2022
Jiangyu Han, Yuhang Cao, Heng Lu, Yanhua Long

MASRI-HEADSET: A Maltese Corpus for Speech Recognition

Aug 13, 2020
Carlos Mena, Albert Gatt, Andrea DeMarco, Claudia Borg, Lonneke van der Plas, Amanda Muscat, Ian Padovani

Measuring Equality in Machine Learning Security Defenses

Mar 01, 2023
Luke E. Richards, Edward Raff, Cynthia Matuszek
