"speech": models, code, and papers

Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy

Oct 11, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

Via arXiv

Automatic Documentation of ICD Codes with Far-Field Speech Recognition

Nov 04, 2018
Albert Haque, Corinna Fukushima

Via arXiv

A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting

Jun 20, 2019
Yue Gu, Zhihao Du, Hui Zhang, Xueliang Zhang

Via arXiv

SlovakBERT: Slovak Masked Language Model

Sep 30, 2021
Matúš Pikuliak, Štefan Grivalský, Martin Konôpka, Miroslav Blšták, Martin Tamajka, Viktor Bachratý, Marián Šimko, Pavol Balážik, Michal Trnka, Filip Uhlárik

Via arXiv

Homophone-based Label Smoothing in End-to-End Automatic Speech Recognition

Apr 07, 2020
Yi Zheng, Xianjie Yang, Xuyong Dang

Via arXiv

Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-supervised Speaker Verification

Dec 24, 2021
Sung Hwan Mun, Min Hyun Han, Dongjune Lee, Jihwan Kim, Nam Soo Kim

Via arXiv

Towards a Competitive End-to-End Speech Recognition for CHiME-6 Dinner Party Transcription

Apr 24, 2020
Andrei Andrusenko, Aleksandr Laptev, Ivan Medennikov

Via arXiv

Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation

Jun 17, 2019
Siyuan Feng, Tan Lee

Via arXiv

Attention Based Fully Convolutional Network for Speech Emotion Recognition

Jun 05, 2018
Yuanyuan Zhang, Jun Du, Zirui Wang, Jianshu Zhang

Via arXiv

Don't stop the training: continuously-updating self-supervised algorithms best account for auditory responses in the cortex

Feb 15, 2022
Pierre Orhan, Yves Boubenec, Jean-Rémi King

Via arXiv