Alert button

"speech recognition": models, code, and papers
Alert button

Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition

Add code
Bookmark button
Alert button
Oct 12, 2021
Li-Wei Chen, Alexander Rudnicky

Figure 1 for Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
Figure 2 for Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
Figure 3 for Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
Figure 4 for Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
Viaarxiv icon

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition

Oct 09, 2021
Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi, Yen-Ju Lu, Aswin Shanmugam Subramanian, Tianzi Wang, Shu-wen Yang, Yu Tsao, Hung-yi Lee, Shinji Watanabe

Figure 1 for An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Figure 2 for An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Figure 3 for An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Figure 4 for An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Viaarxiv icon

Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers

Dec 07, 2022
Zijian Yang, Wei Zhou, Ralf Schlüter, Hermann Ney

Figure 1 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Figure 2 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Figure 3 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Viaarxiv icon

Improving Transformer-based Speech Recognition Using Unsupervised Pre-training

Oct 30, 2019
Dongwei Jiang, Xiaoning Lei, Wubo Li, Ne Luo, Yuxuan Hu, Wei Zou, Xiangang Li

Figure 1 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 2 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 3 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 4 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Viaarxiv icon

A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition

Aug 06, 2020
Sitong Zhou, Homayoon Beigi

Figure 1 for A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition
Figure 2 for A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition
Figure 3 for A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition
Figure 4 for A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition
Viaarxiv icon

Evaluating context-invariance in unsupervised speech representations

Oct 27, 2022
Mark Hallap, Emmanuel Dupoux, Ewan Dunbar

Figure 1 for Evaluating context-invariance in unsupervised speech representations
Figure 2 for Evaluating context-invariance in unsupervised speech representations
Figure 3 for Evaluating context-invariance in unsupervised speech representations
Figure 4 for Evaluating context-invariance in unsupervised speech representations
Viaarxiv icon

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

Jun 16, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

Figure 1 for Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Figure 2 for Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Viaarxiv icon

Toward Cross-Domain Speech Recognition with End-to-End Models

Mar 09, 2020
Thai-Son Nguyen, Sebastian Stüker, Alex Waibel

Figure 1 for Toward Cross-Domain Speech Recognition with End-to-End Models
Figure 2 for Toward Cross-Domain Speech Recognition with End-to-End Models
Figure 3 for Toward Cross-Domain Speech Recognition with End-to-End Models
Figure 4 for Toward Cross-Domain Speech Recognition with End-to-End Models
Viaarxiv icon

Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge

Add code
Bookmark button
Alert button
Jul 29, 2022
Alef Iury Siqueira Ferreira, Gustavo dos Reis Oliveira

Figure 1 for Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Figure 2 for Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Figure 3 for Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Figure 4 for Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Viaarxiv icon

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition

Jan 14, 2022
Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng

Figure 1 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 2 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 3 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 4 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Viaarxiv icon