"speech recognition": models, code, and papers

Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition

Sep 13, 2022
Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno

Building a Non-native Speech Corpus Featuring Chinese-English Bilingual Children: Compilation and Rationale

Apr 30, 2023
Hiuchung Hung, Andreas Maier, Thorsten Piske

Unsupervised ASR via Cross-Lingual Pseudo-Labeling

May 19, 2023
Tatiana Likhomanenko, Loren Lugosch, Ronan Collobert

Speech Emotion Diarization: Which Emotion Appears When?

Jun 22, 2023
Yingzhi Wang, Mirco Ravanelli, Alaa Nfissi, Alya Yacoubi

MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction

Aug 15, 2023
Jie Yang, Soyeon Caren Han, Siqu Long, Josiah Poon, Goran Nenadic

Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis

Mar 27, 2023
Karren Yang, Ting-Yao Hu, Jen-Hao Rick Chang, Hema Swetha Koppula, Oncel Tuzel

Unsupervised Speech Recognition

May 24, 2021
Alexei Baevski, Wei-Ning Hsu, Alexis Conneau, Michael Auli

Hi,KIA: A Speech Emotion Recognition Dataset for Wake-Up Words

Nov 07, 2022
Taesu Kim, SeungHeon Doh, Gyunpyo Lee, Hyungseok Jeon, Juhan Nam, Hyeon-Jeong Suk

Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training

Jun 16, 2022
Bowen Zhang, Songjun Cao, Xiaoming Zhang, Yike Zhang, Long Ma, Takahiro Shinozaki

Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition

Apr 04, 2022
Abner Hernandez, Paula Andrea Pérez-Toro, Elmar Nöth, Juan Rafael Orozco-Arroyave, Andreas Maier, Seung Hee Yang
