Alert button

"speech": models, code, and papers
Alert button

Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue

Dec 07, 2022
Daxin Tan, Nikos Kargas, David McHardy, Constantinos Papayiannis, Antonio Bonafonte, Marek Strelec, Jonas Rohnke, Agis Oikonomou Filandras, Trevor Wood

Figure 1 for Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue
Figure 2 for Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue
Figure 3 for Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue
Figure 4 for Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue
Viaarxiv icon

Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers

Dec 07, 2022
Zijian Yang, Wei Zhou, Ralf Schlüter, Hermann Ney

Figure 1 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Figure 2 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Figure 3 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Viaarxiv icon

CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese

Add code
Bookmark button
Alert button
Oct 14, 2021
Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio

Figure 1 for CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Figure 2 for CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Figure 3 for CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Figure 4 for CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Viaarxiv icon

Gated Recurrent Neural Networks with Weighted Time-Delay Feedback

Dec 01, 2022
N. Benjamin Erichson, Soon Hoe Lim, Michael W. Mahoney

Figure 1 for Gated Recurrent Neural Networks with Weighted Time-Delay Feedback
Figure 2 for Gated Recurrent Neural Networks with Weighted Time-Delay Feedback
Figure 3 for Gated Recurrent Neural Networks with Weighted Time-Delay Feedback
Figure 4 for Gated Recurrent Neural Networks with Weighted Time-Delay Feedback
Viaarxiv icon

Foundation Transformers

Add code
Bookmark button
Alert button
Oct 12, 2022
Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei

Figure 1 for Foundation Transformers
Figure 2 for Foundation Transformers
Figure 3 for Foundation Transformers
Figure 4 for Foundation Transformers
Viaarxiv icon

GNN-SL: Sequence Labeling Based on Nearest Examples via GNN

Add code
Bookmark button
Alert button
Dec 12, 2022
Shuhe Wang, Yuxian Meng, Rongbin Ouyang, Jiwei Li, Tianwei Zhang, Lingjuan Lyu, Guoyin Wang

Figure 1 for GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Figure 2 for GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Figure 3 for GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Figure 4 for GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Viaarxiv icon

GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge

Add code
Bookmark button
Alert button
Sep 21, 2022
Dongkeon Park, Yechan Yu, Kyeong Wan Park, Ji Won Kim, Hong Kook Kim

Figure 1 for GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge
Figure 2 for GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge
Figure 3 for GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge
Figure 4 for GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge
Viaarxiv icon

HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition

Apr 13, 2022
Ji Won Yoon, Beom Jun Woo, Nam Soo Kim

Figure 1 for HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition
Figure 2 for HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition
Figure 3 for HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition
Figure 4 for HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition
Viaarxiv icon

Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition

Oct 21, 2021
Ting-Yao Hu, Mohammadreza Armandpour, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Oncel Tuzel

Figure 1 for Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
Figure 2 for Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
Figure 3 for Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
Figure 4 for Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
Viaarxiv icon

Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement

Dec 09, 2021
Yi Li, Yang Sun, Kirill Horoshenkov, Syed Mohsen Naqvi

Figure 1 for Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement
Figure 2 for Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement
Figure 3 for Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement
Figure 4 for Domain Adaptation and Autoencoder Based Unsupervised Speech Enhancement
Viaarxiv icon