"speech": models, code, and papers

High-Quality Vocoding Design with Signal Processing for Speech Synthesis and Voice Conversion

Jan 25, 2021
Mohammed Salah Al-Radhi

Speech enhancement with mixture-of-deep-experts with clean clustering pre-training

Feb 11, 2021
Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot

Speech Emotion Recognition using Semantic Information

Mar 04, 2021
Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller

CoVoST 2 and Massively Multilingual Speech-to-Text Translation

Aug 20, 2020
Changhan Wang, Anne Wu, Juan Pino

Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge

Feb 10, 2022
Jingguang Tian, Xinhui Hu, Xinkang Xu

On the limit of English conversational speech recognition

May 03, 2021
Zoltán Tüske, George Saon, Brian Kingsbury

MLS: A Large-Scale Multilingual Dataset for Speech Research

Dec 07, 2020
Vineel Pratap, Qiantong Xu, Anuroop Sriram, Gabriel Synnaeve, Ronan Collobert

Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech

Mar 30, 2021
Chenglin Xu, Wei Rao, Jibin Wu, Haizhou Li

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

Jun 30, 2021
Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani

End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection

Jun 08, 2021
Yuki Takashima, Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Paola García, Kenji Nagamatsu
