Alert button

"speech": models, code, and papers
Alert button

Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jan 28, 2022
Piotr Żelasko, Siyuan Feng, Laureano Moro Velazquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak

Figure 1 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 2 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 3 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 4 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Viaarxiv icon

Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin

Aug 27, 2021
Zane Durante, Leena Mathur, Eric Ye, Sichong Zhao, Tejas Ramdas, Khalil Iskarous

Figure 1 for Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin
Figure 2 for Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin
Figure 3 for Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin
Figure 4 for Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin
Viaarxiv icon

Inter-channel Conv-TasNet for multichannel speech enhancement

Nov 08, 2021
Dongheon Lee, Seongrae Kim, Jung-Woo Choi

Figure 1 for Inter-channel Conv-TasNet for multichannel speech enhancement
Figure 2 for Inter-channel Conv-TasNet for multichannel speech enhancement
Figure 3 for Inter-channel Conv-TasNet for multichannel speech enhancement
Figure 4 for Inter-channel Conv-TasNet for multichannel speech enhancement
Viaarxiv icon

A study on cross-corpus speech emotion recognition and data augmentation

Add code
Bookmark button
Alert button
Jan 10, 2022
Norbert Braunschweiler, Rama Doddipatla, Simon Keizer, Svetlana Stoyanchev

Figure 1 for A study on cross-corpus speech emotion recognition and data augmentation
Figure 2 for A study on cross-corpus speech emotion recognition and data augmentation
Figure 3 for A study on cross-corpus speech emotion recognition and data augmentation
Figure 4 for A study on cross-corpus speech emotion recognition and data augmentation
Viaarxiv icon

Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids

Jun 06, 2022
Leandro A. Passos, João Paulo Papa, Ahsan Adeel

Figure 1 for Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids
Figure 2 for Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids
Figure 3 for Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids
Figure 4 for Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids
Viaarxiv icon

Modeling Dependent Structure for Utterances in ASR Evaluation

Sep 07, 2022
Zhe Liu, Fuchun Peng

Figure 1 for Modeling Dependent Structure for Utterances in ASR Evaluation
Figure 2 for Modeling Dependent Structure for Utterances in ASR Evaluation
Figure 3 for Modeling Dependent Structure for Utterances in ASR Evaluation
Figure 4 for Modeling Dependent Structure for Utterances in ASR Evaluation
Viaarxiv icon

Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems

Add code
Bookmark button
Alert button
Jun 18, 2022
Danwei Cai, Zexin Cai, Ming Li

Figure 1 for Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Figure 2 for Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Figure 3 for Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Figure 4 for Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Viaarxiv icon

Signal inpainting from Fourier magnitudes

Add code
Bookmark button
Alert button
Oct 28, 2022
Louis Bahrman, Marina Krémé, Paul Magron, Antoine Deleforge

Figure 1 for Signal inpainting from Fourier magnitudes
Figure 2 for Signal inpainting from Fourier magnitudes
Figure 3 for Signal inpainting from Fourier magnitudes
Figure 4 for Signal inpainting from Fourier magnitudes
Viaarxiv icon

CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition

Add code
Bookmark button
Alert button
Jan 11, 2022
Wenliang Dai, Samuel Cahyawijaya, Tiezheng Yu, Elham J. Barezi, Peng Xu, Cheuk Tung Shadow Yiu, Rita Frieske, Holy Lovenia, Genta Indra Winata, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung

Figure 1 for CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
Figure 2 for CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
Figure 3 for CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
Figure 4 for CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
Viaarxiv icon