Alert button

"speech": models, code, and papers
Alert button

Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition

Add code
Bookmark button
Alert button
Sep 17, 2021
Guangzhi Sun, Chao Zhang, Philip C. Woodland

Figure 1 for Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Figure 2 for Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Figure 3 for Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Figure 4 for Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Viaarxiv icon

Multimodal Crop Type Classification Fusing Multi-Spectral Satellite Time Series with Farmers Crop Rotations and Local Crop Distribution

Add code
Bookmark button
Alert button
Aug 23, 2022
Valentin Barriere, Martin Claverie

Figure 1 for Multimodal Crop Type Classification Fusing Multi-Spectral Satellite Time Series with Farmers Crop Rotations and Local Crop Distribution
Figure 2 for Multimodal Crop Type Classification Fusing Multi-Spectral Satellite Time Series with Farmers Crop Rotations and Local Crop Distribution
Figure 3 for Multimodal Crop Type Classification Fusing Multi-Spectral Satellite Time Series with Farmers Crop Rotations and Local Crop Distribution
Viaarxiv icon

The VoicePrivacy 2022 Challenge Evaluation Plan

Add code
Bookmark button
Alert button
Mar 27, 2022
Natalia Tomashenko, Xin Wang, Xiaoxiao Miao, Hubert Nourtel, Pierre Champion, Massimiliano Todisco, Emmanuel Vincent, Nicholas Evans, Junichi Yamagishi, Jean-François Bonastre

Figure 1 for The VoicePrivacy 2022 Challenge Evaluation Plan
Figure 2 for The VoicePrivacy 2022 Challenge Evaluation Plan
Figure 3 for The VoicePrivacy 2022 Challenge Evaluation Plan
Figure 4 for The VoicePrivacy 2022 Challenge Evaluation Plan
Viaarxiv icon

Residual Excitation Skewness for Automatic Speech Polarity Detection

May 31, 2020
Thomas Drugman

Figure 1 for Residual Excitation Skewness for Automatic Speech Polarity Detection
Figure 2 for Residual Excitation Skewness for Automatic Speech Polarity Detection
Figure 3 for Residual Excitation Skewness for Automatic Speech Polarity Detection
Figure 4 for Residual Excitation Skewness for Automatic Speech Polarity Detection
Viaarxiv icon

TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices

Aug 11, 2020
Alexander Wong, Mahmoud Famouri, Maya Pavlova, Siddharth Surana

Figure 1 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Figure 2 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Figure 3 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Viaarxiv icon

Deep Multi-Task Models for Misogyny Identification and Categorization on Arabic Social Media

Jun 16, 2022
Abdelkader El Mahdaouy, Abdellah El Mekki, Ahmed Oumar, Hajar Mousannif, Ismail Berrada

Figure 1 for Deep Multi-Task Models for Misogyny Identification and Categorization on Arabic Social Media
Figure 2 for Deep Multi-Task Models for Misogyny Identification and Categorization on Arabic Social Media
Figure 3 for Deep Multi-Task Models for Misogyny Identification and Categorization on Arabic Social Media
Figure 4 for Deep Multi-Task Models for Misogyny Identification and Categorization on Arabic Social Media
Viaarxiv icon

textless-lib: a Library for Textless Spoken Language Processing

Add code
Bookmark button
Alert button
Feb 15, 2022
Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi

Figure 1 for textless-lib: a Library for Textless Spoken Language Processing
Figure 2 for textless-lib: a Library for Textless Spoken Language Processing
Figure 3 for textless-lib: a Library for Textless Spoken Language Processing
Figure 4 for textless-lib: a Library for Textless Spoken Language Processing
Viaarxiv icon

ASR in German: A Detailed Error Analysis

Add code
Bookmark button
Alert button
Apr 12, 2022
Johannes Wirth, Rene Peinl

Figure 1 for ASR in German: A Detailed Error Analysis
Figure 2 for ASR in German: A Detailed Error Analysis
Figure 3 for ASR in German: A Detailed Error Analysis
Figure 4 for ASR in German: A Detailed Error Analysis
Viaarxiv icon

Are Chess Discussions Racist? An Adversarial Hate Speech Data Set

Nov 20, 2020
Rupak Sarkar, Ashiqur R. KhudaBukhsh

Figure 1 for Are Chess Discussions Racist? An Adversarial Hate Speech Data Set
Figure 2 for Are Chess Discussions Racist? An Adversarial Hate Speech Data Set
Figure 3 for Are Chess Discussions Racist? An Adversarial Hate Speech Data Set
Viaarxiv icon

Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation

Add code
Bookmark button
Alert button
Mar 25, 2021
Jiyoung Lee, Soo-Whan Chung, Sunok Kim, Hong-Goo Kang, Kwanghoon Sohn

Figure 1 for Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation
Figure 2 for Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation
Figure 3 for Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation
Figure 4 for Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation
Viaarxiv icon