"speech": models, code, and papers

Automatic Speech Recognition Datasets in Cantonese Language: A Survey and a New Dataset

Jan 07, 2022
Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung

Memories are One-to-Many Mapping Alleviators in Talking Face Generation

Dec 09, 2022
Anni Tang, Tianyu He, Xu Tan, Jun Ling, Runnan Li, Sheng Zhao, Li Song, Jiang Bian

Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models

Nov 09, 2022
Travis M. Bartley, Fei Jia, Krishna C. Puvvada, Samuel Kriman, Boris Ginsburg

A comparison of several AI techniques for authorship attribution on Romanian texts

Nov 09, 2022
Sanda Maria Avram, Mihai Oltean

Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection

Nov 20, 2021
Samuele Cornell, Thomas Balestri, Thibaud Sénéchal

User-Level Differential Privacy against Attribute Inference Attack of Speech Emotion Recognition in Federated Learning

Apr 05, 2022
Tiantian Feng, Raghuveer Peri, Shrikanth Narayanan

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction

Dec 27, 2021
Jiangyu Han, Yanhua Long, Lukas Burget, Jan Cernocky

Robotic Speech Synthesis: Perspectives on Interactions, Scenarios, and Ethics

Mar 17, 2022
Yuanchao Li, Catherine Lai

Learnable Front Ends Based on Temporal Modulation for Music Tagging

Nov 28, 2022
Yinghao Ma, Richard M. Stern

Fusing ASR Outputs in Joint Training for Speech Emotion Recognition

Oct 29, 2021
Yuanchao Li, Peter Bell, Catherine Lai
