Alert button

"speech": models, code, and papers
Alert button

Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces

Sep 19, 2022
Vishwa Shah, Aditya Sharma, Gautam Shroff, Lovekesh Vig, Tirtharaj Dash, Ashwin Srinivasan

Figure 1 for Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces
Figure 2 for Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces
Figure 3 for Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces
Figure 4 for Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces
Viaarxiv icon

Speaker Recognition in the Wild

Add code
Bookmark button
Alert button
May 05, 2022
Neeraj Chhimwal, Anirudh Gupta, Rishabh Gaur, Harveen Singh Chadha, Priyanshi Shah, Ankur Dhuriya, Vivek Raghavan

Figure 1 for Speaker Recognition in the Wild
Figure 2 for Speaker Recognition in the Wild
Figure 3 for Speaker Recognition in the Wild
Viaarxiv icon

MFFCN: Multi-layer Feature Fusion Convolution Network for Audio-visual Speech Enhancement

Jan 15, 2021
Xinmeng Xu, Dongxiang Xu, Jie Jia, Yang Wang, Binbin Chen

Figure 1 for MFFCN: Multi-layer Feature Fusion Convolution Network for Audio-visual Speech Enhancement
Figure 2 for MFFCN: Multi-layer Feature Fusion Convolution Network for Audio-visual Speech Enhancement
Figure 3 for MFFCN: Multi-layer Feature Fusion Convolution Network for Audio-visual Speech Enhancement
Figure 4 for MFFCN: Multi-layer Feature Fusion Convolution Network for Audio-visual Speech Enhancement
Viaarxiv icon

Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments

Jun 28, 2021
Pasi Pertilä, Emre Cakir, Aapo Hakala, Eemi Fagerlund, Tuomas Virtanen, Archontis Politis, Antti Eronen

Figure 1 for Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments
Figure 2 for Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments
Figure 3 for Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments
Figure 4 for Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments
Viaarxiv icon

Attentive activation function for improving end-to-end spoofing countermeasure systems

May 03, 2022
Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan

Figure 1 for Attentive activation function for improving end-to-end spoofing countermeasure systems
Figure 2 for Attentive activation function for improving end-to-end spoofing countermeasure systems
Figure 3 for Attentive activation function for improving end-to-end spoofing countermeasure systems
Figure 4 for Attentive activation function for improving end-to-end spoofing countermeasure systems
Viaarxiv icon

Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes

Add code
Bookmark button
Alert button
Jun 15, 2021
Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina

Figure 1 for Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
Figure 2 for Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
Figure 3 for Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
Figure 4 for Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
Viaarxiv icon

Reproducibility Report: Contextualizing Hate Speech Classifiers with Post-hoc Explanation

Add code
Bookmark button
Alert button
May 24, 2021
Kiran Purohit, Owais Iqbal, Ankan Mullick

Figure 1 for Reproducibility Report: Contextualizing Hate Speech Classifiers with Post-hoc Explanation
Figure 2 for Reproducibility Report: Contextualizing Hate Speech Classifiers with Post-hoc Explanation
Figure 3 for Reproducibility Report: Contextualizing Hate Speech Classifiers with Post-hoc Explanation
Figure 4 for Reproducibility Report: Contextualizing Hate Speech Classifiers with Post-hoc Explanation
Viaarxiv icon

Streaming Simultaneous Speech Translation with Augmented Memory Transformer

Add code
Bookmark button
Alert button
Oct 30, 2020
Xutai Ma, Yongqiang Wang, Mohammad Javad Dousti, Philipp Koehn, Juan Pino

Figure 1 for Streaming Simultaneous Speech Translation with Augmented Memory Transformer
Figure 2 for Streaming Simultaneous Speech Translation with Augmented Memory Transformer
Figure 3 for Streaming Simultaneous Speech Translation with Augmented Memory Transformer
Figure 4 for Streaming Simultaneous Speech Translation with Augmented Memory Transformer
Viaarxiv icon

Incorporating Broad Phonetic Information for Speech Enhancement

Aug 13, 2020
Yen-Ju Lu, Chien-Feng Liao, Xugang Lu, Jeih-weih Hung, Yu Tsao

Figure 1 for Incorporating Broad Phonetic Information for Speech Enhancement
Figure 2 for Incorporating Broad Phonetic Information for Speech Enhancement
Figure 3 for Incorporating Broad Phonetic Information for Speech Enhancement
Figure 4 for Incorporating Broad Phonetic Information for Speech Enhancement
Viaarxiv icon

KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset

Add code
Bookmark button
Alert button
Apr 26, 2021
Saida Mussakhojayeva, Aigerim Janaliyeva, Almas Mirzakhmetov, Yerbolat Khassanov, Huseyin Atakan Varol

Figure 1 for KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Figure 2 for KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Figure 3 for KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Figure 4 for KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Viaarxiv icon