"speech": models, code, and papers

Context-sensitive neocortical neurons transform the effectiveness and efficiency of neural information processing

Jul 15, 2022
Ahsan Adeel, Mario Franco, Mohsin Raza, Khubaib Ahmed

Via arXiv

Consistent Transcription and Translation of Speech

Aug 28, 2020
Matthias Sperber, Hendra Setiawan, Christian Gollan, Udhyakumar Nallasamy, Matthias Paulik

Via arXiv

textless-lib: a Library for Textless Spoken Language Processing

Feb 15, 2022
Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi

Via arXiv

DeepFry: Identifying Vocal Fry Using Deep Neural Networks

Mar 31, 2022
Bronya R. Chernyak, Talia Ben Simon, Yael Segal, Jeremy Steffman, Eleanor Chodroff, Jennifer S. Cole, Joseph Keshet

Via arXiv

Data augmentation for learning predictive models on EEG: a systematic comparison

Jun 29, 2022
Cédric Rommel, Joseph Paillard, Thomas Moreau, Alexandre Gramfort

Via arXiv

Speech prosody and remote experiments: a technical report

Jun 21, 2021
Giuseppe Magistro

Via arXiv

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation

Jan 07, 2022
Holy Lovenia, Samuel Cahyawijaya, Genta Indra Winata, Peng Xu, Xu Yan, Zihan Liu, Rita Frieske, Tiezheng Yu, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung

Via arXiv

Deep Variational Generative Models for Audio-visual Speech Separation

Aug 17, 2020
Viet-Nhat Nguyen, Mostafa Sadeghi, Elisa Ricci, Xavier Alameda-Pineda

Via arXiv

Unsupervised Speech Decomposition via Triple Information Bottleneck

Apr 23, 2020
Kaizhi Qian, Yang Zhang, Shiyu Chang, David Cox, Mark Hasegawa-Johnson

Via arXiv

Speaker Re-identification with Speaker Dependent Speech Enhancement

May 15, 2020
Yanpei Shi, Qiang Huang, Thomas Hain

Via arXiv