"speech": models, code, and papers

Neural Emotion Director: Speech-preserving semantic control of facial expressions in "in-the-wild" videos

Dec 01, 2021
Foivos Paraperas Papantoniou, Panagiotis P. Filntisis, Petros Maragos, Anastasios Roussos

Alzheimer's Dementia Recognition through Spontaneous Speech: The ADReSS Challenge

Apr 14, 2020
Saturnino Luz, Fasih Haider, Sofia de la Fuente, Davida Fromm, Brian MacWhinney

Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge

Feb 08, 2022
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu

Dictionary Attacks on Speaker Verification

Apr 24, 2022
Mirko Marras, Pawel Korus, Anubhav Jain, Nasir Memon

A practical introduction to the Rational Speech Act modeling framework

May 20, 2021
Gregory Scontras, Michael Henry Tessler, Michael Franke

Sparse Mixture of Local Experts for Efficient Speech Enhancement

May 16, 2020
Aswin Sivaraman, Minje Kim

Automatic Analysis of the Emotional Content of Speech in Daylong Child-Centered Recordings from a Neonatal Intensive Care Unit

Jun 14, 2021
Einari Vaaras, Sari Ahlqvist-Björkroth, Konstantinos Drossos, Okko Räsänen

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

Sep 09, 2020
Quan Wang, Ignacio Lopez Moreno, Mert Saglam, Kevin Wilson, Alan Chiao, Renjie Liu, Yanzhang He, Wei Li, Jason Pelecanos, Marily Nika, Alexander Gruenstein

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation

Dec 17, 2021
Holy Lovenia, Samuel Cahyawijaya, Genta Indra Winata, Peng Xu, Xu Yan, Zihan Liu, Rita Frieske, Tiezheng Yu, Wenliang Dai, Elham J. Barezi, Pascale Fung

DeepFry: Identifying Vocal Fry Using Deep Neural Networks

Mar 31, 2022
Bronya R. Chernyak, Talia Ben Simon, Yael Segal, Jeremy Steffman, Eleanor Chodroff, Jennifer S. Cole, Joseph Keshet
