"speech": models, code, and papers

Source Tracing: Detecting Voice Spoofing

Dec 16, 2022
Tinglong Zhu, Xingming Wang, Xiaoyi Qin, Ming Li

Emotional Prosody Control for Speech Generation

Nov 07, 2021
Sarath Sivaprasad, Saiteja Kosgi, Vineet Gandhi

SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing

Nov 04, 2022
Siwen Ding, You Zhang, Zhiyao Duan

Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning

Apr 08, 2022
Eesung Kim, Jae-Jin Jeon, Hyeji Seo, Hoon Kim

Subword Dictionary Learning and Segmentation Techniques for Automatic Speech Recognition in Tamil and Kannada

Jul 27, 2022
Madhavaraj A, Bharathi Pilar, Ramakrishnan A G

Part-of-Speech Tagging of Odia Language Using Statistical and Deep Learning-Based Approaches

Jul 07, 2022
Tusarkanta Dalai, Tapas Kumar Mishra, Pankaj K Sa

AVATAR: Unconstrained Audiovisual Speech Recognition

Jun 15, 2022
Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid

FFC-SE: Fast Fourier Convolution for Speech Enhancement

Apr 06, 2022
Ivan Shchekotov, Pavel Andreev, Oleg Ivanov, Aibek Alanov, Dmitry Vetrov

Affective Faces for Goal-Driven Dyadic Communication

Jan 26, 2023
Scott Geng, Revant Teotia, Purva Tendulkar, Sachit Menon, Carl Vondrick

Federated Self-supervised Speech Representations: Are We There Yet?

Apr 06, 2022
Yan Gao, Javier Fernandez-Marques, Titouan Parcollet, Abhinav Mehrotra, Nicholas D. Lane
