Alert button

"speech": models, code, and papers
Alert button

The Impact of Silence on Speech Anti-Spoofing

Sep 21, 2023
Yuxiang Zhang, Zhuo Li, Jingze Lu, Hua Hua, Wenchao Wang, Pengyuan Zhang

Figure 1 for The Impact of Silence on Speech Anti-Spoofing
Figure 2 for The Impact of Silence on Speech Anti-Spoofing
Figure 3 for The Impact of Silence on Speech Anti-Spoofing
Figure 4 for The Impact of Silence on Speech Anti-Spoofing
Viaarxiv icon

Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction

Add code
Bookmark button
Alert button
Sep 25, 2023
Leying Zhang, Yao Qian, Linfeng Yu, Heming Wang, Xinkai Wang, Hemin Yang, Long Zhou, Shujie Liu, Yanmin Qian, Michael Zeng

Viaarxiv icon

K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings

Add code
Bookmark button
Alert button
Oct 24, 2023
Chaewon Park, Soohwan Kim, Kyubyong Park, Kunwoo Park

Viaarxiv icon

Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer

Oct 05, 2023
Paul-Ambroise Duquenne, Holger Schwenk, Benoît Sagot

Figure 1 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 2 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 3 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 4 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Viaarxiv icon

Analysis of Visual Features for Continuous Lipreading in Spanish

Nov 21, 2023
David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos

Viaarxiv icon

Multi-dimensional Speech Quality Assessment in Crowdsourcing

Add code
Bookmark button
Alert button
Sep 14, 2023
Babak Naderi, Ross Cutler, Nicolae-Catalin Ristea

Figure 1 for Multi-dimensional Speech Quality Assessment in Crowdsourcing
Figure 2 for Multi-dimensional Speech Quality Assessment in Crowdsourcing
Figure 3 for Multi-dimensional Speech Quality Assessment in Crowdsourcing
Figure 4 for Multi-dimensional Speech Quality Assessment in Crowdsourcing
Viaarxiv icon

DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation

Add code
Bookmark button
Alert button
Nov 08, 2023
Guinan Su, Yanwu Yang, Zhifeng Li

Viaarxiv icon

Improving Startup Success with Text Analysis

Dec 11, 2023
Emily Gavrilenko, Foaad Khosmood, Mahdi Rastad, Sadra Amiri Moghaddam

Viaarxiv icon

XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words

Add code
Bookmark button
Alert button
Oct 08, 2023
Robin Algayres, Pablo Diego-Simon, Benoit Sagot, Emmanuel Dupoux

Figure 1 for XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words
Figure 2 for XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words
Figure 3 for XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words
Figure 4 for XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words
Viaarxiv icon

A Comprehensive Survey on Multi-modal Conversational Emotion Recognition with Deep Learning

Dec 10, 2023
Yuntao Shou, Tao Meng, Wei Ai, Nan Yin, Keqin Li

Viaarxiv icon