Alert button

"speech": models, code, and papers
Alert button

A Study on the Reliability of Automatic Dysarthric Speech Assessments

Add code
Bookmark button
Alert button
Jun 07, 2023
Xavier F. Cadet, Ranya Aloufi, Sara Ahmadi-Abhari, Hamed Haddadi

Figure 1 for A Study on the Reliability of Automatic Dysarthric Speech Assessments
Figure 2 for A Study on the Reliability of Automatic Dysarthric Speech Assessments
Figure 3 for A Study on the Reliability of Automatic Dysarthric Speech Assessments
Figure 4 for A Study on the Reliability of Automatic Dysarthric Speech Assessments
Viaarxiv icon

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

Sep 14, 2023
Tae Jin Park, Kunal Dhawan, Nithin Koluguri, Jagadeesh Balam

Figure 1 for Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach
Figure 2 for Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach
Figure 3 for Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach
Figure 4 for Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach
Viaarxiv icon

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Add code
Bookmark button
Alert button
Sep 14, 2023
Guanlong Zhao, Yongqiang Wang, Jason Pelecanos, Yu Zhang, Hank Liao, Yiling Huang, Han Lu, Quan Wang

Figure 1 for USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models
Figure 2 for USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models
Figure 3 for USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models
Figure 4 for USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models
Viaarxiv icon

Quantifying the perceptual value of lexical and non-lexical channels in speech

Add code
Bookmark button
Alert button
Jul 07, 2023
Sarenne Wallbridge, Peter Bell, Catherine Lai

Figure 1 for Quantifying the perceptual value of lexical and non-lexical channels in speech
Figure 2 for Quantifying the perceptual value of lexical and non-lexical channels in speech
Figure 3 for Quantifying the perceptual value of lexical and non-lexical channels in speech
Viaarxiv icon

Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration

Add code
Bookmark button
Alert button
May 25, 2023
Rustem Yeshpanov, Saida Mussakhojayeva, Yerbolat Khassanov

Figure 1 for Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration
Figure 2 for Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration
Figure 3 for Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration
Viaarxiv icon

Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations

Add code
Bookmark button
Alert button
Jun 01, 2023
Salah Zaiem, Titouan Parcollet, Slim Essid

Figure 1 for Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
Figure 2 for Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
Figure 3 for Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
Figure 4 for Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
Viaarxiv icon

Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data

Add code
Bookmark button
Alert button
Jun 14, 2023
Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura

Figure 1 for Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data
Figure 2 for Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data
Figure 3 for Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data
Figure 4 for Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data
Viaarxiv icon

Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss

May 24, 2023
Hiroshi Sato, Ryo Masumura, Tsubasa Ochiai, Marc Delcroix, Takafumi Moriya, Takanori Ashihara, Kentaro Shinayama, Saki Mizuno, Mana Ihori, Tomohiro Tanaka, Nobukatsu Hojo

Figure 1 for Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Figure 2 for Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Figure 3 for Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Figure 4 for Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
Viaarxiv icon

AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment

Add code
Bookmark button
Alert button
May 13, 2023
Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao

Figure 1 for AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
Figure 2 for AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
Figure 3 for AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
Figure 4 for AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
Viaarxiv icon

The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN

Add code
Bookmark button
Alert button
Jun 08, 2023
Zheng Yuan, Aldo Pastore, Dorina de Jong, Hao Xu, Luciano Fadiga, Alessandro D'Ausilio

Figure 1 for The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN
Figure 2 for The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN
Figure 3 for The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN
Figure 4 for The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN
Viaarxiv icon