Alert button

"speech": models, code, and papers
Alert button

Neural inhibition during speech planning contributes to contrastive hyperarticulation

Sep 25, 2022
Michael C. Stern, Jason A. Shaw

Figure 1 for Neural inhibition during speech planning contributes to contrastive hyperarticulation
Figure 2 for Neural inhibition during speech planning contributes to contrastive hyperarticulation
Figure 3 for Neural inhibition during speech planning contributes to contrastive hyperarticulation
Figure 4 for Neural inhibition during speech planning contributes to contrastive hyperarticulation
Viaarxiv icon

MetaSpeech: Speech Effects Switch Along with Environment for Metaverse

Oct 25, 2022
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao

Figure 1 for MetaSpeech: Speech Effects Switch Along with Environment for Metaverse
Figure 2 for MetaSpeech: Speech Effects Switch Along with Environment for Metaverse
Figure 3 for MetaSpeech: Speech Effects Switch Along with Environment for Metaverse
Figure 4 for MetaSpeech: Speech Effects Switch Along with Environment for Metaverse
Viaarxiv icon

SQuId: Measuring Speech Naturalness in Many Languages

Oct 12, 2022
Thibault Sellam, Ankur Bapna, Joshua Camp, Diana Mackinnon, Ankur P. Parikh, Jason Riesa

Figure 1 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 2 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 3 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 4 for SQuId: Measuring Speech Naturalness in Many Languages
Viaarxiv icon

ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech

Add code
Bookmark button
Alert button
Sep 23, 2022
Saeed Ghorbani, Ylva Ferstl, Daniel Holden, Nikolaus F. Troje, Marc-André Carbonneau

Figure 1 for ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech
Figure 2 for ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech
Figure 3 for ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech
Figure 4 for ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech
Viaarxiv icon

Predictive Neural Speech Coding

Add code
Bookmark button
Alert button
Jul 18, 2022
Xue Jiang, Xiulian Peng, Huaying Xue, Yuan Zhang, Yan Lu

Figure 1 for Predictive Neural Speech Coding
Figure 2 for Predictive Neural Speech Coding
Figure 3 for Predictive Neural Speech Coding
Figure 4 for Predictive Neural Speech Coding
Viaarxiv icon

The 2022 NIST Language Recognition Evaluation

Feb 28, 2023
Yooyoung Lee, Craig Greenberg, Eliot Godard, Asad A. Butt, Elliot Singer, Trang Nguyen, Lisa Mason, Douglas Reynolds

Figure 1 for The 2022 NIST Language Recognition Evaluation
Figure 2 for The 2022 NIST Language Recognition Evaluation
Figure 3 for The 2022 NIST Language Recognition Evaluation
Figure 4 for The 2022 NIST Language Recognition Evaluation
Viaarxiv icon

Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset

Add code
Bookmark button
Alert button
Sep 11, 2022
H. A. Z. Sameen Shahgir, Khondker Salman Sayeed, Tanjeem Azwad Zaman

Figure 1 for Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Figure 2 for Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Figure 3 for Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Figure 4 for Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Viaarxiv icon

A Planning-Based Explainable Collaborative Dialogue System

Mar 02, 2023
Philip R. Cohen, Lucian Galescu

Figure 1 for A Planning-Based Explainable Collaborative Dialogue System
Figure 2 for A Planning-Based Explainable Collaborative Dialogue System
Figure 3 for A Planning-Based Explainable Collaborative Dialogue System
Figure 4 for A Planning-Based Explainable Collaborative Dialogue System
Viaarxiv icon

Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features

Add code
Bookmark button
Alert button
Nov 01, 2022
Alexandra Vioni, Georgia Maniati, Nikolaos Ellinas, June Sig Sung, Inchul Hwang, Aimilios Chalamandaris, Pirros Tsiakoulis

Figure 1 for Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features
Figure 2 for Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features
Figure 3 for Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features
Figure 4 for Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features
Viaarxiv icon

Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis

Add code
Bookmark button
Alert button
Nov 01, 2022
Karolos Nikitaras, Konstantinos Klapsas, Nikolaos Ellinas, Georgia Maniati, June Sig Sung, Inchul Hwang, Spyros Raptis, Aimilios Chalamandaris, Pirros Tsiakoulis

Figure 1 for Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis
Figure 2 for Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis
Figure 3 for Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis
Figure 4 for Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis
Viaarxiv icon