"speech": models, code, and papers
Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches

Jul 07, 2022
Tusarkanta Dalai, Tapas Kumar Mishra, Pankaj K Sa

A Brief Overview of Unsupervised Neural Speech Representation Learning

Mar 01, 2022
Lasse Borgholt, Jakob Drachmann Havtorn, Joakim Edin, Lars Maaløe, Christian Igel

Everything is Connected: Graph Neural Networks

Jan 19, 2023
Petar Veličković

Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language

May 19, 2022
Martin Malmsten, Chris Haffenden, Love Börjeson

Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots

Nov 01, 2022
Akanksha Saran, Kush Desai, Mai Lee Chang, Rudolf Lioutikov, Andrea Thomaz, Scott Niekum

Learning Speech Emotion Representations in the Quaternion Domain

Apr 05, 2022
Eric Guizzo, Tillman Weyde, Simone Scardapane, Danilo Comminiello

Interpretation and Analysis of the Steady-State Neural Response to Complex Sequential Structures: a Methodological Note

Jan 03, 2023
Nai Ding

Weak-Supervised Dysarthria-invariant Features for Spoken Language Understanding using an FHVAE and Adversarial Training

Oct 24, 2022
Jinzi Qi, Hugo Van hamme

Local-global speaker representation for target speaker extraction

Oct 28, 2022
Shulin He, Wei Rao, Kanghao Zhang, Yukai Ju, Yang Yang, Xueliang Zhang, Yannan Wang, Shidong Shang

Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders

Oct 28, 2022
Jason Fong, Yun Wang, Prabhav Agrawal, Vimal Manohar, Jilong Wu, Thilo Köhler, Qing He
