Alert button

"speech": models, code, and papers
Alert button

Cognitive Coding of Speech

Oct 08, 2021
Reza Lotfidereshgi, Philippe Gournay

Figure 1 for Cognitive Coding of Speech
Figure 2 for Cognitive Coding of Speech
Figure 3 for Cognitive Coding of Speech
Viaarxiv icon

Language Agnostic Data-Driven Inverse Text Normalization

Jan 24, 2023
Szu-Jui Chen, Debjyoti Paul, Yutong Pang, Peng Su, Xuedong Zhang

Figure 1 for Language Agnostic Data-Driven Inverse Text Normalization
Figure 2 for Language Agnostic Data-Driven Inverse Text Normalization
Figure 3 for Language Agnostic Data-Driven Inverse Text Normalization
Figure 4 for Language Agnostic Data-Driven Inverse Text Normalization
Viaarxiv icon

Fearless Steps Challenge Phase-1 Evaluation Plan

Add code
Bookmark button
Alert button
Nov 03, 2022
Aditya Joglekar, John H. L. Hansen

Figure 1 for Fearless Steps Challenge Phase-1 Evaluation Plan
Figure 2 for Fearless Steps Challenge Phase-1 Evaluation Plan
Figure 3 for Fearless Steps Challenge Phase-1 Evaluation Plan
Figure 4 for Fearless Steps Challenge Phase-1 Evaluation Plan
Viaarxiv icon

Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language

May 06, 2022
Martin Malmsten, Chris Haffenden, Love Börjeson

Figure 1 for Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language
Figure 2 for Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language
Figure 3 for Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language
Figure 4 for Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language
Viaarxiv icon

A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning

Feb 11, 2022
Tassadaq Hussain, Muhammad Diyan, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Yu Tsao, Amir Hussain

Figure 1 for A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning
Figure 2 for A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning
Figure 3 for A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning
Figure 4 for A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning
Viaarxiv icon

Bidirectional Representations for Low Resource Spoken Language Understanding

Nov 24, 2022
Quentin Meeus, Marie-Francine Moens, Hugo Van hamme

Figure 1 for Bidirectional Representations for Low Resource Spoken Language Understanding
Figure 2 for Bidirectional Representations for Low Resource Spoken Language Understanding
Figure 3 for Bidirectional Representations for Low Resource Spoken Language Understanding
Figure 4 for Bidirectional Representations for Low Resource Spoken Language Understanding
Viaarxiv icon

Differentiable Duration Modeling for End-to-End Text-to-Speech

Add code
Bookmark button
Alert button
Mar 21, 2022
Bac Nguyen, Fabien Cardinaux, Stefan Uhlich

Figure 1 for Differentiable Duration Modeling for End-to-End Text-to-Speech
Figure 2 for Differentiable Duration Modeling for End-to-End Text-to-Speech
Figure 3 for Differentiable Duration Modeling for End-to-End Text-to-Speech
Figure 4 for Differentiable Duration Modeling for End-to-End Text-to-Speech
Viaarxiv icon

Discourse and conversation impairments in patients with dementia

Dec 03, 2022
Charalambos Themistocleous

Viaarxiv icon

Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees

Jul 16, 2022
Michael Shoemate, Kevin Jett, Ethan Cowan, Sean Colbath, James Honaker, Prasanna Muthukumar

Figure 1 for Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees
Figure 2 for Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees
Figure 3 for Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees
Viaarxiv icon

Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training

Jun 27, 2022
Bowen Zhang, Songjun Cao, Xiaoming Zhang, Yike Zhang, Long Ma, Takahiro Shinozaki

Figure 1 for Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Figure 2 for Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Figure 3 for Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Figure 4 for Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Viaarxiv icon