Alert button

"speech": models, code, and papers
Alert button

Spell my name: keyword boosted speech recognition

Oct 06, 2021
Namkyu Jung, Geonmin Kim, Joon Son Chung

Figure 1 for Spell my name: keyword boosted speech recognition
Figure 2 for Spell my name: keyword boosted speech recognition
Figure 3 for Spell my name: keyword boosted speech recognition
Figure 4 for Spell my name: keyword boosted speech recognition
Viaarxiv icon

Multilingual Speech Recognition for Low-Resource Indian Languages using Multi-Task conformer

Sep 10, 2021
Krishna D N

Figure 1 for Multilingual Speech Recognition for Low-Resource Indian Languages using Multi-Task conformer
Figure 2 for Multilingual Speech Recognition for Low-Resource Indian Languages using Multi-Task conformer
Figure 3 for Multilingual Speech Recognition for Low-Resource Indian Languages using Multi-Task conformer
Figure 4 for Multilingual Speech Recognition for Low-Resource Indian Languages using Multi-Task conformer
Viaarxiv icon

Impact and dynamics of hate and counter speech online

Add code
Bookmark button
Alert button
Sep 18, 2020
Joshua Garland, Keyan Ghazi-Zahedi, Jean-Gabriel Young, Laurent Hébert-Dufresne, Mirta Galesic

Figure 1 for Impact and dynamics of hate and counter speech online
Figure 2 for Impact and dynamics of hate and counter speech online
Figure 3 for Impact and dynamics of hate and counter speech online
Figure 4 for Impact and dynamics of hate and counter speech online
Viaarxiv icon

The Use of Voice Source Features for Sung Speech Recognition

Add code
Bookmark button
Alert button
Feb 20, 2021
Gerardo Roa Dabike, Jon Barker

Figure 1 for The Use of Voice Source Features for Sung Speech Recognition
Figure 2 for The Use of Voice Source Features for Sung Speech Recognition
Figure 3 for The Use of Voice Source Features for Sung Speech Recognition
Figure 4 for The Use of Voice Source Features for Sung Speech Recognition
Viaarxiv icon

Multi-modal fusion with gating using audio, lexical and disfluency features for Alzheimer's Dementia recognition from spontaneous speech

Add code
Bookmark button
Alert button
Jun 17, 2021
Morteza Rohanian, Julian Hough, Matthew Purver

Figure 1 for Multi-modal fusion with gating using audio, lexical and disfluency features for Alzheimer's Dementia recognition from spontaneous speech
Figure 2 for Multi-modal fusion with gating using audio, lexical and disfluency features for Alzheimer's Dementia recognition from spontaneous speech
Figure 3 for Multi-modal fusion with gating using audio, lexical and disfluency features for Alzheimer's Dementia recognition from spontaneous speech
Figure 4 for Multi-modal fusion with gating using audio, lexical and disfluency features for Alzheimer's Dementia recognition from spontaneous speech
Viaarxiv icon

Learning robust speech representation with an articulatory-regularized variational autoencoder

Apr 07, 2021
Marc-Antoine Georges, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber

Figure 1 for Learning robust speech representation with an articulatory-regularized variational autoencoder
Figure 2 for Learning robust speech representation with an articulatory-regularized variational autoencoder
Figure 3 for Learning robust speech representation with an articulatory-regularized variational autoencoder
Figure 4 for Learning robust speech representation with an articulatory-regularized variational autoencoder
Viaarxiv icon

Practical Speech Re-use Prevention in Voice-driven Services

Jan 12, 2021
Yangyong Zhang, Maliheh Shirvanian, Sunpreet S. Arora, Jianwei Huang, Guofei Gu

Figure 1 for Practical Speech Re-use Prevention in Voice-driven Services
Figure 2 for Practical Speech Re-use Prevention in Voice-driven Services
Figure 3 for Practical Speech Re-use Prevention in Voice-driven Services
Figure 4 for Practical Speech Re-use Prevention in Voice-driven Services
Viaarxiv icon

Distribution Aware Metrics for Conditional Natural Language Generation

Sep 29, 2022
David M Chan, Yiming Ni, David A Ross, Sudheendra Vijayanarasimhan, Austin Myers, John Canny

Figure 1 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 2 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 3 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 4 for Distribution Aware Metrics for Conditional Natural Language Generation
Viaarxiv icon

Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition

Oct 07, 2021
Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland

Figure 1 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 2 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 3 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Figure 4 for Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition
Viaarxiv icon