"speech recognition": models, code, and papers

Robust Classification using Hidden Markov Models and Mixtures of Normalizing Flows

Feb 15, 2021
Anubhab Ghosh, Antoine Honoré, Dong Liu, Gustav Eje Henter, Saikat Chatterjee

Via arXiv

Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR

Aug 27, 2021
Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Oliver Ohneiser, Hartmut Helmke, Saeed Sarfjoo, Iuliia Nigmatulina

Via arXiv

Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models

Oct 01, 2020
Thai Binh Nguyen, Quang Minh Nguyen, Thi Thu Hien Nguyen, Quoc Truong Do, Chi Mai Luong

Via arXiv

StutterNet: Stuttering Detection Using Time Delay Neural Network

May 12, 2021
Shakeel A. Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni

Via arXiv

Semantic sentence similarity: size does not always matter

Jun 16, 2021
Danny Merkx, Stefan L. Frank, Mirjam Ernestus

Via arXiv

Integrating Recurrence Dynamics for Speech Emotion Recognition

Nov 09, 2018
Efthymios Tzinis, Georgios Paraskevopoulos, Christos Baziotis, Alexandros Potamianos

Via arXiv

XY Neural Networks

Mar 31, 2021
Nikita Stroev, Natalia G. Berloff

Via arXiv

On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition

Feb 14, 2017
Angel Mario Castro Martinez, Sri Harish Mallidi, Bernd T. Meyer

Via arXiv

PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription

Sep 17, 2021
Chen Zhang, Jiaxing Yu, LuChin Chang, Xu Tan, Jiawei Chen, Tao Qin, Kejun Zhang

Via arXiv

Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages

Sep 16, 2021
Anoop C S, Prathosh A P, A G Ramakrishnan

Via arXiv