Alert button

"speech recognition": models, code, and papers
Alert button

StutterNet: Stuttering Detection Using Time Delay Neural Network

May 12, 2021
Shakeel A. Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni

Figure 1 for StutterNet: Stuttering Detection Using Time Delay Neural Network
Figure 2 for StutterNet: Stuttering Detection Using Time Delay Neural Network
Figure 3 for StutterNet: Stuttering Detection Using Time Delay Neural Network
Figure 4 for StutterNet: Stuttering Detection Using Time Delay Neural Network
Viaarxiv icon

Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR

Aug 27, 2021
Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Oliver Ohneiser, Hartmut Helmke, Saeed Sarfjoo, Iuliia Nigmatulina

Figure 1 for Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR
Figure 2 for Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR
Figure 3 for Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR
Figure 4 for Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR
Viaarxiv icon

XY Neural Networks

Mar 31, 2021
Nikita Stroev, Natalia G. Berloff

Figure 1 for XY Neural Networks
Figure 2 for XY Neural Networks
Figure 3 for XY Neural Networks
Figure 4 for XY Neural Networks
Viaarxiv icon

Semantic sentence similarity: size does not always matter

Add code
Bookmark button
Alert button
Jun 16, 2021
Danny Merkx, Stefan L. Frank, Mirjam Ernestus

Figure 1 for Semantic sentence similarity: size does not always matter
Figure 2 for Semantic sentence similarity: size does not always matter
Figure 3 for Semantic sentence similarity: size does not always matter
Figure 4 for Semantic sentence similarity: size does not always matter
Viaarxiv icon

PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription

Add code
Bookmark button
Alert button
Sep 17, 2021
Chen Zhang, Jiaxing Yu, LuChin Chang, Xu Tan, Jiawei Chen, Tao Qin, Kejun Zhang

Figure 1 for PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Figure 2 for PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Figure 3 for PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Figure 4 for PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Viaarxiv icon

Muddling Label Regularization: Deep Learning for Tabular Datasets

Add code
Bookmark button
Alert button
Jun 29, 2021
Karim Lounici, Katia Meziani, Benjamin Riu

Figure 1 for Muddling Label Regularization: Deep Learning for Tabular Datasets
Figure 2 for Muddling Label Regularization: Deep Learning for Tabular Datasets
Figure 3 for Muddling Label Regularization: Deep Learning for Tabular Datasets
Figure 4 for Muddling Label Regularization: Deep Learning for Tabular Datasets
Viaarxiv icon

Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages

Sep 16, 2021
Anoop C S, Prathosh A P, A G Ramakrishnan

Figure 1 for Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages
Figure 2 for Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages
Figure 3 for Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages
Figure 4 for Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages
Viaarxiv icon

Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset

Add code
Bookmark button
Alert button
Dec 02, 2019
Akam Qader, Hossein Hassani

Figure 1 for Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset
Viaarxiv icon

The USTC-NEL Speech Translation system at IWSLT 2018

Add code
Bookmark button
Alert button
Dec 06, 2018
Dan Liu, Junhua Liu, Wu Guo, Shifu Xiong, Zhiqiang Ma, Rui Song, Chongliang Wu, Quan Liu

Figure 1 for The USTC-NEL Speech Translation system at IWSLT 2018
Figure 2 for The USTC-NEL Speech Translation system at IWSLT 2018
Figure 3 for The USTC-NEL Speech Translation system at IWSLT 2018
Figure 4 for The USTC-NEL Speech Translation system at IWSLT 2018
Viaarxiv icon

On Knowledge Distillation for Direct Speech Translation

Add code
Bookmark button
Alert button
Dec 09, 2020
Marco Gaido, Mattia A. Di Gangi, Matteo Negri, Marco Turchi

Figure 1 for On Knowledge Distillation for Direct Speech Translation
Figure 2 for On Knowledge Distillation for Direct Speech Translation
Figure 3 for On Knowledge Distillation for Direct Speech Translation
Figure 4 for On Knowledge Distillation for Direct Speech Translation
Viaarxiv icon