Alert button

"speech recognition": models, code, and papers
Alert button

Performance Evaluation of Deep Convolutional Maxout Neural Network in Speech Recognition

May 04, 2021
Arash Dehghani, Seyyed Ali Seyyedsalehi

Figure 1 for Performance Evaluation of Deep Convolutional Maxout Neural Network in Speech Recognition
Viaarxiv icon

Streaming Multi-talker Speech Recognition with Joint Speaker Identification

Add code
Bookmark button
Alert button
Apr 05, 2021
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong

Figure 1 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 2 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 3 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Figure 4 for Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Viaarxiv icon

Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition

Jun 15, 2022
Shujie Hu, Xurong Xie, Mengzhe Geng, Mingyu Cui, Jiajun Deng, Tianzi Wang, Xunying Liu, Helen Meng

Figure 1 for Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition
Figure 2 for Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition
Figure 3 for Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition
Viaarxiv icon

A Token-Wise Beam Search Algorithm for RNN-T

Feb 28, 2023
Gil Keren

Figure 1 for A Token-Wise Beam Search Algorithm for RNN-T
Figure 2 for A Token-Wise Beam Search Algorithm for RNN-T
Figure 3 for A Token-Wise Beam Search Algorithm for RNN-T
Figure 4 for A Token-Wise Beam Search Algorithm for RNN-T
Viaarxiv icon

A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning

Add code
Bookmark button
Alert button
May 05, 2023
Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier

Viaarxiv icon

L2 proficiency assessment using self-supervised speech representations

Nov 16, 2022
Stefano Bannò, Kate M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales

Figure 1 for L2 proficiency assessment using self-supervised speech representations
Figure 2 for L2 proficiency assessment using self-supervised speech representations
Figure 3 for L2 proficiency assessment using self-supervised speech representations
Figure 4 for L2 proficiency assessment using self-supervised speech representations
Viaarxiv icon

TransFusion: Transcribing Speech with Multinomial Diffusion

Add code
Bookmark button
Alert button
Oct 14, 2022
Matthew Baas, Kevin Eloff, Herman Kamper

Figure 1 for TransFusion: Transcribing Speech with Multinomial Diffusion
Figure 2 for TransFusion: Transcribing Speech with Multinomial Diffusion
Figure 3 for TransFusion: Transcribing Speech with Multinomial Diffusion
Figure 4 for TransFusion: Transcribing Speech with Multinomial Diffusion
Viaarxiv icon

Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognit

Mar 23, 2023
Haoyu Tang, Zhaoyi Liu, Chang Zeng, Xinfeng Li

Figure 1 for Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognit
Figure 2 for Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognit
Figure 3 for Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognit
Figure 4 for Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognit
Viaarxiv icon

Neural-FST Class Language Model for End-to-End Speech Recognition

Jan 31, 2022
Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer

Figure 1 for Neural-FST Class Language Model for End-to-End Speech Recognition
Figure 2 for Neural-FST Class Language Model for End-to-End Speech Recognition
Figure 3 for Neural-FST Class Language Model for End-to-End Speech Recognition
Viaarxiv icon

Continuous Speech Recognition using EEG and Video

Dec 19, 2019
Gautam Krishna, Mason Carnahan, Co Tran, Ahmed H Tewfik

Figure 1 for Continuous Speech Recognition using EEG and Video
Figure 2 for Continuous Speech Recognition using EEG and Video
Figure 3 for Continuous Speech Recognition using EEG and Video
Figure 4 for Continuous Speech Recognition using EEG and Video
Viaarxiv icon