Alert button

"speech recognition": models, code, and papers
Alert button

Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models

Oct 05, 2016
Mahdi Khademian, Mohammad Mehdi Homayounpour

Figure 1 for Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
Figure 2 for Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
Figure 3 for Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
Figure 4 for Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
Viaarxiv icon

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Jun 02, 2022
Sehoon Kim, Amir Gholami, Albert Shaw, Nicholas Lee, Karttikeya Mangalam, Jitendra Malik, Michael W. Mahoney, Kurt Keutzer

Figure 1 for Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Figure 2 for Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Figure 3 for Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Figure 4 for Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Viaarxiv icon

cif-based collaborative decoding for end-to-end contextual speech recognition

Add code
Bookmark button
Alert button
Dec 17, 2020
Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu

Figure 1 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 2 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 3 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 4 for cif-based collaborative decoding for end-to-end contextual speech recognition
Viaarxiv icon

Speech Recognition by Machine, A Review

Jan 13, 2010
M. A. Anusuya, S. K. Katti

Figure 1 for Speech Recognition by Machine, A Review
Figure 2 for Speech Recognition by Machine, A Review
Figure 3 for Speech Recognition by Machine, A Review
Figure 4 for Speech Recognition by Machine, A Review
Viaarxiv icon

Multiple Confidence Gates For Joint Training Of SE And ASR

Apr 01, 2022
Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Figure 1 for Multiple Confidence Gates For Joint Training Of SE And ASR
Figure 2 for Multiple Confidence Gates For Joint Training Of SE And ASR
Figure 3 for Multiple Confidence Gates For Joint Training Of SE And ASR
Figure 4 for Multiple Confidence Gates For Joint Training Of SE And ASR
Viaarxiv icon

Learning ASR pathways: A sparse multilingual ASR model

Sep 13, 2022
Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, John H. L. Hansen, Ozlem Kalinli

Figure 1 for Learning ASR pathways: A sparse multilingual ASR model
Figure 2 for Learning ASR pathways: A sparse multilingual ASR model
Figure 3 for Learning ASR pathways: A sparse multilingual ASR model
Figure 4 for Learning ASR pathways: A sparse multilingual ASR model
Viaarxiv icon

English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System

May 09, 2021
Guillermo Cámbara, Alex Peiró-Lilja, Mireia Farrús, Jordi Luque

Figure 1 for English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System
Figure 2 for English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System
Viaarxiv icon

Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition

Add code
Bookmark button
Alert button
Jan 24, 2021
Cheng Yi, Shiyu Zhou, Bo Xu

Figure 1 for Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition
Figure 2 for Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition
Figure 3 for Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition
Figure 4 for Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition
Viaarxiv icon

Transformer-based Acoustic Modeling for Hybrid Speech Recognition

Oct 22, 2019
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer

Figure 1 for Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Figure 2 for Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Figure 3 for Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Figure 4 for Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Viaarxiv icon

End-to-End Visual Speech Recognition for Small-Scale Datasets

Apr 02, 2019
Stavros Petridis, Yujiang Wang, Pingchuan Ma, Zuwei Li, Maja Pantic

Figure 1 for End-to-End Visual Speech Recognition for Small-Scale Datasets
Figure 2 for End-to-End Visual Speech Recognition for Small-Scale Datasets
Figure 3 for End-to-End Visual Speech Recognition for Small-Scale Datasets
Figure 4 for End-to-End Visual Speech Recognition for Small-Scale Datasets
Viaarxiv icon