Alert button

"speech recognition": models, code, and papers
Alert button

Automatic Speech Recognition Using Template Model for Man-Machine Interface

May 09, 2013
Neema Mishra, Urmila Shrawankar, V M Thakare

Figure 1 for Automatic Speech Recognition Using Template Model for Man-Machine Interface
Figure 2 for Automatic Speech Recognition Using Template Model for Man-Machine Interface
Figure 3 for Automatic Speech Recognition Using Template Model for Man-Machine Interface
Figure 4 for Automatic Speech Recognition Using Template Model for Man-Machine Interface
Viaarxiv icon

Understanding Audio Features via Trainable Basis Functions

Add code
Bookmark button
Alert button
Apr 25, 2022
Kwan Yee Heung, Kin Wai Cheuk, Dorien Herremans

Figure 1 for Understanding Audio Features via Trainable Basis Functions
Figure 2 for Understanding Audio Features via Trainable Basis Functions
Figure 3 for Understanding Audio Features via Trainable Basis Functions
Figure 4 for Understanding Audio Features via Trainable Basis Functions
Viaarxiv icon

Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition

Oct 01, 2019
Shahram Ghorbani, Soheil Khorram, John H. L. Hansen

Figure 1 for Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition
Figure 2 for Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition
Figure 3 for Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition
Viaarxiv icon

Automated speech tools for helping communities process restricted-access corpora for language revival efforts

Add code
Bookmark button
Alert button
Apr 24, 2022
Nay San, Martijn Bartelds, Tolúlopé Ògúnrèmí, Alison Mount, Ruben Thompson, Michael Higgins, Roy Barker, Jane Simpson, Dan Jurafsky

Figure 1 for Automated speech tools for helping communities process restricted-access corpora for language revival efforts
Figure 2 for Automated speech tools for helping communities process restricted-access corpora for language revival efforts
Figure 3 for Automated speech tools for helping communities process restricted-access corpora for language revival efforts
Figure 4 for Automated speech tools for helping communities process restricted-access corpora for language revival efforts
Viaarxiv icon

Cloud-Based Face and Speech Recognition for Access Control Applications

Apr 23, 2020
Nathalie Tkauc, Thao Tran, Kevin Hernandez-Diaz, Fernando Alonso-Fernandez

Figure 1 for Cloud-Based Face and Speech Recognition for Access Control Applications
Figure 2 for Cloud-Based Face and Speech Recognition for Access Control Applications
Figure 3 for Cloud-Based Face and Speech Recognition for Access Control Applications
Figure 4 for Cloud-Based Face and Speech Recognition for Access Control Applications
Viaarxiv icon

Research on several key technologies in practical speech emotion recognition

Sep 27, 2017
Chengwei Huang

Viaarxiv icon

Attention-Based Models for Speech Recognition

Add code
Bookmark button
Alert button
Jun 24, 2015
Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho, Yoshua Bengio

Figure 1 for Attention-Based Models for Speech Recognition
Figure 2 for Attention-Based Models for Speech Recognition
Figure 3 for Attention-Based Models for Speech Recognition
Figure 4 for Attention-Based Models for Speech Recognition
Viaarxiv icon

End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model

Add code
Bookmark button
Alert button
Mar 12, 2019
Yangyang Shi, Mei-Yuh Hwang, Xin Lei

Figure 1 for End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Figure 2 for End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Figure 3 for End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Viaarxiv icon

Ensemble of Jointly Trained Deep Neural Network-Based Acoustic Models for Reverberant Speech Recognition

Aug 17, 2016
Jeehye Lee, Myungin Lee, Joon-Hyuk Chang

Figure 1 for Ensemble of Jointly Trained Deep Neural Network-Based Acoustic Models for Reverberant Speech Recognition
Figure 2 for Ensemble of Jointly Trained Deep Neural Network-Based Acoustic Models for Reverberant Speech Recognition
Figure 3 for Ensemble of Jointly Trained Deep Neural Network-Based Acoustic Models for Reverberant Speech Recognition
Figure 4 for Ensemble of Jointly Trained Deep Neural Network-Based Acoustic Models for Reverberant Speech Recognition
Viaarxiv icon

Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos

Add code
Bookmark button
Alert button
Jun 09, 2022
Alexander Waibel, Moritz Behr, Fevziye Irem Eyiokur, Dogucan Yaman, Tuan-Nam Nguyen, Carlos Mullov, Mehmet Arif Demirtas, Alperen Kantarcı, Stefan Constantin, Hazım Kemal Ekenel

Figure 1 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 2 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 3 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 4 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Viaarxiv icon