"speech recognition": models, code, and papers

Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition

Nov 21, 2017
Zhong Meng, Shinji Watanabe, John R. Hershey, Hakan Erdogan

Via arXiv

SDS-200: A Swiss German Speech to Standard German Text Corpus

May 19, 2022
Michel Plüss, Manuela Hürlimann, Marc Cuny, Alla Stöckli, Nikolaos Kapotis, Julia Hartmann, Malgorzata Anna Ulasik, Christian Scheller, Yanick Schraner, Amit Jain, Jan Deriu, Mark Cieliebak, Manfred Vogel

Via arXiv

Improving RNN-T ASR Performance with Date-Time and Location Awareness

Jun 11, 2021
Swayambhu Nath Ray, Soumyajit Mitra, Raghavendra Bilgi, Sri Garimella

Via arXiv

Generalizing RNN-Transducer to Out-Domain Audio via Sparse Self-Attention Layers

Aug 22, 2021
Juntae Kim, Jeehye Lee, Yoonhan Lee

Via arXiv

End-to-End Attention-based Large Vocabulary Speech Recognition

Mar 14, 2016
Dzmitry Bahdanau, Jan Chorowski, Dmitriy Serdyuk, Philemon Brakel, Yoshua Bengio

Via arXiv

Speech Recognition Oriented Vowel Classification Using Temporal Radial Basis Functions

Dec 19, 2009
Mustapha Guezouri, Larbi Mesbahi, Abdelkader Benyettou

Via arXiv

Distillation-Resistant Watermarking for Model Protection in NLP

Oct 07, 2022
Xuandong Zhao, Lei Li, Yu-Xiang Wang

Via arXiv

Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR

Jan 26, 2022
Yufei Liu, Rao Ma, Haihua Xu, Yi He, Zejun Ma, Weibin Zhang

Via arXiv

Large Scale Language Modeling in Automatic Speech Recognition

Oct 31, 2012
Ciprian Chelba, Dan Bikel, Maria Shugrina, Patrick Nguyen, Shankar Kumar

Via arXiv

Prediction of speech intelligibility with DNN-based performance measures

Mar 17, 2022
Angel Mario Castro Martinez, Constantin Spille, Jana Roßbach, Birger Kollmeier, Bernd T. Meyer

Via arXiv