"speech recognition": models, code, and papers

Algorithms for Speech Recognition and Language Processing

Sep 17, 1996
Mehryar Mohri, Michael Riley, Richard Sproat

Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization

Feb 13, 2019
Jorge Davila-Chacon, Jindong Liu, Stefan Wermter

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

Feb 07, 2022
Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, Michael Auli

The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems

Jul 13, 2020
Hadi Abdullah, Kevin Warren, Vincent Bindschaedler, Nicolas Papernot, Patrick Traynor

Key-Sparse Transformer with Cascaded Cross-Attention Block for Multimodal Speech Emotion Recognition

Jun 22, 2021
Weidong Chen, Xiaofeng Xing, Xiangmin Xu, Jichen Yang, Jianxin Pang

Quantitative phase and absorption contrast imaging

Mar 23, 2022
Miguel Moscoso, Alexei Novikov, George Papanicolaou, Chrysoula Tsogka

Cross Lingual Speech Emotion Recognition: Urdu vs. Western Languages

Dec 15, 2018
Siddique Latif, Adnan Qayyum, Muhammad Usman, Junaid Qadir

Integrating HMM-Based Speech Recognition With Direct Manipulation In A Multimodal Korean Natural Language Interface

Nov 18, 1996
Geunbae Lee, Jong-Hyeok Lee, Sangeok Kim

The IBM 2015 English Conversational Telephone Speech Recognition System

May 21, 2015
George Saon, Hong-Kwang J. Kuo, Steven Rennie, Michael Picheny

Light Gated Recurrent Units for Speech Recognition

Mar 26, 2018
Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio
