Alert button

"speech recognition": models, code, and papers
Alert button

Trustera: A Live Conversation Redaction System

Mar 16, 2023
Evandro Gouvêa, Ali Dadgar, Shahab Jalalvand, Rathi Chengalvarayan, Badrinath Jayakumar, Ryan Price, Nicholas Ruiz, Jennifer McGovern, Srinivas Bangalore, Ben Stern

Figure 1 for Trustera: A Live Conversation Redaction System
Figure 2 for Trustera: A Live Conversation Redaction System
Figure 3 for Trustera: A Live Conversation Redaction System
Viaarxiv icon

Continuous Silent Speech Recognition using EEG

Feb 13, 2020
Gautam Krishna, Co Tran, Mason Carnahan, Ahmed Tewfik

Figure 1 for Continuous Silent Speech Recognition using EEG
Figure 2 for Continuous Silent Speech Recognition using EEG
Figure 3 for Continuous Silent Speech Recognition using EEG
Figure 4 for Continuous Silent Speech Recognition using EEG
Viaarxiv icon

Enhancing Cross-lingual Transfer via Phonemic Transcription Integration

Add code
Bookmark button
Alert button
Jul 10, 2023
Hoang H. Nguyen, Chenwei Zhang, Tao Zhang, Eugene Rohrbaugh, Philip S. Yu

Figure 1 for Enhancing Cross-lingual Transfer via Phonemic Transcription Integration
Figure 2 for Enhancing Cross-lingual Transfer via Phonemic Transcription Integration
Figure 3 for Enhancing Cross-lingual Transfer via Phonemic Transcription Integration
Figure 4 for Enhancing Cross-lingual Transfer via Phonemic Transcription Integration
Viaarxiv icon

DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition

Jul 06, 2022
Jiamin Xie, John H. L. Hansen

Figure 1 for DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition
Figure 2 for DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition
Figure 3 for DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition
Figure 4 for DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition
Viaarxiv icon

A Conformer Based Acoustic Model for Robust Automatic Speech Recognition

Mar 01, 2022
Yufeng Yang, Peidong Wang, DeLiang Wang

Figure 1 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 2 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 3 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 4 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Viaarxiv icon

Language Dependencies in Adversarial Attacks on Speech Recognition Systems

Add code
Bookmark button
Alert button
Feb 02, 2022
Karla Markert, Donika Mirdita, Konstantin Böttinger

Figure 1 for Language Dependencies in Adversarial Attacks on Speech Recognition Systems
Figure 2 for Language Dependencies in Adversarial Attacks on Speech Recognition Systems
Figure 3 for Language Dependencies in Adversarial Attacks on Speech Recognition Systems
Figure 4 for Language Dependencies in Adversarial Attacks on Speech Recognition Systems
Viaarxiv icon

Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification

Aug 04, 2021
Sangeeta Ghangam, Daniel Whitenack, Joshua Nemecek

Figure 1 for Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification
Figure 2 for Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification
Figure 3 for Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification
Figure 4 for Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification
Viaarxiv icon

Visualizing Automatic Speech Recognition -- Means for a Better Understanding?

Add code
Bookmark button
Alert button
Feb 01, 2022
Karla Markert, Romain Parracone, Mykhailo Kulakov, Philip Sperl, Ching-Yu Kao, Konstantin Böttinger

Figure 1 for Visualizing Automatic Speech Recognition -- Means for a Better Understanding?
Figure 2 for Visualizing Automatic Speech Recognition -- Means for a Better Understanding?
Figure 3 for Visualizing Automatic Speech Recognition -- Means for a Better Understanding?
Figure 4 for Visualizing Automatic Speech Recognition -- Means for a Better Understanding?
Viaarxiv icon

Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy

Mar 14, 2023
Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao

Figure 1 for Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy
Figure 2 for Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy
Figure 3 for Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy
Figure 4 for Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy
Viaarxiv icon

Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition

Jan 24, 2022
Xurong Xie, Xiang Sui, Xunying Liu, Lan Wang

Viaarxiv icon