Alert button

"speech recognition": models, code, and papers
Alert button

Massively Multilingual Shallow Fusion with Large Language Models

Feb 17, 2023
Ke Hu, Tara N. Sainath, Bo Li, Nan Du, Yanping Huang, Andrew M. Dai, Yu Zhang, Rodrigo Cabrera, Zhifeng Chen, Trevor Strohman

Figure 1 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 2 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 3 for Massively Multilingual Shallow Fusion with Large Language Models
Figure 4 for Massively Multilingual Shallow Fusion with Large Language Models
Viaarxiv icon

Improving EEG based Continuous Speech Recognition

Nov 30, 2019
Gautam Krishna, Co Tran, Mason Carnahan, Yan Han, Ahmed H Tewfik

Figure 1 for Improving EEG based Continuous Speech Recognition
Figure 2 for Improving EEG based Continuous Speech Recognition
Figure 3 for Improving EEG based Continuous Speech Recognition
Figure 4 for Improving EEG based Continuous Speech Recognition
Viaarxiv icon

Conformer-based End-to-end Speech Recognition With Rotary Position Embedding

Jul 13, 2021
Shengqiang Li, Menglong Xu, Xiao-Lei Zhang

Figure 1 for Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Figure 2 for Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Figure 3 for Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Figure 4 for Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Viaarxiv icon

DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks

Mar 08, 2023
Zohreh Aghababaeyan, Manel Abdellatif, Mahboubeh Dadkhah, Lionel Briand

Figure 1 for DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks
Figure 2 for DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks
Figure 3 for DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks
Figure 4 for DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks
Viaarxiv icon

Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition

Jan 11, 2022
Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Naoyuki Kamo, Takafumi Moriya

Figure 1 for Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition
Figure 2 for Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition
Figure 3 for Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition
Viaarxiv icon

Jointly Learning Visual and Auditory Speech Representations from Raw Data

Add code
Bookmark button
Alert button
Dec 12, 2022
Alexandros Haliassos, Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Maja Pantic

Figure 1 for Jointly Learning Visual and Auditory Speech Representations from Raw Data
Figure 2 for Jointly Learning Visual and Auditory Speech Representations from Raw Data
Figure 3 for Jointly Learning Visual and Auditory Speech Representations from Raw Data
Figure 4 for Jointly Learning Visual and Auditory Speech Representations from Raw Data
Viaarxiv icon

Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives

Add code
Bookmark button
Alert button
Jan 25, 2023
Tanvi Dinkar, Chloé Clavel, Ioana Vasilescu

Figure 1 for Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives
Figure 2 for Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives
Viaarxiv icon

On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode

Jun 26, 2022
Raviraj Joshi, Subodh Kumar

Figure 1 for On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
Figure 2 for On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
Figure 3 for On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
Viaarxiv icon

Self-Supervised Learning for speech recognition with Intermediate layer supervision

Add code
Bookmark button
Alert button
Dec 16, 2021
Chengyi Wang, Yu Wu, Sanyuan Chen, Shujie Liu, Jinyu Li, Yao Qian, Zhenglu Yang

Figure 1 for Self-Supervised Learning for speech recognition with Intermediate layer supervision
Figure 2 for Self-Supervised Learning for speech recognition with Intermediate layer supervision
Figure 3 for Self-Supervised Learning for speech recognition with Intermediate layer supervision
Figure 4 for Self-Supervised Learning for speech recognition with Intermediate layer supervision
Viaarxiv icon