Alert button

"speech recognition": models, code, and papers
Alert button

Character-Aware Attention-Based End-to-End Speech Recognition

Jan 06, 2020
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong

Figure 1 for Character-Aware Attention-Based End-to-End Speech Recognition
Figure 2 for Character-Aware Attention-Based End-to-End Speech Recognition
Figure 3 for Character-Aware Attention-Based End-to-End Speech Recognition
Figure 4 for Character-Aware Attention-Based End-to-End Speech Recognition
Viaarxiv icon

Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Jun 20, 2022
Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan

Figure 1 for Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Figure 2 for Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Figure 3 for Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Figure 4 for Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Viaarxiv icon

MM-ALT: A Multimodal Automatic Lyric Transcription System

Add code
Bookmark button
Alert button
Jul 13, 2022
Xiangming Gu, Longshen Ou, Danielle Ong, Ye Wang

Figure 1 for MM-ALT: A Multimodal Automatic Lyric Transcription System
Figure 2 for MM-ALT: A Multimodal Automatic Lyric Transcription System
Figure 3 for MM-ALT: A Multimodal Automatic Lyric Transcription System
Figure 4 for MM-ALT: A Multimodal Automatic Lyric Transcription System
Viaarxiv icon

Multilingual Alzheimer's Dementia Recognition through Spontaneous Speech: a Signal Processing Grand Challenge

Jan 13, 2023
Saturnino Luz, Fasih Haider, Davida Fromm, Ioulietta Lazarou, Ioannis Kompatsiaris, Brian MacWhinney

Viaarxiv icon

Open Source Automatic Speech Recognition for German

Add code
Bookmark button
Alert button
Jul 26, 2018
Benjamin Milde, Arne Köhn

Figure 1 for Open Source Automatic Speech Recognition for German
Figure 2 for Open Source Automatic Speech Recognition for German
Figure 3 for Open Source Automatic Speech Recognition for German
Figure 4 for Open Source Automatic Speech Recognition for German
Viaarxiv icon

Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-Sequence Model

Dec 02, 2017
Pin-Jung Chen, I-Hung Hsu, Yi-Yao Huang, Hung-Yi Lee

Figure 1 for Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-Sequence Model
Figure 2 for Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-Sequence Model
Figure 3 for Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-Sequence Model
Figure 4 for Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-Sequence Model
Viaarxiv icon

A Deep Dive into Deep Cluster

Jul 24, 2022
Ahmad Mustapha, Wael Khreich, Wasim Masr

Figure 1 for A Deep Dive into Deep Cluster
Figure 2 for A Deep Dive into Deep Cluster
Figure 3 for A Deep Dive into Deep Cluster
Figure 4 for A Deep Dive into Deep Cluster
Viaarxiv icon

Improving the Training Recipe for a Robust Conformer-based Hybrid Model

Add code
Bookmark button
Alert button
Jun 26, 2022
Mohammad Zeineldeen, Jingjing Xu, Christoph Lüscher, Ralf Schlüter, Hermann Ney

Figure 1 for Improving the Training Recipe for a Robust Conformer-based Hybrid Model
Figure 2 for Improving the Training Recipe for a Robust Conformer-based Hybrid Model
Figure 3 for Improving the Training Recipe for a Robust Conformer-based Hybrid Model
Figure 4 for Improving the Training Recipe for a Robust Conformer-based Hybrid Model
Viaarxiv icon

Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition

Jul 10, 2019
Khoi-Nguyen C. Mac, Xiaodong Cui, Wei Zhang, Michael Picheny

Figure 1 for Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition
Figure 2 for Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition
Figure 3 for Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition
Viaarxiv icon

Meta Auxiliary Learning for Low-resource Spoken Language Understanding

Jun 26, 2022
Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang

Figure 1 for Meta Auxiliary Learning for Low-resource Spoken Language Understanding
Figure 2 for Meta Auxiliary Learning for Low-resource Spoken Language Understanding
Figure 3 for Meta Auxiliary Learning for Low-resource Spoken Language Understanding
Figure 4 for Meta Auxiliary Learning for Low-resource Spoken Language Understanding
Viaarxiv icon