Alert button

"speech recognition": models, code, and papers
Alert button

An evaluation of word-level confidence estimation for end-to-end automatic speech recognition

Jan 14, 2021
Dan Oneata, Alexandru Caranica, Adriana Stan, Horia Cucu

Figure 1 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 2 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 3 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 4 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Viaarxiv icon

Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks

Mar 09, 2022
Yizhou Lu, Mingkun Huang, Xinghua Qu, Pengfei Wei, Zejun Ma

Figure 1 for Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Figure 2 for Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Figure 3 for Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Figure 4 for Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Viaarxiv icon

Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition

Oct 22, 2020
Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman

Figure 1 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 2 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 3 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Figure 4 for Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Viaarxiv icon

Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation

Aug 26, 2022
Zoey Liu, Justin Spence, Emily Prud'hommeaux

Figure 1 for Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation
Figure 2 for Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation
Figure 3 for Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation
Figure 4 for Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation
Viaarxiv icon

NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement

May 20, 2022
Meng Yu, Yong Xu, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu

Figure 1 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 2 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 3 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 4 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Viaarxiv icon

Sentiment recognition of Italian elderly through domain adaptation on cross-corpus speech dataset

Nov 14, 2022
Francesca Gasparini, Alessandra Grossi

Figure 1 for Sentiment recognition of Italian elderly through domain adaptation on cross-corpus speech dataset
Figure 2 for Sentiment recognition of Italian elderly through domain adaptation on cross-corpus speech dataset
Figure 3 for Sentiment recognition of Italian elderly through domain adaptation on cross-corpus speech dataset
Figure 4 for Sentiment recognition of Italian elderly through domain adaptation on cross-corpus speech dataset
Viaarxiv icon

Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition

Jun 21, 2022
Einari Vaaras, Manu Airaksinen, Okko Räsänen

Figure 1 for Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition
Figure 2 for Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition
Figure 3 for Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition
Figure 4 for Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition
Viaarxiv icon

Improving speech recognition by revising gated recurrent units

Add code
Bookmark button
Alert button
Sep 29, 2017
Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

Figure 1 for Improving speech recognition by revising gated recurrent units
Figure 2 for Improving speech recognition by revising gated recurrent units
Figure 3 for Improving speech recognition by revising gated recurrent units
Figure 4 for Improving speech recognition by revising gated recurrent units
Viaarxiv icon

Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data

Oct 22, 2020
Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao

Figure 1 for Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Figure 2 for Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Figure 3 for Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Figure 4 for Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data
Viaarxiv icon

Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks

Add code
Bookmark button
Alert button
Jan 10, 2017
Ying Zhang, Mohammad Pezeshki, Philemon Brakel, Saizheng Zhang, Cesar Laurent Yoshua Bengio, Aaron Courville

Figure 1 for Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Figure 2 for Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Figure 3 for Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Figure 4 for Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Viaarxiv icon