Alert button

"speech recognition": models, code, and papers
Alert button

Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling

Nov 10, 2018
Hainan Xu, Shuoyang Ding, Shinji Watanabe

Figure 1 for Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling
Figure 2 for Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling
Figure 3 for Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling
Figure 4 for Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling
Viaarxiv icon

Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection

Add code
Bookmark button
Alert button
Nov 10, 2017
Taku Kato, Takahiro Shinozaki

Figure 1 for Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection
Figure 2 for Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection
Figure 3 for Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection
Figure 4 for Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection
Viaarxiv icon

STSC-SNN: Spatio-Temporal Synaptic Connection with Temporal Convolution and Attention for Spiking Neural Networks

Oct 11, 2022
Chengting Yu, Zheming Gu, Da Li, Gaoang Wang, Aili Wang, Erping Li

Figure 1 for STSC-SNN: Spatio-Temporal Synaptic Connection with Temporal Convolution and Attention for Spiking Neural Networks
Figure 2 for STSC-SNN: Spatio-Temporal Synaptic Connection with Temporal Convolution and Attention for Spiking Neural Networks
Figure 3 for STSC-SNN: Spatio-Temporal Synaptic Connection with Temporal Convolution and Attention for Spiking Neural Networks
Figure 4 for STSC-SNN: Spatio-Temporal Synaptic Connection with Temporal Convolution and Attention for Spiking Neural Networks
Viaarxiv icon

A Novel Exploitative and Explorative GWO-SVM Algorithm for Smart Emotion Recognition

Jan 05, 2023
Xucun Yan, Zihuai Lin, Zhiyun Lin, Branka Vucetic

Figure 1 for A Novel Exploitative and Explorative GWO-SVM Algorithm for Smart Emotion Recognition
Figure 2 for A Novel Exploitative and Explorative GWO-SVM Algorithm for Smart Emotion Recognition
Figure 3 for A Novel Exploitative and Explorative GWO-SVM Algorithm for Smart Emotion Recognition
Figure 4 for A Novel Exploitative and Explorative GWO-SVM Algorithm for Smart Emotion Recognition
Viaarxiv icon

Speech Emotion Recognition Based on Multi-feature and Multi-lingual Fusion

Jan 16, 2020
Chunyi Wang

Figure 1 for Speech Emotion Recognition Based on Multi-feature and Multi-lingual Fusion
Figure 2 for Speech Emotion Recognition Based on Multi-feature and Multi-lingual Fusion
Figure 3 for Speech Emotion Recognition Based on Multi-feature and Multi-lingual Fusion
Figure 4 for Speech Emotion Recognition Based on Multi-feature and Multi-lingual Fusion
Viaarxiv icon

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies

Jul 06, 2022
Zehan Li, Haoran Miao, Keqi Deng, Gaofeng Cheng, Sanli Tian, Ta Li, Yonghong Yan

Figure 1 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 2 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Figure 3 for Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Viaarxiv icon

Performance Monitoring for End-to-End Speech Recognition

Apr 09, 2019
Ruizhi Li, Gregory Sell, Hynek Hermansky

Figure 1 for Performance Monitoring for End-to-End Speech Recognition
Figure 2 for Performance Monitoring for End-to-End Speech Recognition
Figure 3 for Performance Monitoring for End-to-End Speech Recognition
Figure 4 for Performance Monitoring for End-to-End Speech Recognition
Viaarxiv icon

A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition

Jun 13, 2018
Hao Tang, Wei-Ning Hsu, Francois Grondin, James Glass

Figure 1 for A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
Figure 2 for A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
Figure 3 for A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
Figure 4 for A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
Viaarxiv icon

Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners

Apr 08, 2022
Zehai Tu, Ning Ma, Jon Barker

Figure 1 for Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
Figure 2 for Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
Figure 3 for Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
Figure 4 for Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
Viaarxiv icon

Character-Level Incremental Speech Recognition with Recurrent Neural Networks

Add code
Bookmark button
Alert button
Jan 28, 2016
Kyuyeon Hwang, Wonyong Sung

Figure 1 for Character-Level Incremental Speech Recognition with Recurrent Neural Networks
Figure 2 for Character-Level Incremental Speech Recognition with Recurrent Neural Networks
Figure 3 for Character-Level Incremental Speech Recognition with Recurrent Neural Networks
Figure 4 for Character-Level Incremental Speech Recognition with Recurrent Neural Networks
Viaarxiv icon