"speech recognition": models, code, and papers

A Convolutional Neural Network model based on Neutrosophy for Noisy Speech Recognition

Jan 27, 2019
Elyas Rashno, Ahmad Akbari, Babak Nasersharif

Via arXiv

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Mar 31, 2019
Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara

Via arXiv

Hybridized Feature Extraction and Acoustic Modelling Approach for Dysarthric Speech Recognition

Jun 06, 2015
Megha Rughani, D. Shivakrishna

Via arXiv

Neural Dependency Coding inspired Multimodal Fusion

Oct 04, 2021
Shiv Shankar

Via arXiv

Learning a Neural Diff for Speech Models

Aug 17, 2021
Jonathan Macoskey, Grant P. Strimel, Ariya Rastrow

Via arXiv

Back from the future: bidirectional CTC decoding using future information in speech recognition

Oct 07, 2021
Namkyu Jung, Geonmin Kim, Han-Gyu Kim

Via arXiv

Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition

Oct 29, 2021
Chak-Fai Li, Francis Keith, William Hartmann, Matthew Snover

Via arXiv

Coarse-To-Fine And Cross-Lingual ASR Transfer

Sep 02, 2021
Peter Polák, Ondřej Bojar

Via arXiv

UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data

Jan 19, 2021
Chengyi Wang, Yu Wu, Yao Qian, Kenichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang

Via arXiv

Towards Identity Preserving Normal to Dysarthric Voice Conversion

Oct 15, 2021
Wen-Chin Huang, Bence Mark Halpern, Lester Phillip Violeta, Odette Scharenborg, Tomoki Toda

Via arXiv