Alert button

"speech recognition": models, code, and papers
Alert button

Improved Regularization Techniques for End-to-End Speech Recognition

Dec 19, 2017
Yingbo Zhou, Caiming Xiong, Richard Socher

Figure 1 for Improved Regularization Techniques for End-to-End Speech Recognition
Figure 2 for Improved Regularization Techniques for End-to-End Speech Recognition
Figure 3 for Improved Regularization Techniques for End-to-End Speech Recognition
Figure 4 for Improved Regularization Techniques for End-to-End Speech Recognition
Viaarxiv icon

Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks

Dec 12, 2021
Chia-Yu Li, Ngoc Thang Vu

Figure 1 for Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks
Figure 2 for Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks
Figure 3 for Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks
Figure 4 for Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks
Viaarxiv icon

Sequence-based Multi-lingual Low Resource Speech Recognition

Mar 06, 2018
Siddharth Dalmia, Ramon Sanabria, Florian Metze, Alan W. Black

Figure 1 for Sequence-based Multi-lingual Low Resource Speech Recognition
Viaarxiv icon

Invariant Representations for Noisy Speech Recognition

Nov 27, 2016
Dmitriy Serdyuk, Kartik Audhkhasi, Philémon Brakel, Bhuvana Ramabhadran, Samuel Thomas, Yoshua Bengio

Figure 1 for Invariant Representations for Noisy Speech Recognition
Figure 2 for Invariant Representations for Noisy Speech Recognition
Viaarxiv icon

Speech Emotion Recognition using Semantic Information

Mar 04, 2021
Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller

Figure 1 for Speech Emotion Recognition using Semantic Information
Figure 2 for Speech Emotion Recognition using Semantic Information
Figure 3 for Speech Emotion Recognition using Semantic Information
Figure 4 for Speech Emotion Recognition using Semantic Information
Viaarxiv icon

A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition

Dec 01, 2022
Biao Ma, Chengben Xu, Ye Zhang

Figure 1 for A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Figure 2 for A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Figure 3 for A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Figure 4 for A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Viaarxiv icon

r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation

Feb 21, 2022
Chendong Zhao, Jianzong Wang, Xiaoyang Qu, Haoqian Wang, Jing Xiao

Figure 1 for r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation
Figure 2 for r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation
Figure 3 for r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation
Figure 4 for r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation
Viaarxiv icon

To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition

Dec 13, 2018
Yossi Adi, Neil Zeghidour, Ronan Collobert, Nicolas Usunier, Vitaliy Liptchinsky, Gabriel Synnaeve

Figure 1 for To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition
Figure 2 for To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition
Figure 3 for To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition
Viaarxiv icon

A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition

Feb 28, 2020
Erik McDermott, Hasim Sak, Ehsan Variani

Figure 1 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 2 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 3 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Figure 4 for A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Viaarxiv icon

Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units

Sep 06, 2018
Zhangyu Xiao, Zhijian Ou, Wei Chu, Hui Lin

Figure 1 for Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units
Viaarxiv icon