Alert button

"speech recognition": models, code, and papers
Alert button

Do End-to-End Speech Recognition Models Care About Context?

Feb 17, 2021
Lasse Borgholt, Jakob Drachmann Havtorn, Željko Agić, Anders Søgaard, Lars Maaløe, Christian Igel

Figure 1 for Do End-to-End Speech Recognition Models Care About Context?
Figure 2 for Do End-to-End Speech Recognition Models Care About Context?
Figure 3 for Do End-to-End Speech Recognition Models Care About Context?
Figure 4 for Do End-to-End Speech Recognition Models Care About Context?
Viaarxiv icon

End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English

Oct 26, 2022
Abhinav Goyal, Anupam Singh, Nikesh Garera

Figure 1 for End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English
Figure 2 for End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English
Figure 3 for End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English
Figure 4 for End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English
Viaarxiv icon

Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching

Add code
Bookmark button
Alert button
Apr 16, 2021
Wenxin Hou, Jindong Wang, Xu Tan, Tao Qin, Takahiro Shinozaki

Figure 1 for Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching
Figure 2 for Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching
Figure 3 for Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching
Figure 4 for Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching
Viaarxiv icon

Context-aware Fine-tuning of Self-supervised Speech Models

Add code
Bookmark button
Alert button
Dec 16, 2022
Suwon Shon, Felix Wu, Kwangyoun Kim, Prashant Sridhar, Karen Livescu, Shinji Watanabe

Figure 1 for Context-aware Fine-tuning of Self-supervised Speech Models
Figure 2 for Context-aware Fine-tuning of Self-supervised Speech Models
Figure 3 for Context-aware Fine-tuning of Self-supervised Speech Models
Figure 4 for Context-aware Fine-tuning of Self-supervised Speech Models
Viaarxiv icon

End-to-End Multi-Channel Transformer for Speech Recognition

Feb 08, 2021
Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Brian King, Siegfried Kunzmann

Figure 1 for End-to-End Multi-Channel Transformer for Speech Recognition
Figure 2 for End-to-End Multi-Channel Transformer for Speech Recognition
Figure 3 for End-to-End Multi-Channel Transformer for Speech Recognition
Figure 4 for End-to-End Multi-Channel Transformer for Speech Recognition
Viaarxiv icon

Does Joint Training Really Help Cascaded Speech Translation?

Add code
Bookmark button
Alert button
Oct 24, 2022
Viet Anh Khoa Tran, David Thulke, Yingbo Gao, Christian Herold, Hermann Ney

Figure 1 for Does Joint Training Really Help Cascaded Speech Translation?
Figure 2 for Does Joint Training Really Help Cascaded Speech Translation?
Viaarxiv icon

On using 2D sequence-to-sequence models for speech recognition

Add code
Bookmark button
Alert button
Nov 20, 2019
Parnia Bahar, Albert Zeyer, Ralf Schlüter, Hermann Ney

Figure 1 for On using 2D sequence-to-sequence models for speech recognition
Figure 2 for On using 2D sequence-to-sequence models for speech recognition
Figure 3 for On using 2D sequence-to-sequence models for speech recognition
Figure 4 for On using 2D sequence-to-sequence models for speech recognition
Viaarxiv icon

Hybrid phonetic-neural model for correction in speech recognition systems

Feb 12, 2021
Rafael Viana-Cámara, Mario Campos-Soberanis, Diego Campos-Sobrino

Figure 1 for Hybrid phonetic-neural model for correction in speech recognition systems
Figure 2 for Hybrid phonetic-neural model for correction in speech recognition systems
Figure 3 for Hybrid phonetic-neural model for correction in speech recognition systems
Figure 4 for Hybrid phonetic-neural model for correction in speech recognition systems
Viaarxiv icon

Speech-text based multi-modal training with bidirectional attention for improved speech recognition

Add code
Bookmark button
Alert button
Nov 01, 2022
Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li

Figure 1 for Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Figure 2 for Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Figure 3 for Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Figure 4 for Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Viaarxiv icon