"speech recognition": models, code, and papers

Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings

Feb 13, 2022
Nadee Seneviratne, Carol Espy-Wilson

Analysis of Joint Speech-Text Embeddings for Semantic Matching

Apr 04, 2022
Muhammad Huzaifah, Ivan Kukanov

E2E-based Multi-task Learning Approach to Joint Speech and Accent Recognition

Jun 15, 2021
Jicheng Zhang, Yizhou Peng, Pham Van Tung, Haihua Xu, Hao Huang, Eng Siong Chng

Accounting for Variations in Speech Emotion Recognition with Nonparametric Hierarchical Neural Network

Sep 09, 2021
Lance Ying, Amrit Romana, Emily Mower Provost

3D Feature Pyramid Attention Module for Robust Visual Speech Recognition

Oct 17, 2018
Jingyun Xiao, Shuang Yang, Yuanhang Zhang, Shiguang Shan, Xilin Chen

Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation

Aug 08, 2021
Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram

Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition

Jul 13, 2019
Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen

Improved training of end-to-end attention models for speech recognition

May 08, 2018
Albert Zeyer, Kazuki Irie, Ralf Schlüter, Hermann Ney

ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks

May 04, 2022
Marcely Zanon Boito, John Ortega, Hugo Riguidel, Antoine Laurent, Loïc Barrault, Fethi Bougares, Firas Chaabani, Ha Nguyen, Florentin Barbier, Souhir Gahbiche, Yannick Estève

Multilingual Speech Recognition With A Single End-To-End Model

Feb 15, 2018
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro Moreno, Eugene Weinstein, Kanishka Rao
