Alert button

"speech recognition": models, code, and papers
Alert button

Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition

Add code
Bookmark button
Alert button
Mar 28, 2022
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng

Figure 1 for Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition
Figure 2 for Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition
Figure 3 for Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition
Figure 4 for Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition
Viaarxiv icon

Emotional Speech Recognition with Pre-trained Deep Visual Models

Add code
Bookmark button
Alert button
Apr 06, 2022
Waleed Ragheb, Mehdi Mirzapour, Ali Delfardi, Hélène Jacquenet, Lawrence Carbon

Figure 1 for Emotional Speech Recognition with Pre-trained Deep Visual Models
Figure 2 for Emotional Speech Recognition with Pre-trained Deep Visual Models
Figure 3 for Emotional Speech Recognition with Pre-trained Deep Visual Models
Figure 4 for Emotional Speech Recognition with Pre-trained Deep Visual Models
Viaarxiv icon

Improved Meta Learning for Low Resource Speech Recognition

May 11, 2022
Satwinder Singh, Ruili Wang, Feng Hou

Figure 1 for Improved Meta Learning for Low Resource Speech Recognition
Figure 2 for Improved Meta Learning for Low Resource Speech Recognition
Figure 3 for Improved Meta Learning for Low Resource Speech Recognition
Figure 4 for Improved Meta Learning for Low Resource Speech Recognition
Viaarxiv icon

Diagonal State Space Augmented Transformers for Speech Recognition

Add code
Bookmark button
Alert button
Feb 27, 2023
George Saon, Ankit Gupta, Xiaodong Cui

Figure 1 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 2 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 3 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 4 for Diagonal State Space Augmented Transformers for Speech Recognition
Viaarxiv icon

Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding

May 23, 2023
Zheng Chen, Ziyan Jiang, Fan Yang

Figure 1 for Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Figure 2 for Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Figure 3 for Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Figure 4 for Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Viaarxiv icon

VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition

Sep 12, 2022
Naoyuki Kanda, Jian Wu, Xiaofei Wang, Zhuo Chen, Jinyu Li, Takuya Yoshioka

Figure 1 for VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition
Figure 2 for VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition
Figure 3 for VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition
Figure 4 for VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition
Viaarxiv icon

Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition

May 17, 2022
Zengrui Jin, Mengzhe Geng, Jiajun Deng, Tianzi Wang, Shujie Hu, Guinan Li, Xunying Liu

Figure 1 for Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition
Figure 2 for Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition
Figure 3 for Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition
Figure 4 for Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition
Viaarxiv icon

Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR

Apr 25, 2023
Xilai Li, Goeric Huybrechts, Srikanth Ronanki, Jeff Farris, Sravan Bodapati

Figure 1 for Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Figure 2 for Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Figure 3 for Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Figure 4 for Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Viaarxiv icon

Residual Language Model for End-to-end Speech Recognition

Jun 15, 2022
Emiru Tsunoo, Yosuke Kashiwagi, Chaitanya Narisetty, Shinji Watanabe

Figure 1 for Residual Language Model for End-to-end Speech Recognition
Figure 2 for Residual Language Model for End-to-end Speech Recognition
Viaarxiv icon