"speech recognition": models, code, and papers

PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models

Jun 08, 2023
Tiantian Feng, Shrikanth Narayanan

Via arXiv

Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones

Oct 10, 2023
Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

Via arXiv

Online Continual Learning of End-to-End Speech Recognition Models

Jul 11, 2022
Muqiao Yang, Ian Lane, Shinji Watanabe

Via arXiv

Streaming Audio-Visual Speech Recognition with Alignment Regularization

Nov 03, 2022
Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic

Via arXiv

MAC: A unified framework boosting low resource automatic speech recognition

Feb 15, 2023
Zeping Min, Qian Ge, Zhong Li, Weinan E

Via arXiv

Improving Speech Emotion Recognition Performance using Differentiable Architecture Search

May 23, 2023
Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Björn Schuller

Via arXiv

Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition

Feb 28, 2023
Zhijie Shen, Wu Guo, Bin Gu

Via arXiv

Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding

Jul 22, 2023
Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer

Via arXiv

Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks

Aug 18, 2023
Shu Wang, Kun Sun, Qi Li

Via arXiv

EM-Network: Oracle Guided Self-distillation for Sequence Learning

Jun 14, 2023
Ji Won Yoon, Sunghwan Ahn, Hyeonseung Lee, Minchan Kim, Seok Min Kim, Nam Soo Kim

Via arXiv