Alert button

"speech": models, code, and papers
Alert button

Source Free Domain Adaptation of a DNN for SSVEP-based Brain-Computer Interfaces

Add code
Bookmark button
Alert button
May 27, 2023
Osman Berke Guney, Deniz Kucukahmetler, Huseyin Ozkan

Figure 1 for Source Free Domain Adaptation of a DNN for SSVEP-based Brain-Computer Interfaces
Figure 2 for Source Free Domain Adaptation of a DNN for SSVEP-based Brain-Computer Interfaces
Figure 3 for Source Free Domain Adaptation of a DNN for SSVEP-based Brain-Computer Interfaces
Figure 4 for Source Free Domain Adaptation of a DNN for SSVEP-based Brain-Computer Interfaces
Viaarxiv icon

A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition

Nov 24, 2022
Jiacheng Zhang, Wenyi Yan, Ye Zhang

Figure 1 for A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition
Figure 2 for A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition
Figure 3 for A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition
Figure 4 for A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition
Viaarxiv icon

Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder

Add code
Bookmark button
Alert button
Dec 16, 2022
Yusuke Yasuda, Tomoki Toda

Figure 1 for Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder
Figure 2 for Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder
Figure 3 for Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder
Viaarxiv icon

Describing emotions with acoustic property prompts for speech emotion recognition

Nov 14, 2022
Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

Figure 1 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 2 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 3 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 4 for Describing emotions with acoustic property prompts for speech emotion recognition
Viaarxiv icon

LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge

Oct 17, 2022
Yan Jia, Mi Hong, Jingyu Hou, Kailong Ren, Sifan Ma, Jin Wang, Fangzhen Peng, Yinglin Ji, Lin Yang, Junjie Wang

Figure 1 for LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
Figure 2 for LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
Figure 3 for LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
Figure 4 for LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
Viaarxiv icon

Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity

May 24, 2023
Raman Dutt, Linus Ericsson, Pedro Sanchez, Sotirios A. Tsaftaris, Timothy Hospedales

Figure 1 for Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Figure 2 for Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Figure 3 for Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Figure 4 for Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Viaarxiv icon

Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios

May 13, 2023
Morgan Sandler, Arun Ross

Figure 1 for Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios
Figure 2 for Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios
Figure 3 for Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios
Figure 4 for Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios
Viaarxiv icon

Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition

Oct 27, 2022
Yujin Wang, Changli Tang, Ziyang Ma, Zhisheng Zheng, Xie Chen, Wei-Qiang Zhang

Figure 1 for Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition
Figure 2 for Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition
Figure 3 for Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition
Figure 4 for Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition
Viaarxiv icon

DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement

Dec 15, 2022
Dongheon Lee, Jung-Woo Choi

Figure 1 for DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Figure 2 for DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Figure 3 for DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Figure 4 for DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement
Viaarxiv icon

Difference of Submodular Minimization via DC Programming

Add code
Bookmark button
Alert button
May 18, 2023
Marwa El Halabi, George Orfanides, Tim Hoheisel

Viaarxiv icon