"speech": models, code, and papers

C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker Verification

Aug 15, 2022
Chunlei Zhang, Dong Yu

Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition

Nov 04, 2020
Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

Streaming non-autoregressive model for any-to-many voice conversion

Jun 15, 2022
Ziyi Chen, Haoran Miao, Pengyuan Zhang

The VoicePrivacy 2020 Challenge Evaluation Plan

May 14, 2022
Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco

An exhaustive variable selection study for linear models of soundscape emotions: rankings and Gibbs analysis

Jul 26, 2022
R. San Millán-Castillo, L. Martino, E. Morgado, F. Llorente

Personalized Keyword Spotting through Multi-task Learning

Jun 28, 2022
Seunghan Yang, Byeonggeun Kim, Inseop Chung, Simyung Chang

Speech-to-Singing Conversion based on Boundary Equilibrium GAN

May 30, 2020
Da-Yi Wu, Yi-Hsuan Yang

Meta-Transfer Learning for Code-Switched Speech Recognition

Apr 29, 2020
Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Peng Xu, Pascale Fung

Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning

Nov 11, 2020
Cunhang Fan, Bin Liu, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Leichao Song

Finnish Parliament ASR corpus - Analysis, benchmarks and statistics

Mar 28, 2022
Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo
