Alert button
Picture for Midia Yousefi

Midia Yousefi

Alert button

CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations

Add code
Bookmark button
Alert button
Apr 10, 2024
Leying Zhang, Yao Qian, Long Zhou, Shujie Liu, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Lei He, Sheng Zhao, Michael Zeng

Viaarxiv icon

Profile-Error-Tolerant Target-Speaker Voice Activity Detection

Add code
Bookmark button
Alert button
Sep 21, 2023
Dongmei Wang, Xiong Xiao, Naoyuki Kanda, Midia Yousefi, Takuya Yoshioka, Jian Wu

Figure 1 for Profile-Error-Tolerant Target-Speaker Voice Activity Detection
Figure 2 for Profile-Error-Tolerant Target-Speaker Voice Activity Detection
Figure 3 for Profile-Error-Tolerant Target-Speaker Voice Activity Detection
Figure 4 for Profile-Error-Tolerant Target-Speaker Voice Activity Detection
Viaarxiv icon

Single-channel speech separation using Soft-minimum Permutation Invariant Training

Add code
Bookmark button
Alert button
Nov 16, 2021
Midia Yousefi, John H. L. Hansen

Figure 1 for Single-channel speech separation using Soft-minimum Permutation Invariant Training
Figure 2 for Single-channel speech separation using Soft-minimum Permutation Invariant Training
Figure 3 for Single-channel speech separation using Soft-minimum Permutation Invariant Training
Figure 4 for Single-channel speech separation using Soft-minimum Permutation Invariant Training
Viaarxiv icon

Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition

Add code
Bookmark button
Alert button
Oct 30, 2021
Midia Yousefi, John H. L. Hanse

Figure 1 for Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition
Figure 2 for Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition
Figure 3 for Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition
Figure 4 for Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition
Viaarxiv icon

Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network

Add code
Bookmark button
Alert button
Oct 30, 2021
Midia Yousefi, John H. L. Hansen

Figure 1 for Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network
Figure 2 for Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network
Figure 3 for Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network
Figure 4 for Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network
Viaarxiv icon

Probabilistic Permutation Invariant Training for Speech Separation

Add code
Bookmark button
Alert button
Aug 04, 2019
Midia Yousefi, Soheil Khorram, John H. L. Hansen

Figure 1 for Probabilistic Permutation Invariant Training for Speech Separation
Figure 2 for Probabilistic Permutation Invariant Training for Speech Separation
Figure 3 for Probabilistic Permutation Invariant Training for Speech Separation
Viaarxiv icon