Alert button
Picture for Yanmin Qian

Yanmin Qian

Alert button

Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition

Add code
Bookmark button
Alert button
Sep 21, 2023
Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li

Figure 1 for Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition
Figure 2 for Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition
Figure 3 for Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition
Figure 4 for Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition
Viaarxiv icon

USED: Universal Speaker Extraction and Diarization

Add code
Bookmark button
Alert button
Sep 19, 2023
Junyi Ao, Mehmet Sinan Yıldırım, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li

Figure 1 for USED: Universal Speaker Extraction and Diarization
Figure 2 for USED: Universal Speaker Extraction and Diarization
Figure 3 for USED: Universal Speaker Extraction and Diarization
Figure 4 for USED: Universal Speaker Extraction and Diarization
Viaarxiv icon

Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer

Add code
Bookmark button
Alert button
Sep 13, 2023
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian

Figure 1 for Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer
Figure 2 for Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer
Figure 3 for Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer
Figure 4 for Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer
Viaarxiv icon

InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models

Add code
Bookmark button
Alert button
Sep 06, 2023
Bing Han, Junyu Dai, Xuchen Song, Weituo Hao, Xinyan He, Dong Guo, Jitong Chen, Yuxuan Wang, Yanmin Qian

Figure 1 for InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models
Figure 2 for InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models
Figure 3 for InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models
Figure 4 for InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models
Viaarxiv icon

Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation

Add code
Bookmark button
Alert button
Jul 23, 2023
Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe

Figure 1 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Figure 2 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Figure 3 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Viaarxiv icon

Exploring Binary Classification Loss For Speaker Verification

Add code
Bookmark button
Alert button
Jul 17, 2023
Bing Han, Zhengyang Chen, Yanmin Qian

Figure 1 for Exploring Binary Classification Loss For Speaker Verification
Figure 2 for Exploring Binary Classification Loss For Speaker Verification
Figure 3 for Exploring Binary Classification Loss For Speaker Verification
Viaarxiv icon

Adapting Multi-Lingual ASR Models for Handling Multiple Talkers

Add code
Bookmark button
Alert button
May 30, 2023
Chenda Li, Yao Qian, Zhuo Chen, Naoyuki Kanda, Dongmei Wang, Takuya Yoshioka, Yanmin Qian, Michael Zeng

Figure 1 for Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
Figure 2 for Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
Figure 3 for Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
Figure 4 for Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
Viaarxiv icon

Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition

Add code
Bookmark button
Alert button
May 25, 2023
Wangyou Zhang, Yanmin Qian

Figure 1 for Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
Figure 2 for Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
Figure 3 for Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
Figure 4 for Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
Viaarxiv icon

Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR

Add code
Bookmark button
Alert button
May 18, 2023
Hang Shao, Wei Wang, Bei Liu, Xun Gong, Haoyu Wang, Yanmin Qian

Figure 1 for Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR
Figure 2 for Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR
Figure 3 for Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR
Viaarxiv icon