Zhiyong Yan

CED: Consistent ensemble distillation for audio tagging

Sep 08, 2023
Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang

Via arXiv

Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

Jun 28, 2023
Jiuxin Lin, Peng Wang, Heinrich Dinkel, Jun Chen, Zhiyong Wu, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang

Via arXiv

AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction

Jun 25, 2023
Jiuxin Lin, Xinyu Cai, Heinrich Dinkel, Jun Chen, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Zhiyong Wu, Yujun Wang, Helen Meng

Via arXiv

Understanding temporally weakly supervised training: A case study for keyword spotting

May 30, 2023
Heinrich Dinkel, Weiji Zhuang, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang

Via arXiv

Streaming Audio Transformers for Online Audio Tagging

May 29, 2023
Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang

Via arXiv

Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers

Mar 03, 2023
Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang

Via arXiv

An empirical study of weakly supervised audio tagging embeddings for general audio representations

Sep 30, 2022
Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang

Via arXiv

UniKW-AT: Unified Keyword Spotting and Audio Tagging

Sep 23, 2022
Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang

Via arXiv

Pseudo strong labels for large scale weakly supervised audio tagging

Apr 28, 2022
Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang

Via arXiv