Xiang Hao

College of Optical Science and Engineering, Zhejiang University, No.38 of Zheda Road, Hangzhou, Zhejiang Province, China

AI-Generated Content Enhanced Computer-Aided Diagnosis Model for Thyroid Nodules: A ChatGPT-Style Assistant

Feb 04, 2024
Jincao Yao, Yunpeng Wang, Zhikai Lei, Kai Wang, Xiaoxian Li, Jianhua Zhou, Xiang Hao, Jiafei Shen, Zhenping Wang, Rongrong Ru, Yaqing Chen, Yahan Zhou, Chen Chen, Yanming Zhang, Ping Liang, Dong Xu

Via arXiv

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction

Oct 15, 2023
Xiang Hao, Jibin Wu, Jianwei Yu, Chenglin Xu, Kay Chen Tan

Via arXiv

Pink-Eggs Dataset V1: A Step Toward Invasive Species Management Using Deep Learning Embedded Solutions

May 16, 2023
Di Xu, Yang Zhao, Xiang Hao, Xin Meng

Via arXiv

Two-stage Neural Network for ICASSP 2023 Speech Signal Improvement Challenge

Mar 14, 2023
Mingshuai Liu, Shubo Lv, Zihan Zhang, Runduo Han, Xiang Hao, Xianjun Xia, Li Chen, Yijian Xiao, Lei Xie

Via arXiv

Fast FullSubNet: Accelerate Full-band and Sub-band Fusion Model for Single-channel Speech Enhancement

Dec 18, 2022
Xiang Hao, Xiaofei Li

Via arXiv

Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes

Jun 16, 2022
Xiang Hao, Jingxiang Chen, Shixing Chen, Ahmed Saad, Raffay Hamid

Via arXiv

Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers

Mar 30, 2022
Zhenhao Jin, Xiang Hao, Xiangdong Su

Via arXiv

Movies2Scenes: Learning Scene Representations Using Movie Similarities

Mar 12, 2022
Shixing Chen, Xiang Hao, Xiaohan Nie, Raffay Hamid

Via arXiv