Alert button
Picture for Xiaohui Zhang

Xiaohui Zhang

Alert button

TST: Time-Sparse Transducer for Automatic Speech Recognition

Jul 17, 2023
Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao

Figure 1 for TST: Time-Sparse Transducer for Automatic Speech Recognition
Figure 2 for TST: Time-Sparse Transducer for Automatic Speech Recognition
Figure 3 for TST: Time-Sparse Transducer for Automatic Speech Recognition
Figure 4 for TST: Time-Sparse Transducer for Automatic Speech Recognition
Viaarxiv icon

Low-rank Adaptation Method for Wav2vec2-based Fake Audio Detection

Jun 09, 2023
Chenglong Wang, Jiangyan Yi, Xiaohui Zhang, Jianhua Tao, Le Xu, Ruibo Fu

Figure 1 for Low-rank Adaptation Method for Wav2vec2-based Fake Audio Detection
Figure 2 for Low-rank Adaptation Method for Wav2vec2-based Fake Audio Detection
Figure 3 for Low-rank Adaptation Method for Wav2vec2-based Fake Audio Detection
Figure 4 for Low-rank Adaptation Method for Wav2vec2-based Fake Audio Detection
Viaarxiv icon

Adaptive Fake Audio Detection with Low-Rank Model Squeezing

Jun 08, 2023
Xiaohui Zhang, Jiangyan Yi, Jianhua Tao, Chenlong Wang, Le Xu, Ruibo Fu

Figure 1 for Adaptive Fake Audio Detection with Low-Rank Model Squeezing
Figure 2 for Adaptive Fake Audio Detection with Low-Rank Model Squeezing
Figure 3 for Adaptive Fake Audio Detection with Low-Rank Model Squeezing
Viaarxiv icon

ADD 2023: the Second Audio Deepfake Detection Challenge

May 23, 2023
Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li

Figure 1 for ADD 2023: the Second Audio Deepfake Detection Challenge
Figure 2 for ADD 2023: the Second Audio Deepfake Detection Challenge
Figure 3 for ADD 2023: the Second Audio Deepfake Detection Challenge
Figure 4 for ADD 2023: the Second Audio Deepfake Detection Challenge
Viaarxiv icon

Scaling Speech Technology to 1,000+ Languages

May 22, 2023
Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli

Figure 1 for Scaling Speech Technology to 1,000+ Languages
Figure 2 for Scaling Speech Technology to 1,000+ Languages
Figure 3 for Scaling Speech Technology to 1,000+ Languages
Figure 4 for Scaling Speech Technology to 1,000+ Languages
Viaarxiv icon

ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit

Apr 11, 2023
Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polák, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe

Figure 1 for ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
Figure 2 for ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
Figure 3 for ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
Figure 4 for ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
Viaarxiv icon

TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio

Apr 04, 2023
Anurag Kumar, Ke Tan, Zhaoheng Ni, Pranay Manocha, Xiaohui Zhang, Ethan Henderson, Buye Xu

Figure 1 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 2 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 3 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 4 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Viaarxiv icon

Joint localization and classification of breast tumors on ultrasound images using a novel auxiliary attention-based framework

Oct 11, 2022
Zong Fan, Ping Gong, Shanshan Tang, Christine U. Lee, Xiaohui Zhang, Pengfei Song, Shigao Chen, Hua Li

Figure 1 for Joint localization and classification of breast tumors on ultrasound images using a novel auxiliary attention-based framework
Figure 2 for Joint localization and classification of breast tumors on ultrasound images using a novel auxiliary attention-based framework
Figure 3 for Joint localization and classification of breast tumors on ultrasound images using a novel auxiliary attention-based framework
Figure 4 for Joint localization and classification of breast tumors on ultrasound images using a novel auxiliary attention-based framework
Viaarxiv icon

A novel adversarial learning strategy for medical image classification

Jul 07, 2022
Zong Fan, Xiaohui Zhang, Jacob A. Gasienica, Jennifer Potts, Su Ruan, Wade Thorstad, Hiram Gay, Pengfei Song, Xiaowei Wang, Hua Li

Figure 1 for A novel adversarial learning strategy for medical image classification
Figure 2 for A novel adversarial learning strategy for medical image classification
Figure 3 for A novel adversarial learning strategy for medical image classification
Figure 4 for A novel adversarial learning strategy for medical image classification
Viaarxiv icon