Alert button
Picture for Zhengkun Tian

Zhengkun Tian

Alert button

CPPF: A contextual and post-processing-free model for automatic speech recognition

Sep 21, 2023
Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan

Figure 1 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 2 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 3 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Viaarxiv icon

TST: Time-Sparse Transducer for Automatic Speech Recognition

Jul 17, 2023
Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao

Figure 1 for TST: Time-Sparse Transducer for Automatic Speech Recognition
Figure 2 for TST: Time-Sparse Transducer for Automatic Speech Recognition
Figure 3 for TST: Time-Sparse Transducer for Automatic Speech Recognition
Figure 4 for TST: Time-Sparse Transducer for Automatic Speech Recognition
Viaarxiv icon

SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection

Nov 11, 2022
Jiangyan Yi, Chenglong Wang, Jianhua Tao, Zhengkun Tian, Cunhang Fan, Haoxin Ma, Ruibo Fu

Figure 1 for SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection
Figure 2 for SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection
Figure 3 for SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection
Figure 4 for SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection
Viaarxiv icon

Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization

Nov 07, 2022
Zhengkun Tian, Hongyu Xiang, Min Li, Feifei Lin, Ke Ding, Guanglu Wan

Figure 1 for Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization
Figure 2 for Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization
Figure 3 for Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization
Viaarxiv icon

System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation

Aug 21, 2022
Xinrui Yan, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Haoxin Ma, Zhengkun Tian, Ruibo Fu

Figure 1 for System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation
Figure 2 for System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation
Figure 3 for System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation
Figure 4 for System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation
Viaarxiv icon

Fully Automated End-to-End Fake Audio Detection

Aug 20, 2022
Chenglong Wang, Jiangyan Yi, Jianhua Tao, Haiyang Sun, Xun Chen, Zhengkun Tian, Haoxin Ma, Cunhang Fan, Ruibo Fu

Figure 1 for Fully Automated End-to-End Fake Audio Detection
Figure 2 for Fully Automated End-to-End Fake Audio Detection
Figure 3 for Fully Automated End-to-End Fake Audio Detection
Figure 4 for Fully Automated End-to-End Fake Audio Detection
Viaarxiv icon

ADD 2022: the First Audio Deep Synthesis Detection Challenge

Feb 26, 2022
Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu

Figure 1 for ADD 2022: the First Audio Deep Synthesis Detection Challenge
Figure 2 for ADD 2022: the First Audio Deep Synthesis Detection Challenge
Figure 3 for ADD 2022: the First Audio Deep Synthesis Detection Challenge
Figure 4 for ADD 2022: the First Audio Deep Synthesis Detection Challenge
Viaarxiv icon

Reducing language context confusion for end-to-end code-switching automatic speech recognition

Jan 28, 2022
Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Yu Ting Yeung, Liqun Deng

Figure 1 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 2 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 3 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 4 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Viaarxiv icon