Jianhua Tao

TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection

May 23, 2023
Chenglong Wang, Jiangyan Yi, Jianhua Tao, Chuyuan Zhang, Shuai Zhang, Ruibo Fu, Xun Chen

Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features

May 23, 2023
Chenglong Wang, Jiangyan Yi, Jianhua Tao, Chuyuan Zhang, Shuai Zhang, Xun Chen

M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis

May 03, 2023
Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang

MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning

Apr 18, 2023
Zheng Lian, Haiyang Sun, Licai Sun, Jinming Zhao, Ye Liu, Bin Liu, Jiangyan Yi, Meng Wang, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao

DALI: Dynamically Adjusted Label Importance for Noisy Partial Label Learning

Jan 28, 2023
Mingyu Xu, Zheng Lian, Lei Feng, Bin Liu, Jianhua Tao

UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion

Jan 10, 2023
Haogeng Liu, Tao Wang, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Jianhua Tao

Emotion Selectable End-to-End Text-based Speech Editing

Dec 20, 2022
Tao Wang, Jiangyan Yi, Ruibo Fu, Jianhua Tao, Zhengqi Wen, Chu Yuan Zhang

SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection

Nov 11, 2022
Jiangyan Yi, Chenglong Wang, Jianhua Tao, Zhengkun Tian, Cunhang Fan, Haoxin Ma, Ruibo Fu

EmoFake: An Initial Dataset for Emotion Fake Audio Detection

Nov 11, 2022
Yan Zhao, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Chu Yuan Zhang, Tao Wang, Yongfeng Dong

EMOFAKE: An Initial Dataset For Emotion Fake Audio Detection

Nov 10, 2022
Yan Zhao, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Chu Yuan Zhang, Tao Wang, Yongfeng Dong
