Picture for Jianhua Tao

Jianhua Tao

MDPE: A Multimodal Deception Dataset with Personality and Emotional Characteristics

Add code
Jul 17, 2024
Figure 1 for MDPE: A Multimodal Deception Dataset with Personality and Emotional Characteristics
Figure 2 for MDPE: A Multimodal Deception Dataset with Personality and Emotional Characteristics
Figure 3 for MDPE: A Multimodal Deception Dataset with Personality and Emotional Characteristics
Figure 4 for MDPE: A Multimodal Deception Dataset with Personality and Emotional Characteristics
Viaarxiv icon

An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio

Add code
Jul 11, 2024
Figure 1 for An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Figure 2 for An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Figure 3 for An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Figure 4 for An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Viaarxiv icon

ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation

Add code
Jul 07, 2024
Figure 1 for ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation
Figure 2 for ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation
Figure 3 for ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation
Figure 4 for ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation
Viaarxiv icon

Fake News Detection and Manipulation Reasoning via Large Vision-Language Models

Add code
Jul 02, 2024
Viaarxiv icon

MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation

Add code
Jun 15, 2024
Figure 1 for MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation
Figure 2 for MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation
Figure 3 for MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation
Figure 4 for MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation
Viaarxiv icon

Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio

Add code
Jun 12, 2024
Figure 1 for Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Figure 2 for Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Figure 3 for Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Figure 4 for Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Viaarxiv icon

RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

Add code
Jun 10, 2024
Figure 1 for RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Figure 2 for RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Figure 3 for RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Figure 4 for RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Viaarxiv icon

PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation

Add code
Jun 07, 2024
Figure 1 for PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
Figure 2 for PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
Figure 3 for PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
Figure 4 for PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
Viaarxiv icon

TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Add code
Jun 07, 2024
Figure 1 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
Figure 2 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
Figure 3 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
Figure 4 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
Viaarxiv icon

Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection

Add code
Jun 05, 2024
Figure 1 for Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
Figure 2 for Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
Figure 3 for Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
Figure 4 for Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
Viaarxiv icon