Jianhua Tao

MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation

Jun 15, 2024

Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio

Jun 12, 2024

RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

Jun 10, 2024

PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation

Jun 07, 2024

TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Jun 07, 2024

Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion strategy

Jun 05, 2024

Generalized Fake Audio Detection via Deep Stable Learning

Jun 05, 2024

Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection

Jun 05, 2024

EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark

May 15, 2024

Can large language models understand uncommon meanings of common words?

May 09, 2024