Picture for Yi Lu

Yi Lu

ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation

Add code
Jul 07, 2024
Viaarxiv icon

Zero-Shot Long-Form Video Understanding through Screenplay

Add code
Jun 25, 2024
Figure 1 for Zero-Shot Long-Form Video Understanding through Screenplay
Figure 2 for Zero-Shot Long-Form Video Understanding through Screenplay
Figure 3 for Zero-Shot Long-Form Video Understanding through Screenplay
Figure 4 for Zero-Shot Long-Form Video Understanding through Screenplay
Viaarxiv icon

A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge

Add code
Jun 22, 2024
Viaarxiv icon

MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation

Add code
Jun 15, 2024
Viaarxiv icon

Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio

Add code
Jun 12, 2024
Figure 1 for Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Figure 2 for Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Figure 3 for Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Figure 4 for Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio
Viaarxiv icon

PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation

Add code
Jun 07, 2024
Viaarxiv icon

Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection

Add code
Jun 05, 2024
Figure 1 for Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
Figure 2 for Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
Figure 3 for Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
Figure 4 for Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
Viaarxiv icon

Generalized Fake Audio Detection via Deep Stable Learning

Add code
Jun 05, 2024
Viaarxiv icon

The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio

Add code
May 08, 2024
Viaarxiv icon

Paintings and Drawings Aesthetics Assessment with Rich Attributes for Various Artistic Categories

Add code
May 05, 2024
Figure 1 for Paintings and Drawings Aesthetics Assessment with Rich Attributes for Various Artistic Categories
Figure 2 for Paintings and Drawings Aesthetics Assessment with Rich Attributes for Various Artistic Categories
Figure 3 for Paintings and Drawings Aesthetics Assessment with Rich Attributes for Various Artistic Categories
Figure 4 for Paintings and Drawings Aesthetics Assessment with Rich Attributes for Various Artistic Categories
Viaarxiv icon