Picture for Haohe Liu

Haohe Liu

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Add code
Jul 19, 2024
Viaarxiv icon

Text-Queried Target Sound Event Localization

Add code
Jun 23, 2024
Viaarxiv icon

Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review

Add code
Jun 20, 2024
Viaarxiv icon

Zero-Shot Audio Captioning Using Soft and Hard Prompts

Add code
Jun 10, 2024
Viaarxiv icon

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Add code
Apr 30, 2024
Figure 1 for SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Figure 2 for SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Figure 3 for SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Figure 4 for SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Viaarxiv icon

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Add code
Apr 27, 2024
Figure 1 for T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining
Figure 2 for T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining
Figure 3 for T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining
Figure 4 for T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining
Viaarxiv icon

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Add code
Apr 25, 2024
Figure 1 for FlashSpeech: Efficient Zero-Shot Speech Synthesis
Figure 2 for FlashSpeech: Efficient Zero-Shot Speech Synthesis
Figure 3 for FlashSpeech: Efficient Zero-Shot Speech Synthesis
Figure 4 for FlashSpeech: Efficient Zero-Shot Speech Synthesis
Viaarxiv icon

WavCraft: Audio Editing and Generation with Natural Language Prompts

Add code
Mar 15, 2024
Figure 1 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 2 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 3 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 4 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Viaarxiv icon

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

Add code
Feb 05, 2024
Figure 1 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 2 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 3 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 4 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Viaarxiv icon

Balanced SNR-Aware Distillation for Guided Text-to-Audio Generation

Add code
Dec 25, 2023
Viaarxiv icon