Wenwu Wang

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Jul 19, 2024
Via arXiv

Universal Sound Separation with Self-Supervised Audio Masked Autoencoder

Jul 16, 2024
Via arXiv

A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining

Jul 06, 2024
Via arXiv

Improving Audio Generation with Visual Enhanced Caption

Jul 05, 2024
Via arXiv

Learning Retrieval Augmentation for Personalized Dialogue Generation

Jun 27, 2024
Via arXiv

Selective Prompting Tuning for Personalized Conversations with LLMs

Jun 26, 2024
Via arXiv

Text-Queried Target Sound Event Localization

Jun 23, 2024
Via arXiv

Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review

Jun 20, 2024
Via arXiv

Zero-Shot Audio Captioning Using Soft and Hard Prompts

Jun 10, 2024
Via arXiv

Soundscape Captioning using Sound Affective Quality Network and Large Language Model

Jun 09, 2024
Via arXiv