Picture for Mengyue Wu

Mengyue Wu

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Add code
Jul 19, 2024
Viaarxiv icon

Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models

Add code
Jul 19, 2024
Viaarxiv icon

DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation

Add code
Jul 18, 2024
Viaarxiv icon

AudioTime: A Temporally-aligned Audio-text Benchmark Dataset

Add code
Jul 03, 2024
Viaarxiv icon

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation

Add code
Jul 03, 2024
Viaarxiv icon

FakeSound: Deepfake General Audio Detection

Add code
Jun 12, 2024
Figure 1 for FakeSound: Deepfake General Audio Detection
Figure 2 for FakeSound: Deepfake General Audio Detection
Figure 3 for FakeSound: Deepfake General Audio Detection
Figure 4 for FakeSound: Deepfake General Audio Detection
Viaarxiv icon

Evaluation of data inconsistency for multi-modal sentiment analysis

Add code
Jun 05, 2024
Figure 1 for Evaluation of data inconsistency for multi-modal sentiment analysis
Figure 2 for Evaluation of data inconsistency for multi-modal sentiment analysis
Figure 3 for Evaluation of data inconsistency for multi-modal sentiment analysis
Figure 4 for Evaluation of data inconsistency for multi-modal sentiment analysis
Viaarxiv icon

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Add code
Apr 30, 2024
Figure 1 for SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Figure 2 for SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Figure 3 for SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Figure 4 for SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
Viaarxiv icon

Towards Reliable and Empathetic Depression-Diagnosis-Oriented Chats

Add code
Apr 07, 2024
Viaarxiv icon

A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds

Add code
Mar 07, 2024
Figure 1 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 2 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 3 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 4 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Viaarxiv icon