Haohe Liu

Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows

Apr 22, 2025

Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture

Apr 21, 2025

HandSplat: Embedding-Driven Gaussian Splatting for High-Fidelity Hand Rendering

Mar 18, 2025

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Mar 11, 2025

Audio-FLAN: A Preliminary Release

Feb 23, 2025

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Feb 06, 2025

AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models

Nov 28, 2024

FlowSep: Language-Queried Sound Separation with Rectified Flow Matching

Sep 11, 2024

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Jul 19, 2024

Text-Queried Target Sound Event Localization

Jun 23, 2024