Picture for Yihan Wu

Yihan Wu

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

Add code
Sep 24, 2024
Viaarxiv icon

Text-To-Speech Synthesis In The Wild

Add code
Sep 13, 2024
Viaarxiv icon

YuLan: An Open-source Large Language Model

Add code
Jun 28, 2024
Viaarxiv icon

The Interspeech 2024 Challenge on Speech Processing Using Discrete Units

Add code
Jun 11, 2024
Viaarxiv icon

Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions

Add code
Jun 02, 2024
Viaarxiv icon

Few-Shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt

Add code
Mar 25, 2024
Viaarxiv icon

Entity Alignment with Unlabeled Dangling Cases

Add code
Mar 16, 2024
Viaarxiv icon

Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection

Add code
Feb 19, 2024
Viaarxiv icon

SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition

Add code
Jan 31, 2024
Viaarxiv icon

GPT-4 Vision on Medical Image Classification -- A Case Study on COVID-19 Dataset

Add code
Oct 27, 2023
Viaarxiv icon