Picture for Kuan-Yu Chen

Kuan-Yu Chen

Guitar Tone Morphing by Diffusion-based Model

Add code
Oct 09, 2025
Viaarxiv icon

Bloodroot: When Watermarking Turns Poisonous For Stealthy Backdoor

Add code
Oct 09, 2025
Viaarxiv icon

Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems

Add code
Sep 18, 2025
Viaarxiv icon

Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations

Add code
May 30, 2025
Figure 1 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 2 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 3 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 4 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Viaarxiv icon

Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models

Add code
May 28, 2025
Figure 1 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 2 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 3 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 4 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Viaarxiv icon

Creativity in LLM-based Multi-Agent Systems: A Survey

Add code
May 27, 2025
Viaarxiv icon

SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement

Add code
May 20, 2025
Viaarxiv icon

DOFEN: Deep Oblivious Forest ENsemble

Add code
Dec 24, 2024
Figure 1 for DOFEN: Deep Oblivious Forest ENsemble
Figure 2 for DOFEN: Deep Oblivious Forest ENsemble
Figure 3 for DOFEN: Deep Oblivious Forest ENsemble
Figure 4 for DOFEN: Deep Oblivious Forest ENsemble
Viaarxiv icon

An Attention-based Framework with Multistation Information for Earthquake Early Warnings

Add code
Dec 24, 2024
Viaarxiv icon

Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer

Add code
May 14, 2024
Figure 1 for Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer
Figure 2 for Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer
Figure 3 for Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer
Viaarxiv icon