Picture for Kuan-Yu Chen

Kuan-Yu Chen

The Binding Effect: Analyzing How Multi-Dimensional Cues Form Gender Bias in Instruction TTS

Add code
Mar 21, 2026
Viaarxiv icon

Integrating Inductive Biases in Transformers via Distillation for Financial Time Series Forecasting

Add code
Mar 17, 2026
Viaarxiv icon

Concept-Aware Privacy Mechanisms for Defending Embedding Inversion Attacks

Add code
Feb 06, 2026
Viaarxiv icon

Guitar Tone Morphing by Diffusion-based Model

Add code
Oct 09, 2025
Figure 1 for Guitar Tone Morphing by Diffusion-based Model
Figure 2 for Guitar Tone Morphing by Diffusion-based Model
Figure 3 for Guitar Tone Morphing by Diffusion-based Model
Viaarxiv icon

Bloodroot: When Watermarking Turns Poisonous For Stealthy Backdoor

Add code
Oct 09, 2025
Viaarxiv icon

Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems

Add code
Sep 18, 2025
Viaarxiv icon

Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations

Add code
May 30, 2025
Figure 1 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 2 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 3 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Figure 4 for Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations
Viaarxiv icon

Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models

Add code
May 28, 2025
Figure 1 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 2 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 3 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Figure 4 for Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models
Viaarxiv icon

Creativity in LLM-based Multi-Agent Systems: A Survey

Add code
May 27, 2025
Figure 1 for Creativity in LLM-based Multi-Agent Systems: A Survey
Figure 2 for Creativity in LLM-based Multi-Agent Systems: A Survey
Figure 3 for Creativity in LLM-based Multi-Agent Systems: A Survey
Figure 4 for Creativity in LLM-based Multi-Agent Systems: A Survey
Viaarxiv icon

SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement

Add code
May 20, 2025
Figure 1 for SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement
Figure 2 for SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement
Figure 3 for SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement
Figure 4 for SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement
Viaarxiv icon