Picture for Yi-Chiao Wu

Yi-Chiao Wu

SAM Audio Judge: A Unified Multimodal Framework for Perceptual Evaluation of Audio Separation

Add code
Jan 27, 2026
Viaarxiv icon

T-Mimi: A Transformer-based Mimi Decoder for Real-Time On-Phone TTS

Add code
Jan 27, 2026
Viaarxiv icon

SAM Audio: Segment Anything in Audio

Add code
Dec 19, 2025
Figure 1 for SAM Audio: Segment Anything in Audio
Figure 2 for SAM Audio: Segment Anything in Audio
Figure 3 for SAM Audio: Segment Anything in Audio
Figure 4 for SAM Audio: Segment Anything in Audio
Viaarxiv icon

FlowDec: A flow-based full-band general audio codec with high perceptual quality

Add code
Mar 03, 2025
Figure 1 for FlowDec: A flow-based full-band general audio codec with high perceptual quality
Figure 2 for FlowDec: A flow-based full-band general audio codec with high perceptual quality
Figure 3 for FlowDec: A flow-based full-band general audio codec with high perceptual quality
Figure 4 for FlowDec: A flow-based full-band general audio codec with high perceptual quality
Viaarxiv icon

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Add code
Feb 07, 2025
Figure 1 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 2 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 3 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 4 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Viaarxiv icon

ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling

Add code
Feb 04, 2025
Figure 1 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 2 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 3 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 4 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models

Add code
Sep 21, 2024
Figure 1 for Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
Figure 2 for Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
Figure 3 for Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
Figure 4 for Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
Viaarxiv icon

EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations

Add code
Jul 30, 2024
Figure 1 for EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 2 for EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 3 for EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 4 for EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Viaarxiv icon

EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations

Add code
Jul 24, 2024
Figure 1 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 2 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 3 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 4 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Viaarxiv icon