Picture for Yifu Chen

Yifu Chen

Diffusion Model as a Generalist Segmentation Learner

Add code
Apr 27, 2026
Viaarxiv icon

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Add code
Apr 16, 2026
Viaarxiv icon

Dual-Axis Generative Reward Model Toward Semantic and Turn-taking Robustness in Interactive Spoken Dialogue Models

Add code
Apr 16, 2026
Viaarxiv icon

Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness

Add code
Mar 16, 2026
Viaarxiv icon

WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models

Add code
Feb 13, 2026
Viaarxiv icon

WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

Add code
May 14, 2025
Viaarxiv icon

WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models

Add code
Feb 20, 2025
Figure 1 for WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models
Figure 2 for WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models
Figure 3 for WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models
Figure 4 for WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models
Viaarxiv icon

Speech Watermarking with Discrete Intermediate Representations

Add code
Dec 18, 2024
Viaarxiv icon

WavChat: A Survey of Spoken Dialogue Models

Add code
Nov 26, 2024
Figure 1 for WavChat: A Survey of Spoken Dialogue Models
Figure 2 for WavChat: A Survey of Spoken Dialogue Models
Figure 3 for WavChat: A Survey of Spoken Dialogue Models
Figure 4 for WavChat: A Survey of Spoken Dialogue Models
Viaarxiv icon

Improving Text-guided Object Inpainting with Semantic Pre-inpainting

Add code
Sep 12, 2024
Viaarxiv icon