Qian Chen

BareWave: Waveform-Native Flow-Matching Text-to-Speech

Jun 08, 2026
Via arXiv

SkillPyramid: A Hierarchical Skill Consolidation Framework for Self-Evolving Agents

Jun 02, 2026
Via arXiv

Look on Demand: A Cognitive Scheduling Framework for Visual Evidence Acquisition in Multimodal Reasoning

May 27, 2026
Via arXiv

WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling

May 07, 2026
Via arXiv

To Fuse or to Drop? Dual-Path Learning for Resolving Modality Conflicts in Multimodal Emotion Recognition

May 06, 2026
Via arXiv

Physics-Informed Conditional Diffusion for Motion-Robust Retinal Temporal Laser Speckle Contrast Imaging

Apr 22, 2026
Via arXiv

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Apr 16, 2026
Via arXiv

Dual-Axis Generative Reward Model Toward Semantic and Turn-taking Robustness in Interactive Spoken Dialogue Models

Apr 16, 2026
Via arXiv

Joint Task Offloading, Inference Optimization and UAV Trajectory Planning for Generative AI Empowered Intelligent Transportation Digital Twin

Apr 09, 2026
Via arXiv

Structure-Dependent Regret and Constraint Violation Bounds for Online Convex Optimization with Time-Varying Constraints

Mar 15, 2026
Via arXiv