Picture for Lei Xie

Lei Xie

Nanjing University

YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance

Add code
Mar 25, 2026
Viaarxiv icon

Semantic-Aware Interruption Detection in Spoken Dialogue Systems: Benchmark, Metric, and Model

Add code
Mar 25, 2026
Viaarxiv icon

OmniCodec: Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement

Add code
Mar 21, 2026
Viaarxiv icon

Iterative Learning Control-Informed Reinforcement Learning for Batch Process Control

Add code
Mar 16, 2026
Viaarxiv icon

SoulX-Duplug: Plug-and-Play Streaming State Prediction Module for Realtime Full-Duplex Speech Conversation

Add code
Mar 16, 2026
Viaarxiv icon

Robust Spatiotemporal Motion Planning for Multi-Agent Autonomous Racing via Topological Gap Identification and Accelerated MPC

Add code
Mar 10, 2026
Viaarxiv icon

Vision-Augmented On-Track System Identification for Autonomous Racing via Attention-Based Priors and Iterative Neural Correction

Add code
Mar 10, 2026
Viaarxiv icon

EmoOmni: Bridging Emotional Understanding and Expression in Omni-Modal LLMs

Add code
Feb 25, 2026
Viaarxiv icon

BrainRVQ: A High-Fidelity EEG Foundation Model via Dual-Domain Residual Quantization and Hierarchical Autoregression

Add code
Feb 18, 2026
Viaarxiv icon

AugVLA-3D: Depth-Driven Feature Augmentation for Vision-Language-Action Models

Add code
Feb 11, 2026
Viaarxiv icon