Xipeng Qiu

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Oct 02, 2025, via arXiv

MCM-DPO: Multifaceted Cross-Modal Direct Preference Optimization for Alt-text Generation

Oct 01, 2025, via arXiv

UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets

Sep 18, 2025, via arXiv

Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue

Sep 18, 2025, via arXiv

Decoupled Proxy Alignment: Mitigating Language Prior Conflict for Multimodal Alignment in MLLM

Sep 18, 2025, via arXiv

Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance

Aug 28, 2025, via arXiv

CodecBench: A Comprehensive Benchmark for Acoustic and Semantic Evaluation

Aug 28, 2025, via arXiv

Building Self-Evolving Agents via Experience-Driven Lifelong Learning: A Framework and Benchmark

Aug 26, 2025, via arXiv

Attention Layers Add Into Low-Dimensional Residual Subspaces

Aug 23, 2025, via arXiv

Dynamic and Generalizable Process Reward Modeling

Jul 23, 2025, via arXiv