Xipeng Qiu

Multi-hop Reasoning via Early Knowledge Alignment

Dec 23, 2025

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Dec 23, 2025

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Dec 08, 2025

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Nov 19, 2025

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Nov 06, 2025

RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization

Nov 06, 2025

MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval

Oct 31, 2025

Towards Global Retrieval Augmented Generation: A Benchmark for Corpus-Level Reasoning

Oct 30, 2025

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Oct 02, 2025

MCM-DPO: Multifaceted Cross-Modal Direct Preference Optimization for Alt-text Generation

Oct 01, 2025