Xipeng Qiu

How to Set the Batch Size for Large-Scale Pre-training?

Jan 08, 2026

How to Set the Learning Rate for Large-Scale Pre-training?

Jan 08, 2026

WESR: Scaling and Evaluating Word-level Event-Speech Recognition

Jan 08, 2026

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Jan 08, 2026

Multi-hop Reasoning via Early Knowledge Alignment

Dec 23, 2025

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Dec 23, 2025

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Dec 08, 2025

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Nov 19, 2025

RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization

Nov 06, 2025

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Nov 06, 2025