Qipeng Guo

Eric

Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models

Feb 09, 2026, via arXiv

Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training

Feb 08, 2026, via arXiv

Prompt Reinjection: Alleviating Prompt Forgetting in Multimodal Diffusion Transformers

Feb 06, 2026, via arXiv

Explicit Multi-head Attention for Inter-head Interaction in Large Language Models

Jan 27, 2026, via arXiv

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Jan 23, 2026, via arXiv

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

Jan 23, 2026, via arXiv

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

Jan 23, 2026, via arXiv

Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment

Jan 20, 2026, via arXiv

How to Set the Learning Rate for Large-Scale Pre-training?

Jan 08, 2026, via arXiv

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Dec 08, 2025, via arXiv