Qingxiu Dong

Decoding in Geometry: Alleviating Embedding-Space Crowding for Complex Reasoning

Jan 30, 2026

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Jan 13, 2026

The Era of Agentic Organization: Learning to Organize with Language Models

Oct 30, 2025

Reinforcement Pre-Training

Jun 09, 2025

Think Only When You Need with Large Hybrid-Reasoning Models

May 21, 2025

Reward Reasoning Model

May 20, 2025

RICo: Refined In-Context Contribution for Automatic Instruction-Tuning Data Selection

May 18, 2025

SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning

May 16, 2025

ICon: In-Context Contribution for Automatic Data Selection

May 08, 2025

Scaling Laws of Synthetic Data for Language Models

Mar 26, 2025