Zhanming Shen

Momentum for Reasoning: Dense Intrinsic Signals in Policy Optimization

Jun 07, 2026
Via arXiv

FLaG: Fine-Grained Latent Grouping for Hallucination Detection

May 29, 2026
Via arXiv

From Parameters to Data: A Task-Parameter-Guided Fine-Tuning Pipeline for Efficient LLM Alignment

May 20, 2026
Via arXiv

Backtracking When It Strays: Mitigating Dual Exposure Biases in LLM Reasoning Distillation

May 19, 2026
Via arXiv

Supervised Fine-Tuning Needs to Unlock the Potential of Token Priority

Feb 01, 2026
Via arXiv

Training-Trajectory-Aware Token Selection

Jan 15, 2026
Via arXiv

Merge-of-Thought Distillation

Sep 10, 2025
Via arXiv