Picture for Xiaoqi Jian

Xiaoqi Jian

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Add code
Nov 18, 2025
Viaarxiv icon

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Add code
Jun 05, 2025
Viaarxiv icon

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Add code
Mar 06, 2025
Figure 1 for TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation
Figure 2 for TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation
Figure 3 for TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation
Figure 4 for TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation
Viaarxiv icon

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Add code
Feb 18, 2025
Viaarxiv icon