Mengdi Zhang

Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization

Aug 13, 2025

Simulating Human-Like Learning Dynamics with LLM-Empowered Agents

Aug 07, 2025

Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?

May 22, 2025

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

May 21, 2025

FRAbench and GenEval: Scaling Fine-Grained Aspect Evaluation across Tasks, Modalities

May 19, 2025

Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning

Apr 18, 2025

InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Mar 09, 2025

The Role of Visual Modality in Multimodal Mathematical Reasoning: Challenges and Insights

Mar 06, 2025

MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task

Feb 17, 2025

RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation

Feb 13, 2025