Mengdi Zhang

Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?

May 22, 2025
Via arXiv

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

May 21, 2025
Via arXiv

FRAbench and GenEval: Scaling Fine-Grained Aspect Evaluation across Tasks, Modalities

May 19, 2025
Via arXiv

Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning

Apr 18, 2025
Via arXiv

InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Mar 09, 2025
Via arXiv

The Role of Visual Modality in Multimodal Mathematical Reasoning: Challenges and Insights

Mar 06, 2025
Via arXiv

MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task

Feb 17, 2025
Via arXiv

RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation

Feb 13, 2025
Via arXiv

Bi-Level Graph Structure Learning for Next POI Recommendation

Nov 02, 2024
Via arXiv

LLMScan: Causal Scan for LLM Misbehavior Detection

Oct 23, 2024
Via arXiv