Picture for Mengyu Zhou

Mengyu Zhou

MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination

Add code
Mar 25, 2026
Viaarxiv icon

Grounding the Score: Explicit Visual Premise Verification for Reliable Vision-Language Process Reward Models

Add code
Mar 17, 2026
Viaarxiv icon

Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLMReward Models

Add code
Mar 17, 2026
Viaarxiv icon

Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric

Add code
Feb 15, 2026
Viaarxiv icon

SiameseNorm: Breaking the Barrier to Reconciling Pre/Post-Norm

Add code
Feb 08, 2026
Viaarxiv icon

GUI-360$^\circ$: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Add code
Nov 10, 2025
Viaarxiv icon

GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Add code
Nov 06, 2025
Viaarxiv icon

SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large Spreadsheets

Add code
Oct 22, 2025
Viaarxiv icon

Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search

Add code
Sep 11, 2025
Figure 1 for Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
Figure 2 for Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
Figure 3 for Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
Figure 4 for Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
Viaarxiv icon

Bingo: Boosting Efficient Reasoning of LLMs via Dynamic and Significance-based Reinforcement Learning

Add code
Jun 09, 2025
Figure 1 for Bingo: Boosting Efficient Reasoning of LLMs via Dynamic and Significance-based Reinforcement Learning
Figure 2 for Bingo: Boosting Efficient Reasoning of LLMs via Dynamic and Significance-based Reinforcement Learning
Figure 3 for Bingo: Boosting Efficient Reasoning of LLMs via Dynamic and Significance-based Reinforcement Learning
Figure 4 for Bingo: Boosting Efficient Reasoning of LLMs via Dynamic and Significance-based Reinforcement Learning
Viaarxiv icon