Picture for Muhao Chen

Muhao Chen

University of California Davis

Unbiased Visual Reasoning with Controlled Visual Inputs

Add code
Dec 19, 2025
Viaarxiv icon

FRIEDA: Benchmarking Multi-Step Cartographic Reasoning in Vision-Language Models

Add code
Dec 08, 2025
Viaarxiv icon

Optimizing Diversity and Quality through Base-Aligned Model Collaboration

Add code
Nov 07, 2025
Figure 1 for Optimizing Diversity and Quality through Base-Aligned Model Collaboration
Figure 2 for Optimizing Diversity and Quality through Base-Aligned Model Collaboration
Figure 3 for Optimizing Diversity and Quality through Base-Aligned Model Collaboration
Figure 4 for Optimizing Diversity and Quality through Base-Aligned Model Collaboration
Viaarxiv icon

ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation

Add code
Oct 09, 2025
Figure 1 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 2 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 3 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 4 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Viaarxiv icon

False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize

Add code
Sep 04, 2025
Figure 1 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 2 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 3 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 4 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Viaarxiv icon

Code Execution as Grounded Supervision for LLM Reasoning

Add code
Jun 12, 2025
Viaarxiv icon

QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA

Add code
Jun 09, 2025
Figure 1 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 2 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 3 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 4 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Viaarxiv icon

DiscoSum: Discourse-aware News Summarization

Add code
Jun 07, 2025
Viaarxiv icon

Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation

Add code
May 29, 2025
Viaarxiv icon

QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining

Add code
May 29, 2025
Viaarxiv icon