Picture for Zhihan Zhang

Zhihan Zhang

RADAR: Benchmarking Language Models on Imperfect Tabular Data

Add code
Jun 09, 2025
Viaarxiv icon

Aligning Large Language Models with Implicit Preferences from User-Generated Content

Add code
Jun 04, 2025
Viaarxiv icon

GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments

Add code
Apr 01, 2025
Viaarxiv icon

Rethinking Graph Structure Learning in the Era of LLMs

Add code
Mar 27, 2025
Viaarxiv icon

Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model

Add code
Feb 19, 2025
Viaarxiv icon

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Add code
Feb 12, 2025
Viaarxiv icon

Adaptivity can help exponentially for shadow tomography

Add code
Dec 26, 2024
Viaarxiv icon

MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems

Add code
Oct 18, 2024
Figure 1 for MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems
Figure 2 for MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems
Figure 3 for MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems
Figure 4 for MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems
Viaarxiv icon

On the sample complexity of purity and inner product estimation

Add code
Oct 16, 2024
Viaarxiv icon

Enhancing Mathematical Reasoning in LLMs by Stepwise Correction

Add code
Oct 16, 2024
Figure 1 for Enhancing Mathematical Reasoning in LLMs by Stepwise Correction
Figure 2 for Enhancing Mathematical Reasoning in LLMs by Stepwise Correction
Figure 3 for Enhancing Mathematical Reasoning in LLMs by Stepwise Correction
Figure 4 for Enhancing Mathematical Reasoning in LLMs by Stepwise Correction
Viaarxiv icon