Chenrui Shi

GUI Knowledge Bench: Revealing the Knowledge Gap Behind VLM Failures in GUI Tasks

Oct 30, 2025
Via arXiv

Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning

May 06, 2025
Via arXiv

Iterative Trajectory Exploration for Multimodal Agents

Apr 30, 2025
Via arXiv

MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge

Feb 27, 2025
Via arXiv