Mehrab Tanjim

Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles

Sep 10, 2025
Via arXiv

Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models

Feb 20, 2025
Via arXiv

GUI Agents: A Survey

Dec 18, 2024
Via arXiv