James Burgess

SalArt-VQA: Diagnosing Whether VLMs Understand Salient Artifacts in Generated Images

Jun 10, 2026
Via arXiv

MMBU: A Massive Multi-modal Biomedical Understanding Benchmark to Probe the Perception Capabilities of Vision-Language Models

Jun 04, 2026
Via arXiv

ArtifactLens: Hundreds of Labels Are Enough for Artifact Detection with VLMs

Feb 10, 2026
Via arXiv

Visual Personalization Turing Test

Jan 30, 2026
Via arXiv

PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR

Jan 26, 2026
Via arXiv

Squeezed Diffusion Models

Aug 20, 2025
Via arXiv

Can Large Language Models Match the Conclusions of Systematic Reviews?

May 28, 2025
Via arXiv

MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research

Mar 17, 2025
Via arXiv

Video Action Differencing

Mar 10, 2025
Via arXiv

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Jan 14, 2025
Via arXiv