Zirui Gao

Paul Scherrer Institute, Brookhaven National Laboratory

Do MLLMs Understand Pointing? Benchmarking and Enhancing Referential Reasoning in Egocentric Vision

Apr 23, 2026 (via arXiv)

Towards Scalable Web Accessibility Audit with MLLMs as Copilots

Nov 05, 2025 (via arXiv)

Learning neural representations for X-ray ptychography reconstruction with unknown probes

Sep 04, 2025 (via arXiv)

Single-shot X-ray ptychography as a structured illumination method

Oct 24, 2024 (via arXiv)