Neel Jay

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Jun 05, 2025
Via arXiv

Evaluating Precise Geolocation Inference Capabilities of Vision Language Models

Feb 20, 2025
Via arXiv