Picture for Masanari Oi

Masanari Oi

From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models

Add code
Feb 09, 2026
Viaarxiv icon

DISCODE: Distribution-Aware Score Decoder for Robust Automatic Evaluation of Image Captioning

Add code
Dec 16, 2025
Viaarxiv icon