Picture for Takeshi Tohyama

Takeshi Tohyama

Uncovering Overconfident Failures in CXR Models via Augmentation-Sensitivity Risk Scoring

Add code
Oct 02, 2025
Viaarxiv icon

WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation

Add code
Oct 16, 2024
Viaarxiv icon