Luke Handley

AstroAlertBench: Evaluating the Accuracy, Reasoning, and Honesty of Multimodal LLMs in Astronomical Classification

May 07, 2026
Via arXiv