Benjamin H. Sims

NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models

Jun 25, 2026
Via arXiv