Picture for Noel Thomas

Noel Thomas

FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks

Add code
May 27, 2026
Viaarxiv icon

ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale

Add code
May 23, 2026
Viaarxiv icon

Regime-Conditioned Evaluation in Multi-Context Bayesian Optimization

Add code
May 06, 2026
Viaarxiv icon

ChaosBench-Logic: A Benchmark for Logical and Symbolic Reasoning on Chaotic Dynamical Systems

Add code
Jan 05, 2026
Viaarxiv icon