Picture for Philip Quirke

Philip Quirke

The Capability Frontier: Benchmarks Miss 82% of Model Performance

Add code
Jun 25, 2026
Viaarxiv icon

Ablation-Reversible Heads Don't Transfer: A Stress Test for Mechanistic Role Claims in Transformers

Add code
Jun 06, 2026
Viaarxiv icon

Riemannian-Manifold Steering: Geometry-Aware Generative Autoencoders for Label-Free Steering

Add code
May 24, 2026
Viaarxiv icon

Position: Require Frontier AI Labs To Release Small "Analog" Models

Add code
Oct 15, 2025
Viaarxiv icon

TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research

Add code
Mar 17, 2025
Viaarxiv icon

Increasing Trust in Language Models through the Reuse of Verified Circuits

Add code
Feb 06, 2024
Viaarxiv icon

Understanding Addition in Transformers

Add code
Oct 23, 2023
Figure 1 for Understanding Addition in Transformers
Figure 2 for Understanding Addition in Transformers
Figure 3 for Understanding Addition in Transformers
Figure 4 for Understanding Addition in Transformers
Viaarxiv icon

Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric study

Add code
Jan 23, 2023
Figure 1 for Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric study
Figure 2 for Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric study
Figure 3 for Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric study
Figure 4 for Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric study
Viaarxiv icon