Picture for Brecht Verbeken

Brecht Verbeken

Michael Pokorny

Human-in-the-Loop LLM Grading for Handwritten Mathematics Assessments

Add code
Mar 13, 2026
Viaarxiv icon

Scalable Classification of Course Information Sheets Using Large Language Models: A Reusable Institutional Method for Academic Quality Assurance

Add code
Mar 13, 2026
Viaarxiv icon

Early Evidence of Vibe-Proving with Consumer LLMs: A Case Study on Spectral Region Characterization with ChatGPT-5.2 (Thinking)

Add code
Feb 21, 2026
Viaarxiv icon

Probing the Trajectories of Reasoning Traces in Large Language Models

Add code
Jan 30, 2026
Viaarxiv icon

Benchmarks Saturate When The Model Gets Smarter Than The Judge

Add code
Jan 27, 2026
Viaarxiv icon

Estimating problem difficulty without ground truth using Large Language Model comparisons

Add code
Dec 16, 2025
Viaarxiv icon

How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?

Add code
Apr 03, 2025
Figure 1 for How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?
Figure 2 for How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?
Figure 3 for How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?
Figure 4 for How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?
Viaarxiv icon

Humanity's Last Exam

Add code
Jan 24, 2025
Viaarxiv icon