Picture for Preslav Nakov

Preslav Nakov

MBZUAI

Schützen: Evaluating LLM Safety in Bulgarian and German Contexts

Add code
Jun 09, 2026
Viaarxiv icon

TABVERSE: Benchmarking Cross-Format Table Understanding in LLMs and VLMs

Add code
Jun 08, 2026
Viaarxiv icon

SurgiQ: A Large-Scale Multi-Domain Benchmark for Evaluating Surgical Understanding in Large Language Models

Add code
Jun 06, 2026
Viaarxiv icon

UrduMMLU: A Massive Multitask Benchmark for Urdu Language Understanding

Add code
Jun 05, 2026
Viaarxiv icon

ThinkBooster: A Unified Framework for Seamless Test-Time Scaling of LLM Reasoning

Add code
Jun 05, 2026
Viaarxiv icon

Better with Experience: Self-Evolving LLM Agents for Evidence-Grounded Health Community Notes

Add code
Jun 01, 2026
Viaarxiv icon

Uncovering Temporal Framing in the News

Add code
May 29, 2026
Viaarxiv icon

The Geometry of Forgetting: Temporal Knowledge Drift as an Independent Axis in LLM Representations

Add code
May 09, 2026
Viaarxiv icon

FMI_SU_Yotkova_Kastreva at SemEval-2026 Task 13: Lightweight Detection of LLM-Generated Code via Stylometric Signals

Add code
May 05, 2026
Viaarxiv icon

The Cylindrical Representation Hypothesis for Language Model Steering

Add code
May 03, 2026
Viaarxiv icon