Tomas Pfister

PI-Hunter: Automated Red-Teaming for Exposing and Localizing Prompt Injections

Jun 10, 2026, via arXiv

The ACUTE Protocol: Operationalizing Language Model Activations for Better Calibration, Utility, and Trust

Jun 05, 2026, via arXiv

LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks

Jun 03, 2026, via arXiv

Converted, Not Equivalent: Benchmarking Codebase Conversion via Observational Equivalence

May 27, 2026, via arXiv

ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence

May 25, 2026, via arXiv

Inductive Deductive Synthesis: Enabling AI to Generate Formally Verified Systems

May 22, 2026, via arXiv

Nexus: An Agentic Framework for Time Series Forecasting

May 14, 2026, via arXiv

LiSA: Lifelong Safety Adaptation via Conservative Policy Induction

May 14, 2026, via arXiv

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

May 11, 2026, via arXiv

SkillOS: Learning Skill Curation for Self-Evolving Agents

May 07, 2026, via arXiv