Marc Wetter

Implicit Intelligence -- Evaluating Agents on What Users Don't Say

Feb 23, 2026
Via arXiv

Intent Laundering: AI Safety Datasets Are Not What They Seem

Feb 17, 2026
Via arXiv

R-ConstraintBench: Evaluating LLMs on NP-Complete Scheduling

Aug 21, 2025
Via arXiv