Picture for William Lugoloobi

William Lugoloobi

Running the Gauntlet: Re-evaluating the Capabilities of Agents Beyond Familiar Environments

Add code
Jun 12, 2026
Viaarxiv icon

Known By Their Actions: Fingerprinting LLM Browser Agents via UI Traces

Add code
May 14, 2026
Viaarxiv icon

LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations

Add code
Feb 10, 2026
Viaarxiv icon