Picture for Ariel Weizman

Ariel Weizman

The Self-Execution Benchmark: Measuring LLMs' Attempts to Overcome Their Lack of Self-Execution

Add code
Aug 17, 2025
Viaarxiv icon

Fool Me, Fool Me: User Attitudes Toward LLM Falsehoods

Add code
Dec 16, 2024
Viaarxiv icon