Fatih Uenal

Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts

Apr 07, 2026
Via arXiv

Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks

Mar 24, 2026
Via arXiv