Andrea Roque

LLM-Based Persuasion Enables Guardrail Override in Frontier LLMs

May 13, 2026
Via arXiv

Measuring Opinion Bias and Sycophancy via LLM-based Coercion

Apr 23, 2026
Via arXiv