Shir Rozenfeld

GAVEL: Towards rule-based safety through activation monitoring

Jan 29, 2026
Via arXiv

Love, Lies, and Language Models: Investigating AI's Role in Romance-Baiting Scams

Dec 22, 2025
Via arXiv