Picture for Clemens Grange

Clemens Grange

SAVeS: Steering Safety Judgments in Vision-Language Models via Semantic Cues

Add code
Mar 19, 2026
Viaarxiv icon