Casey Ford

Alignment Drift in Multimodal LLMs: A Two-Phase, Longitudinal Evaluation of Harm Across Eight Model Releases

Feb 04, 2026
Via arXiv

"Be My Cheese?": Cultural Nuance Benchmarking for Machine Translation in Multilingual LLMs

Feb 04, 2026
Via arXiv

Red Teaming Multimodal Language Models: Evaluating Harm Across Prompt Modalities and Models

Sep 18, 2025
Via arXiv