Eugenia Kim

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

May 07, 2026
Via arXiv

Expert Evaluation and the Limits of Human Feedback in Mental Health AI Safety Testing

Jan 26, 2026
Via arXiv

Seeking Late Night Life Lines: Experiences of Conversational AI Use in Mental Health Crisis

Dec 29, 2025
Via arXiv

Lessons From Red Teaming 100 Generative AI Products

Jan 13, 2025
Via arXiv