Picture for Naseem Machlovi

Naseem Machlovi

GuardEval: A Multi-Perspective Benchmark for Evaluating Safety, Fairness, and Robustness in LLM Moderators

Add code
Dec 22, 2025
Viaarxiv icon

Towards Safer AI Moderation: Evaluating LLM Moderators Through a Unified Benchmark Dataset and Advocating a Human-First Approach

Add code
Aug 09, 2025
Viaarxiv icon