Usman Naseem

Are Aligned Large Language Models Still Misaligned?

Feb 11, 2026
Via arXiv

From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection

Feb 11, 2026
Via arXiv

Can Large Language Models Make Everyone Happy?

Feb 11, 2026
Via arXiv

Do Large Language Models Reflect Demographic Pluralism in Safety?

Feb 07, 2026
Via arXiv

From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection

Feb 07, 2026
Via arXiv

When the Model Said 'No Comment', We Knew Helpfulness Was Dead, Honesty Was Alive, and Safety Was Terrified

Feb 07, 2026
Via arXiv

PersoPilot: An Adaptive AI-Copilot for Transparent Contextualized Persona Classification and Personalized Response Generation

Feb 04, 2026
Via arXiv

PersoDPO: Scalable Preference Optimization for Instruction-Adherent, Persona-Grounded Dialogue via Multi-LLM Evaluation

Feb 04, 2026
Via arXiv

They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References

Feb 03, 2026
Via arXiv

Robust Harmful Meme Detection under Missing Modalities via Shared Representation Learning

Feb 01, 2026
Via arXiv