Picture for Yash Mishra

Yash Mishra

Are Language Models Sensitive to Morally Irrelevant Distractors?

Add code
Feb 10, 2026
Viaarxiv icon