Picture for Junjie Mu

Junjie Mu

Probing Social Identity Bias in Chinese LLMs with Gendered Pronouns and Social Groups

Add code
Oct 08, 2025
Viaarxiv icon

AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions

Add code
Jun 17, 2025
Viaarxiv icon

Pushing the Limits of Safety: A Technical Report on the ATLAS Challenge 2025

Add code
Jun 14, 2025
Viaarxiv icon