Sophia Xiao Pu

WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction

May 28, 2026

Survive or Collapse: The Asymmetric Roles of Data Gating and Reward Grounding in Self-Play RL

May 21, 2026

Dynamic Evaluation for Oversensitivity in LLMs

Oct 21, 2025