Wendong Xu

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Sep 09, 2025
Via arXiv

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

May 29, 2025
Via arXiv

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

May 21, 2025
Via arXiv