Picture for Wendong Xu

Wendong Xu

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Add code
Sep 09, 2025
Figure 1 for LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Figure 2 for LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Figure 3 for LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Figure 4 for LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Viaarxiv icon

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Add code
May 29, 2025
Figure 1 for SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Figure 2 for SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Figure 3 for SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Figure 4 for SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Viaarxiv icon

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Add code
May 21, 2025
Figure 1 for PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
Figure 2 for PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
Figure 3 for PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
Figure 4 for PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
Viaarxiv icon