Ryan Hsieh

SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?

Mar 16, 2025
Via arXiv

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Jul 06, 2024
Via arXiv