Weihan Peng

HEART-Bench: Do LLM Agents Exhibit Human-like Psychology?

May 28, 2026
Via arXiv

SWE-QA: Can Language Models Answer Repository-level Code Questions?

Sep 18, 2025
Via arXiv