AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability

Feb 14, 2024
Siwei Yang, Bingchen Zhao, Cihang Xie
