Alert button

NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes

Add code
Bookmark button
Alert button
Jan 12, 2024
Lizhou Fan, Wenyue Hua, Lingyao Li, Haoyang Ling, Yongfeng Zhang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: