Haoyang Ling

NPHardEval4V: A Dynamic Reasoning Benchmark of Multimodal Large Language Models

Mar 05, 2024
Lizhou Fan, Wenyue Hua, Xiang Li, Kaijie Zhu, Mingyu Jin, Lingyao Li, Haoyang Ling, Jinkui Chi, Jindong Wang, Xin Ma, Yongfeng Zhang

Via arXiv

NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes

Jan 12, 2024
Lizhou Fan, Wenyue Hua, Lingyao Li, Haoyang Ling, Yongfeng Zhang

Via arXiv