Picture for Mutsumi Nakamura

Mutsumi Nakamura

VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks

Add code
Oct 17, 2024
Viaarxiv icon

Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?

Add code
Jul 20, 2024
Viaarxiv icon

Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models

Add code
Jun 24, 2024
Viaarxiv icon

Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models

Add code
Apr 23, 2024
Viaarxiv icon