Tomohiro Sawada

Cascaded Information Disclosure for Generalized Evaluation of Problem Solving Capabilities

Jul 31, 2025
Via arXiv

Towards a Unified Multimodal Reasoning Framework

Dec 22, 2023
Via arXiv

ARB: Advanced Reasoning Benchmark for Large Language Models

Jul 28, 2023
Via arXiv