Picture for Kavish

Kavish

DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models

Add code
Feb 10, 2025
Figure 1 for DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models
Figure 2 for DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models
Figure 3 for DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models
Figure 4 for DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models
Viaarxiv icon