Łukasz Borchmann

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Mar 12, 2026
Via arXiv

Language Models Model Language

Oct 14, 2025
Via arXiv

Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA

Apr 15, 2025
Via arXiv

Query and Conquer: Execution-Guided SQL Generation

Mar 31, 2025
Via arXiv

In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Dec 23, 2024
Via arXiv

Tackling prediction tasks in relational databases with LLMs

Nov 18, 2024
Via arXiv

Can Models Help Us Create Better Models? Evaluating LLMs as Data Scientists

Oct 30, 2024
Via arXiv

Arctic-TILT. Business Document Understanding at Sub-Billion Scale

Aug 08, 2024
Via arXiv

Notes on Applicability of GPT-4 to Document Understanding

May 28, 2024
Via arXiv

Document Understanding Dataset and Evaluation (DUDE)

May 15, 2023
Via arXiv