Picture for Leon Bergen

Leon Bergen

The Surprising Soupability of Documents in State Space Models

Add code
May 29, 2025
Viaarxiv icon

Quiet Feature Learning in Algorithmic Tasks

Add code
May 06, 2025
Viaarxiv icon

EvidenceBench: A Benchmark for Extracting Evidence from Biomedical Papers

Add code
Apr 25, 2025
Viaarxiv icon

Single-Pass Document Scanning for Question Answering

Add code
Apr 04, 2025
Viaarxiv icon

Measuring Risk of Bias in Biomedical Reports: The RoBBR Benchmark

Add code
Nov 28, 2024
Figure 1 for Measuring Risk of Bias in Biomedical Reports: The RoBBR Benchmark
Figure 2 for Measuring Risk of Bias in Biomedical Reports: The RoBBR Benchmark
Figure 3 for Measuring Risk of Bias in Biomedical Reports: The RoBBR Benchmark
Figure 4 for Measuring Risk of Bias in Biomedical Reports: The RoBBR Benchmark
Viaarxiv icon

Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation

Add code
Nov 01, 2024
Figure 1 for Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Figure 2 for Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Figure 3 for Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Figure 4 for Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Viaarxiv icon

ClimaQA: An Automated Evaluation Framework for Climate Foundation Models

Add code
Oct 22, 2024
Viaarxiv icon

Dissociation of Faithful and Unfaithful Reasoning in LLMs

Add code
May 23, 2024
Figure 1 for Dissociation of Faithful and Unfaithful Reasoning in LLMs
Figure 2 for Dissociation of Faithful and Unfaithful Reasoning in LLMs
Figure 3 for Dissociation of Faithful and Unfaithful Reasoning in LLMs
Figure 4 for Dissociation of Faithful and Unfaithful Reasoning in LLMs
Viaarxiv icon

IR2: Information Regularization for Information Retrieval

Add code
Feb 25, 2024
Viaarxiv icon

BIRCO: A Benchmark of Information Retrieval Tasks with Complex Objectives

Add code
Feb 21, 2024
Viaarxiv icon