Alert button
Picture for Benjamin Van Durme

Benjamin Van Durme

Alert button

AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees

Add code
Bookmark button
Alert button
Apr 12, 2024
William Fleshman, Aleem Khan, Marc Marone, Benjamin Van Durme

Viaarxiv icon

Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

Add code
Bookmark button
Alert button
Apr 05, 2024
Jingyu Zhang, Marc Marone, Tianjian Li, Benjamin Van Durme, Daniel Khashabi

Viaarxiv icon

SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses

Add code
Bookmark button
Alert button
Apr 04, 2024
Dongwei Jiang, Jingyu Zhang, Orion Weller, Nathaniel Weir, Benjamin Van Durme, Daniel Khashabi

Viaarxiv icon

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Add code
Bookmark button
Alert button
Mar 22, 2024
Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan, Benjamin Van Durme, Dawn Lawrie, Luca Soldaini

Viaarxiv icon

Tur[k]ingBench: A Challenge Benchmark for Web Agents

Add code
Bookmark button
Alert button
Mar 21, 2024
Kevin Xu, Yeganeh Kordi, Kate Sanders, Yizhong Wang, Adam Byerly, Jack Zhang, Benjamin Van Durme, Daniel Khashabi

Figure 1 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 2 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 3 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 4 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Viaarxiv icon

Dated Data: Tracing Knowledge Cutoffs in Large Language Models

Add code
Bookmark button
Alert button
Mar 19, 2024
Jeffrey Cheng, Marc Marone, Orion Weller, Dawn Lawrie, Daniel Khashabi, Benjamin Van Durme

Figure 1 for Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Figure 2 for Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Figure 3 for Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Figure 4 for Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Viaarxiv icon

A Closer Look at Claim Decomposition

Add code
Bookmark button
Alert button
Mar 18, 2024
Miriam Wanner, Seth Ebner, Zhengping Jiang, Mark Dredze, Benjamin Van Durme

Figure 1 for A Closer Look at Claim Decomposition
Figure 2 for A Closer Look at Claim Decomposition
Figure 3 for A Closer Look at Claim Decomposition
Figure 4 for A Closer Look at Claim Decomposition
Viaarxiv icon

TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning

Add code
Bookmark button
Alert button
Mar 11, 2024
Kate Sanders, Nathaniel Weir, Benjamin Van Durme

Viaarxiv icon

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Add code
Bookmark button
Alert button
Mar 07, 2024
Boshi Wang, Hao Fang, Jason Eisner, Benjamin Van Durme, Yu Su

Figure 1 for LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Figure 2 for LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Figure 3 for LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Figure 4 for LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Viaarxiv icon