Picture for Benjamin Van Durme

Benjamin Van Durme

Johns Hopkins University

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Add code
Feb 19, 2025
Viaarxiv icon

LM Agents for Coordinating Multi-User Information Gathering

Add code
Feb 17, 2025
Viaarxiv icon

mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval

Add code
Jan 31, 2025
Figure 1 for mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval
Figure 2 for mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval
Figure 3 for mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval
Figure 4 for mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval
Viaarxiv icon

LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts

Add code
Dec 31, 2024
Viaarxiv icon

From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering

Add code
Dec 24, 2024
Figure 1 for From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering
Figure 2 for From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering
Figure 3 for From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering
Figure 4 for From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering
Viaarxiv icon

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Add code
Dec 17, 2024
Figure 1 for Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
Figure 2 for Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
Figure 3 for Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
Figure 4 for Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
Viaarxiv icon

DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation

Add code
Dec 17, 2024
Figure 1 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 2 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 3 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 4 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Viaarxiv icon

Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass

Add code
Nov 08, 2024
Figure 1 for Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
Figure 2 for Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
Figure 3 for Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
Figure 4 for Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
Viaarxiv icon

Multi-Field Adaptive Retrieval

Add code
Oct 26, 2024
Figure 1 for Multi-Field Adaptive Retrieval
Figure 2 for Multi-Field Adaptive Retrieval
Figure 3 for Multi-Field Adaptive Retrieval
Figure 4 for Multi-Field Adaptive Retrieval
Viaarxiv icon

MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval

Add code
Oct 15, 2024
Viaarxiv icon