Benjamin Van Durme

Johns Hopkins University

All Claims Are Equal, but Some Claims Are More Equal Than Others: Importance-Sensitive Factuality Evaluation of LLM Generations

Oct 08, 2025
Via arXiv

mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

Sep 08, 2025
Via arXiv

Enabling Equitable Access to Trustworthy Financial Reasoning

Aug 28, 2025
Via arXiv

MegaWika 2: A More Comprehensive Multilingual Collection of Articles and their Sources

Aug 05, 2025
Via arXiv

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Jul 15, 2025
Via arXiv

How Grounded is Wikipedia? A Study on Structured Evidential Support

Jun 14, 2025
Via arXiv

Jailbreak Distillation: Renewable Safety Benchmarking

May 28, 2025
Via arXiv

Rank-K: Test-Time Reasoning for Listwise Reranking

May 20, 2025
Via arXiv

Always Tell Me The Odds: Fine-grained Conditional Probability Estimation

May 02, 2025
Via arXiv

MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools

Apr 28, 2025
Via arXiv