Alert button
Picture for Benjamin Van Durme

Benjamin Van Durme

Alert button

Dated Data: Tracing Knowledge Cutoffs in Large Language Models

Mar 19, 2024
Jeffrey Cheng, Marc Marone, Orion Weller, Dawn Lawrie, Daniel Khashabi, Benjamin Van Durme

Viaarxiv icon

Tur[k]ingBench: A Challenge Benchmark for Web Agents

Mar 18, 2024
Kevin Xu, Yeganeh Kordi, Kate Sanders, Yizhong Wang, Adam Byerly, Jack Zhang, Benjamin Van Durme, Daniel Khashabi

Viaarxiv icon

A Closer Look at Claim Decomposition

Mar 18, 2024
Miriam Wanner, Seth Ebner, Zhengping Jiang, Mark Dredze, Benjamin Van Durme

Viaarxiv icon

TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning

Mar 11, 2024
Kate Sanders, Nathaniel Weir, Benjamin Van Durme

Viaarxiv icon

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Mar 07, 2024
Boshi Wang, Hao Fang, Jason Eisner, Benjamin Van Durme, Yu Su

Viaarxiv icon

RORA: Robust Free-Text Rationale Evaluation

Mar 01, 2024
Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu

Viaarxiv icon

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic

Feb 27, 2024
Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme

Viaarxiv icon