Daniel Khashabi

Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

Apr 05, 2024
Jingyu Zhang, Marc Marone, Tianjian Li, Benjamin Van Durme, Daniel Khashabi

Via arXiv

SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses

Apr 04, 2024
Dongwei Jiang, Jingyu Zhang, Orion Weller, Nathaniel Weir, Benjamin Van Durme, Daniel Khashabi

Via arXiv

Tur[k]ingBench: A Challenge Benchmark for Web Agents

Mar 21, 2024
Kevin Xu, Yeganeh Kordi, Kate Sanders, Yizhong Wang, Adam Byerly, Jack Zhang, Benjamin Van Durme, Daniel Khashabi

Via arXiv

Dated Data: Tracing Knowledge Cutoffs in Large Language Models

Mar 19, 2024
Jeffrey Cheng, Marc Marone, Orion Weller, Dawn Lawrie, Daniel Khashabi, Benjamin Van Durme

Via arXiv

RORA: Robust Free-Text Rationale Evaluation

Mar 01, 2024
Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu

Via arXiv

AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies

Feb 19, 2024
Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Tiyyala, Nicholas Andrews, Daniel Khashabi

Via arXiv

k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text

Feb 17, 2024
Abe Bohan Hou, Jingyu Zhang, Yichen Wang, Daniel Khashabi, Tianxing He

Via arXiv

The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

Jan 23, 2024
Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi

Via arXiv