Alert button
Picture for Dan Roth

Dan Roth

Alert button

Deceiving Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?

Add code
Bookmark button
Alert button
Nov 16, 2023
Bangzheng Li, Ben Zhou, Fei Wang, Xingyu Fu, Dan Roth, Muhao Chen

Viaarxiv icon

Pachinko: Patching Interpretable QA Models through Natural Language Feedback

Add code
Bookmark button
Alert button
Nov 16, 2023
Chaitanya Malaviya, Subin Lee, Dan Roth, Mark Yatskar

Viaarxiv icon

Understanding Calibration for Multilingual Question Answering Models

Add code
Bookmark button
Alert button
Nov 15, 2023
Yahan Yang, Soham Dan, Dan Roth, Insup Lee

Viaarxiv icon

Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets

Add code
Bookmark button
Alert button
Nov 15, 2023
Vatsal Gupta, Pranshu Pandya, Tushar Kataria, Vivek Gupta, Dan Roth

Figure 1 for Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets
Figure 2 for Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets
Figure 3 for Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets
Figure 4 for Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets
Viaarxiv icon

Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations

Add code
Bookmark button
Alert button
Nov 07, 2023
Sihao Chen, Hongming Zhang, Tong Chen, Ben Zhou, Wenhao Yu, Dian Yu, Baolin Peng, Hongwei Wang, Dan Roth, Dong Yu

Figure 1 for Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations
Figure 2 for Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations
Figure 3 for Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations
Figure 4 for Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations
Viaarxiv icon

Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks

Add code
Bookmark button
Alert button
Oct 19, 2023
Xiaodong Yu, Hao Cheng, Xiaodong Liu, Dan Roth, Jianfeng Gao

Figure 1 for Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks
Figure 2 for Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks
Figure 3 for Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks
Figure 4 for Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks
Viaarxiv icon

CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion

Add code
Bookmark button
Alert button
Oct 17, 2023
Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Hantian Ding, Ming Tan, Nihal Jain, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth, Bing Xiang

Viaarxiv icon

SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation

Add code
Bookmark button
Alert button
Sep 29, 2023
Hangfeng He, Hongming Zhang, Dan Roth

Figure 1 for SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation
Figure 2 for SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation
Figure 3 for SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation
Figure 4 for SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation
Viaarxiv icon

ExpertQA: Expert-Curated Questions and Attributed Answers

Add code
Bookmark button
Alert button
Sep 14, 2023
Chaitanya Malaviya, Subin Lee, Sihao Chen, Elizabeth Sieber, Mark Yatskar, Dan Roth

Figure 1 for ExpertQA: Expert-Curated Questions and Attributed Answers
Figure 2 for ExpertQA: Expert-Curated Questions and Attributed Answers
Figure 3 for ExpertQA: Expert-Curated Questions and Attributed Answers
Figure 4 for ExpertQA: Expert-Curated Questions and Attributed Answers
Viaarxiv icon

Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning

Add code
Bookmark button
Alert button
Aug 10, 2023
Alexander Hanbo Li, Mingyue Shang, Evangelia Spiliopoulou, Jie Ma, Patrick Ng, Zhiguo Wang, Bonan Min, William Wang, Kathleen McKeown, Vittorio Castelli, Dan Roth, Bing Xiang

Figure 1 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 2 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 3 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 4 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Viaarxiv icon