Dan Roth

Shammie

Conceptual and Unbiased Reasoning in Language Models

Mar 30, 2024
Via arXiv

Multi-Agent VQA: Exploring Multi-Agent Foundation Models in Zero-Shot Visual Question Answering

Mar 21, 2024
Via arXiv

From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification

Mar 10, 2024
Via arXiv

Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering

Feb 29, 2024
Via arXiv

DeAL: Decoding-time Alignment for Large Language Models

Feb 05, 2024
Via arXiv

Code Representation Learning At Scale

Feb 02, 2024
Via arXiv

Pachinko: Patching Interpretable QA Models through Natural Language Feedback

Nov 16, 2023
Via arXiv

Deceiving Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?

Nov 16, 2023
Via arXiv

Understanding Calibration for Multilingual Question Answering Models

Nov 15, 2023
Via arXiv

Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets

Nov 15, 2023
Via arXiv