Alert button
Picture for Peter Clark

Peter Clark

Alert button

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

Add code
Bookmark button
Alert button
Nov 08, 2023
Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot

Figure 1 for Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Figure 2 for Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Figure 3 for Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Figure 4 for Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Viaarxiv icon

ADaPT: As-Needed Decomposition and Planning with Language Models

Add code
Bookmark button
Alert button
Nov 08, 2023
Archiki Prasad, Alexander Koller, Mareike Hartmann, Peter Clark, Ashish Sabharwal, Mohit Bansal, Tushar Khot

Viaarxiv icon

QualEval: Qualitative Evaluation for Model Improvement

Add code
Bookmark button
Alert button
Nov 06, 2023
Vishvak Murahari, Ameet Deshpande, Peter Clark, Tanmay Rajpurohit, Ashish Sabharwal, Karthik Narasimhan, Ashwin Kalyan

Viaarxiv icon

CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization

Add code
Bookmark button
Alert button
Oct 16, 2023
Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Peter Jansen, Oyvind Tafjord, Niket Tandon, Li Zhang, Chris Callison-Burch, Peter Clark

Figure 1 for CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Figure 2 for CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Figure 3 for CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Figure 4 for CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Viaarxiv icon

Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy

Add code
Bookmark button
Alert button
May 24, 2023
Sarah Wiegreffe, Matthew Finlayson, Oyvind Tafjord, Peter Clark, Ashish Sabharwal

Figure 1 for Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy
Figure 2 for Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy
Figure 3 for Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy
Figure 4 for Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy
Viaarxiv icon

Language Models with Rationality

Add code
Bookmark button
Alert button
May 23, 2023
Nora Kassner, Oyvind Tafjord, Ashish Sabharwal, Kyle Richardson, Hinrich Schutze, Peter Clark

Figure 1 for Language Models with Rationality
Figure 2 for Language Models with Rationality
Figure 3 for Language Models with Rationality
Figure 4 for Language Models with Rationality
Viaarxiv icon

IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions

Add code
Bookmark button
Alert button
May 23, 2023
Wenhao Yu, Meng Jiang, Peter Clark, Ashish Sabharwal

Figure 1 for IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions
Figure 2 for IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions
Figure 3 for IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions
Figure 4 for IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions
Viaarxiv icon

Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation

Add code
Bookmark button
Alert button
May 22, 2023
Zhenwen Liang, Wenhao Yu, Tanmay Rajpurohit, Peter Clark, Xiangliang Zhang, Ashwin Kaylan

Figure 1 for Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
Figure 2 for Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
Figure 3 for Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
Figure 4 for Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
Viaarxiv icon

RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs

Add code
Bookmark button
Alert button
May 15, 2023
Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon

Figure 1 for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
Figure 2 for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
Figure 3 for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
Figure 4 for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
Viaarxiv icon

Self-Refine: Iterative Refinement with Self-Feedback

Add code
Bookmark button
Alert button
Mar 30, 2023
Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Sean Welleck, Bodhisattwa Prasad Majumder, Shashank Gupta, Amir Yazdanbakhsh, Peter Clark

Figure 1 for Self-Refine: Iterative Refinement with Self-Feedback
Figure 2 for Self-Refine: Iterative Refinement with Self-Feedback
Figure 3 for Self-Refine: Iterative Refinement with Self-Feedback
Figure 4 for Self-Refine: Iterative Refinement with Self-Feedback
Viaarxiv icon