Alicia Parrish

Shammie

A Framework to Assess (Dis)agreement Among Diverse Rater Groups

Nov 09, 2023

"Is a picture of a bird a bird": Policy recommendations for dealing with ambiguity in machine vision models

Jun 27, 2023

Inverse Scaling: When Bigger Isn't Better

Jun 15, 2023

Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs

May 23, 2023

Adversarial Nibbler: A Data-Centric Challenge for Improving the Safety of Text-to-Image Models

May 22, 2023

PaLM 2 Technical Report

May 17, 2023

Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions

Oct 19, 2022

What Do NLP Researchers Believe? Results of the NLP Community Metasurvey

Aug 26, 2022

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022

Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions

Apr 13, 2022