Alicia Parrish

Shammie

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv

A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models

Mar 18, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

Nov 21, 2023
Via arXiv

A Framework to Assess (Dis)agreement Among Diverse Rater Groups

Nov 09, 2023
Via arXiv

"Is a picture of a bird a bird": Policy recommendations for dealing with ambiguity in machine vision models

Jun 27, 2023
Via arXiv

Inverse Scaling: When Bigger Isn't Better

Jun 15, 2023
Via arXiv

Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs

May 23, 2023
Via arXiv

Adversarial Nibbler: A Data-Centric Challenge for Improving the Safety of Text-to-Image Models

May 22, 2023
Via arXiv

PaLM 2 Technical Report

May 17, 2023
Via arXiv