Picture for Melanie Mitchell

Melanie Mitchell

Using Counterfactual Tasks to Evaluate the Generality of Analogical Reasoning in Large Language Models

Add code
Feb 14, 2024
Viaarxiv icon

Perspectives on the State and Future of Deep Learning - 2023

Add code
Dec 19, 2023
Viaarxiv icon

Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks

Add code
Nov 26, 2023
Figure 1 for Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
Figure 2 for Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
Figure 3 for Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
Figure 4 for Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
Viaarxiv icon

The ConceptARC Benchmark: Evaluating Understanding and Generalization in the ARC Domain

Add code
May 11, 2023
Figure 1 for The ConceptARC Benchmark: Evaluating Understanding and Generalization in the ARC Domain
Figure 2 for The ConceptARC Benchmark: Evaluating Understanding and Generalization in the ARC Domain
Figure 3 for The ConceptARC Benchmark: Evaluating Understanding and Generalization in the ARC Domain
Figure 4 for The ConceptARC Benchmark: Evaluating Understanding and Generalization in the ARC Domain
Viaarxiv icon

Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report

Add code
Oct 27, 2022
Viaarxiv icon

Embodied, Situated, and Grounded Intelligence: Implications for AI

Oct 24, 2022
Viaarxiv icon

Evaluating Understanding on Conceptual Abstraction Benchmarks

Jun 28, 2022
Figure 1 for Evaluating Understanding on Conceptual Abstraction Benchmarks
Figure 2 for Evaluating Understanding on Conceptual Abstraction Benchmarks
Figure 3 for Evaluating Understanding on Conceptual Abstraction Benchmarks
Figure 4 for Evaluating Understanding on Conceptual Abstraction Benchmarks
Viaarxiv icon

Abstraction for Deep Reinforcement Learning

Feb 18, 2022
Figure 1 for Abstraction for Deep Reinforcement Learning
Viaarxiv icon

Frontiers in Collective Intelligence: A Workshop Report

Dec 13, 2021
Viaarxiv icon

Frontiers in Evolutionary Computation: A Workshop Report

Oct 20, 2021
Viaarxiv icon