Picture for J. D. Zamfirescu-Pereira

J. D. Zamfirescu-Pereira

A Knowledge-Component-Based Methodology for Evaluating AI Assistants

Add code
Jun 09, 2024
Viaarxiv icon

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Add code
Apr 18, 2024
Viaarxiv icon

Trucks Don't Mean Trump: Diagnosing Human Error in Image Analysis

Add code
May 15, 2022
Figure 1 for Trucks Don't Mean Trump: Diagnosing Human Error in Image Analysis
Figure 2 for Trucks Don't Mean Trump: Diagnosing Human Error in Image Analysis
Figure 3 for Trucks Don't Mean Trump: Diagnosing Human Error in Image Analysis
Figure 4 for Trucks Don't Mean Trump: Diagnosing Human Error in Image Analysis
Viaarxiv icon