Picture for Michael Hardy

Michael Hardy

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

Add code
Mar 05, 2026
Viaarxiv icon

Knowledge without Wisdom: Measuring Misalignment between LLMs and Intended Impact

Add code
Mar 01, 2026
Viaarxiv icon

Measuring Teaching with LLMs

Add code
Oct 27, 2025
Viaarxiv icon

"All that Glitters": Approaches to Evaluations with Unreliable Model and Human Annotations

Add code
Nov 23, 2024
Figure 1 for "All that Glitters": Approaches to Evaluations with Unreliable Model and Human Annotations
Figure 2 for "All that Glitters": Approaches to Evaluations with Unreliable Model and Human Annotations
Figure 3 for "All that Glitters": Approaches to Evaluations with Unreliable Model and Human Annotations
Figure 4 for "All that Glitters": Approaches to Evaluations with Unreliable Model and Human Annotations
Viaarxiv icon