Picture for Jaime Raldua Veuthey

Jaime Raldua Veuthey

MEQA: A Meta-Evaluation Framework for Question & Answer LLM Benchmarks

Add code
Apr 18, 2025
Viaarxiv icon

Rethinking CyberSecEval: An LLM-Aided Approach to Evaluation Critique

Add code
Nov 13, 2024
Figure 1 for Rethinking CyberSecEval: An LLM-Aided Approach to Evaluation Critique
Viaarxiv icon