Picture for Lorenzo Pacchiardi

Lorenzo Pacchiardi

Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents

Add code
Jun 10, 2025
Viaarxiv icon

General Scales Unlock AI Evaluation with Explanatory and Predictive Power

Add code
Mar 09, 2025
Viaarxiv icon

Paradigms of AI Evaluation: Mapping Goals, Methodologies and Culture

Add code
Feb 21, 2025
Viaarxiv icon

PredictaBoard: Benchmarking LLM Score Predictability

Add code
Feb 20, 2025
Viaarxiv icon

Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers

Add code
Oct 15, 2024
Viaarxiv icon

100 instances is all you need: predicting the success of a new LLM on unseen data by testing on a few instances

Add code
Sep 05, 2024
Viaarxiv icon

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Add code
Sep 26, 2023
Figure 1 for How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
Figure 2 for How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
Figure 3 for How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
Figure 4 for How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
Viaarxiv icon

Likelihood-Free Inference with Generative Neural Networks via Scoring Rule Minimization

Add code
May 31, 2022
Figure 1 for Likelihood-Free Inference with Generative Neural Networks via Scoring Rule Minimization
Figure 2 for Likelihood-Free Inference with Generative Neural Networks via Scoring Rule Minimization
Figure 3 for Likelihood-Free Inference with Generative Neural Networks via Scoring Rule Minimization
Figure 4 for Likelihood-Free Inference with Generative Neural Networks via Scoring Rule Minimization
Viaarxiv icon

Probabilistic Forecasting with Conditional Generative Networks via Scoring Rule Minimization

Add code
Dec 15, 2021
Figure 1 for Probabilistic Forecasting with Conditional Generative Networks via Scoring Rule Minimization
Figure 2 for Probabilistic Forecasting with Conditional Generative Networks via Scoring Rule Minimization
Figure 3 for Probabilistic Forecasting with Conditional Generative Networks via Scoring Rule Minimization
Figure 4 for Probabilistic Forecasting with Conditional Generative Networks via Scoring Rule Minimization
Viaarxiv icon

Score Matched Conditional Exponential Families for Likelihood-Free Inference

Add code
Jan 15, 2021
Figure 1 for Score Matched Conditional Exponential Families for Likelihood-Free Inference
Figure 2 for Score Matched Conditional Exponential Families for Likelihood-Free Inference
Figure 3 for Score Matched Conditional Exponential Families for Likelihood-Free Inference
Figure 4 for Score Matched Conditional Exponential Families for Likelihood-Free Inference
Viaarxiv icon