Harry Coppock

Improving Methodologies for Agentic Evaluations Across Domains: Leakage of Sensitive Information, Fraud and Cybersecurity Threats

Jan 22, 2026
Via arXiv

A Real-World Evaluation of LLM Medication Safety Reviews in NHS Primary Care

Dec 24, 2025
Via arXiv

Establishing Best Practices for Building Rigorous Agentic Benchmarks

Jul 03, 2025
Via arXiv

HiBayES: A Hierarchical Bayesian Modeling Framework for AI Evaluation Statistics

May 08, 2025
Via arXiv

Synthia's Melody: A Benchmark Framework for Unsupervised Domain Adaptation in Audio

Sep 26, 2023
Via arXiv

Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19

Dec 15, 2022
Via arXiv

A large-scale and PCR-referenced vocal audio dataset for COVID-19

Dec 15, 2022
Via arXiv

Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

Dec 15, 2022
Via arXiv

Audio Barlow Twins: Self-Supervised Audio Representation Learning

Sep 28, 2022
Via arXiv

The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes

May 13, 2022
Via arXiv