Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!
While the importance of automatic image analysis is increasing at an enormous pace, recent meta-research revealed major flaws with respect to algorithm validation. Specifically, performance metrics are key for objective, transparent and comparative performance assessment, but relatively little attention has been given to the practical pitfalls when using specific metrics for a given image analysis task. A common mission of several international initiatives is therefore to provide researchers with guidelines and tools to choose the performance metrics in a problem-aware manner. This dynamically updated document has the purpose to illustrate important limitations of performance metrics commonly applied in the field of image analysis. The current version is based on a Delphi process on metrics conducted by an international consortium of image analysis experts.
* This is a dynamic paper on limitations of commonly used metrics. The
current version discusses segmentation metrics only, while future versions
will also include metrics for classification and detection. For missing use
cases, comments or questions, please contact a.reinke@dkfz.de or
l.maier-hein@dkfz.de. Substantial contributions to this document will be
acknowledged with a co-authorship