2025-04-12
The Dutch scaler performance indicator: How much did my model actually learn?
Publication
Publication
Journal of Classification , Volume 2025
Evaluation metrics provide a means for quantifying and comparing performances of supervised learning models, but drawing meaningful conclusions from acquired scores requires a contextual framework. Our paper addresses this by introducing the Dutch scaler (DS), a novel performance indicator for binary classification models. It quantifies a model’s learning by contextualizing empirical metric scores with a baseline (Dutch draw) and a new instrument (Dutch oracle) representing the prediction quality of an “optimal” classifier. The DS performance indicator expresses the relative contribution of these components to obtain a model’s score, specifying the actual learning quality. We derived closed-form expressions to map metric scores to DS scores for common evaluation metrics and categorized them by their functional form and second derivative. The DS enhances the assessment of classifiers and facilitates a framework to compare prediction quality differences between models with varying metric scores.
Additional Metadata | |
---|---|
, , , , , | |
doi.org/10.1007/s00357-025-09510-9 | |
Journal of Classification | |
Organisation | Stochastics |
van de Bijl, E., Klein, J., Pries, J., Bhulai, S., & van der Mei, R. (2025). The Dutch scaler performance indicator: How much did my model actually learn?. Journal of Classification, 2025. doi:10.1007/s00357-025-09510-9 |