Evaluating Anomaly Detection Algorithms: The Role of Hyperparameters and Standardized Benchmarks
Anomaly detection is a cornerstone of machine learning with applications spanning healthcare, fraud detection, and scientific discovery. Despite extensive research, fair benchmarking remains a significant challenge due to the unsupervised nature of anomaly detection. Hyperparameter selection, a crucial determinant of algorithm performance, is often overlooked or biased, leading to inflated or misleading results. Current practices, including reliance on default configurations, random choices, or limited optimization, hinder reproducibility and impede progress. This work presents a novel pipeline for standardized hyperparameter optimization in anomaly detection. Leveraging a curated collection of nearly 500 datasets, the largest of its kind, our approach systematically optimizes over 80 hyperparameters for 13 widely used anomaly detection algorithms. Our comparison reveals that the performance variance from hyperparameters often surpasses inter-algorithm differences, emphasizing the need for hyperparameter-specific evaluations. We establish a reproducible foundation for anomaly detection research by providing open-access datasets and code. Our findings not only challenge existing evaluation norms but also pave the way for more robust and reliable comparisons toward better anomaly detection research.
- Published in:
2025 IEEE International Conference on Data Science and Advanced Analytics (DSAA) - Type:
Inproceedings - Year:
2025
Citation information
: Evaluating Anomaly Detection Algorithms: The Role of Hyperparameters and Standardized Benchmarks, 2025 IEEE International Conference on Data Science and Advanced Analytics (DSAA), 2025, Kluettermann.etal.2025b,
@Inproceedings{Kluettermann.etal.2025b,
author={Klüttermann, Simon; Gupta, Shubham; Müller, Emmanuel},
title={Evaluating Anomaly Detection Algorithms: The Role of Hyperparameters and Standardized Benchmarks},
booktitle={2025 IEEE International Conference on Data Science and Advanced Analytics (DSAA)},
year={2025},
abstract={Anomaly detection is a cornerstone of machine learning with applications spanning healthcare, fraud detection, and scientific discovery. Despite extensive research, fair benchmarking remains a significant challenge due to the unsupervised nature of anomaly detection. Hyperparameter selection, a crucial determinant of algorithm performance, is often overlooked or biased, leading to inflated or...}}