From Imbalanced Classification to Supervised Outlier Detection Problems

Imbalanced datasets pose severe challenges in training well performing classifiers. This problem is also prevalent in the domain of outlier detection since outliers occur infrequently and are generally treated as minorities. One simple yet powerful approach is to use autoencoders which are trained on majority samples and then to classify samples based on the reconstruction loss. However, this approach fails to classify samples whenever reconstruction errors of minorities overlap with that of majorities. To overcome this limitation, we propose an adversarial loss function that maximizes the loss of minorities while minimizing the loss for majorities. This way, we obtain a well-separated reconstruction error distribution that facilitates classification. We show that this approach is robust in a wide variety of settings, such as imbalanced data classification or outlier- and novelty detection.

Published in:
ICANN 2020: Artificial Neural Networks and Machine Learning International Conference on Artificial Neural Networks (ICANN)
Type:
Inproceedings
Authors:
M. Lübbering, R. Ramamurthy, M. Gebauer, T. Bell, R. Sifa, C. Bauckhage
Year:
2020

Citation information

M. Lübbering, R. Ramamurthy, M. Gebauer, T. Bell, R. Sifa, C. Bauckhage: From Imbalanced Classification to Supervised Outlier Detection Problems, International Conference on Artificial Neural Networks (ICANN), ICANN 2020: Artificial Neural Networks and Machine Learning, 2020, https://doi.org/10.1007/978-3-030-61609-0_3, Luebbering.etal.2020,

Open BibTeX citation