Approximately Pareto-optimal Solutions for Bi-Objective k-Clustering
As a major unsupervised learning method, clustering has received a lot of attention over multiple decades. The various clustering problems that have been studied intensively include, e.g., the $k$-means problem and the $k$-center problem. However, in applications, it is common that good clusterings should optimize multiple objectives (e.g., visualizing data on a map by clustering districts into areas that are both geographically compact but also homogeneous with respect to the data). We study combinations of different objectives, for example optimizing $k$-center and $k$-means simultaneously or optimizing $k$-center with respect to two different metrics. Usually these objectives are conflicting and cannot be optimized simultaneously, making it necessary to find trade-offs. We develop novel algorithms for approximating the set of Pareto-optimal solutions for various combinations of two objectives. Our algorithms achieve provable approximation guarantees and we demonstrate in several experiments that the approximate Pareto front contains good clusterings that cannot be found by considering one of the objectives separately.
- Published in:
Annual Conference on Neural Information Processing Systems - Type:
Inproceedings - Authors:
Arutyunova, Anna; Eube, Jan; Röglin, Heiko; Schmidt, Melanie; Sturm, Sarah; Wargalla, Julian - Year:
2024
Citation information
Arutyunova, Anna; Eube, Jan; Röglin, Heiko; Schmidt, Melanie; Sturm, Sarah; Wargalla, Julian: Approximately Pareto-optimal Solutions for Bi-Objective k-Clustering, Annual Conference on Neural Information Processing Systems, 2024, https://neurips.cc/virtual/2024/poster/95536, Arutyunova.etal.2024a,
@Inproceedings{Arutyunova.etal.2024a,
author={Arutyunova, Anna; Eube, Jan; Röglin, Heiko; Schmidt, Melanie; Sturm, Sarah; Wargalla, Julian},
title={Approximately Pareto-optimal Solutions for Bi-Objective k-Clustering},
booktitle={Annual Conference on Neural Information Processing Systems},
url={https://neurips.cc/virtual/2024/poster/95536},
year={2024},
abstract={As a major unsupervised learning method, clustering has received a lot of attention over multiple decades. The various clustering problems that have been studied intensively include, e.g., the $k$-means problem and the $k$-center problem. However, in applications, it is common that good clusterings should optimize multiple objectives (e.g., visualizing data on a map by clustering districts into...}}