K-means is probabilistically poor

Authors

  • Mieczysław Kłopotek Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland

DOI:

https://doi.org/10.34739/si.2022.27.01

Keywords:

k-means, clustering, probabilistic k-richness

Abstract

Kleinberg introduced the concept of k-richness as a requirement for an algorithm to be a clustering algorithm. The most popular algorithm k means dos not fit this definition because of its probabilistic nature. Hence Ackerman et al. proposed the notion of probabilistic k-richness claiming without proof that k-means has this property. It is proven in this paper, by example, that the version of k-means with random initialization does not have the property probabilistic k-richness, just rebuking Ackeman's claim.

Downloads

Download data is not yet available.

Downloads

Published

2023-01-11

How to Cite

Kłopotek, M. (2023). K-means is probabilistically poor. Studia Informatica. System and Information Technology, 27(2), 5–26. https://doi.org/10.34739/si.2022.27.01