The influence of selected metrics on the result of an examination of clusters

Łukasz Paśko, Galina Setlak


The aim of this paper is to present metrics used to measure the distance between objects in a feature space. The analyses were performed on seven datasets. For each of them, the occurrence of clusters of similar objects was examined, and measures of cluster dispersion were calculated. The calculations were carried out using fourteen metrics known from the literature. The article presents selected results, with particular emphasis on the differences arising from the use of different metrics.


data mining; metrics; measures of the quality of clusters

