Home  | Publications | MYP+18

Discovering Non-Redundant K-Means Clusterings in Optimal Subspaces

MCML Authors

Christian Böhm

Prof. Dr.

Principal Investigator

* Former Principal Investigator

Abstract

A huge object collection in high-dimensional space can often be clustered in more than one way, for instance, objects could be clustered by their shape or alternatively by their color. Each grouping represents a different view of the data set. The new research field of non-redundant clustering addresses this class of problems. In this paper, we follow the approach that different, non-redundant k-means-like clusterings may exist in different, arbitrarily oriented subspaces of the high-dimensional space. We assume that these subspaces (and optionally a further noise space without any cluster structure) are orthogonal to each other. This assumption enables a particularly rigorous mathematical treatment of the non-redundant clustering problem and thus a particularly efficient algorithm, which we call Nr-Kmeans (for non-redundant k-means). The superiority of our algorithm is demonstrated both theoretically, as well as in extensive experiments.

inproceedings


KDD 2018

24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. London, UK, Aug 19-23, 2018.
Conference logo
A* Conference

Authors

D. Mautz • W. Ye • C. Plant • C. Böhm

Links

DOI

Research Area

 A3 | Computational Models

BibTeXKey: MYP+18

Back to Top