Abstract
In recent years many different subspace clustering algorithms and related methods have been proposed. They promise to not only find hidden structures in data sets, but also to select for each structure the features, which are most prominent. Yet, most of these methods suffer from the same problem: finding a satisfactory clustering result heavily depends on an adequate configuration of the parameters. In case of insufficient parameterization, a result is potentially hard to interpret and might contain hundreds of clusters. For traditional clustering algorithms different ensemble methods have been developed, which mitigate these effects by incorporating multiple clustering outputs into a consensus result. However, most of these methods cannot be straightforwardly adopted to include subspace information. We propose a novel subspace clustering ensemble algorithm SubCluEns(1) based on the minimum description length principle. It allows combining multiple results of subspace and projected clustering algorithms into a consensus clustering.
Item Type: | Conference or Workshop Item (Report) |
---|---|
Faculties: | Mathematics, Computer Science and Statistics > Computer Science |
Subjects: | 000 Computer science, information and general works > 004 Data processing computer science |
ISSN: | 2375-9232 |
Language: | English |
Item ID: | 47375 |
Date Deposited: | 27. Apr 2018, 08:12 |
Last Modified: | 13. Aug 2024, 12:54 |