Logo Logo
Hilfe
Hilfe
Switch Language to English

Leiber, Collin ORCID logoORCID: https://orcid.org/0000-0001-5368-5697; Miklautz, Lukas ORCID logoORCID: https://orcid.org/0000-0002-2585-5895; Plant, Claudia ORCID logoORCID: https://orcid.org/0000-0001-5274-8123 und Böhm, Christian ORCID logoORCID: https://orcid.org/0000-0002-2237-9969 (2023): Application of Deep Clustering Algorithms. CIKM '23: The 32nd ACM International Conference on Information and Knowledge Management, Birmingham, United Kingdom, October 21 - 25, 2023. ACM International Conference on Information and Knowledge Management (Hrsg.), In: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, New York, NY, United States: Association for Computing Machinery. S. 5208-5211 [PDF, 1MB]

Abstract

Deep clustering algorithms have gained popularity for clustering complex, large-scale data sets, but getting started is difficult because of numerous decisions regarding architecture, optimizer, and other hyperparameters. Theoretical foundations must be known to obtain meaningful results. At the same time, ease of use is necessary to get used by a broader audience. Therefore, we require a unified framework that allows for easy execution in diverse settings. While this applies to established clustering methods like k-Means and DBSCAN, deep clustering algorithms lack a standard structure, resulting in significant programming overhead. This complicates empirical evaluations, which are essential in both scientific and practical applications. We present a solution to this problem by providing a theoretical background on deep clustering as well as practical implementation techniques and a unified structure with predefined neural networks. For the latter, we use the Python package ClustPy. The aim is to share best practices and facilitate community participation in deep clustering research.

Dokument bearbeiten Dokument bearbeiten