Clustering Big Data streams: recent challenges and contributions

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Hassani, Marwan und Seidl, Thomas (2016): Clustering Big Data streams: recent challenges and contributions. In: It-information Technology, Bd. 58, Nr. 4: S. 206-213

Volltext auf 'Open Access LMU' nicht verfügbar.

DOI: 10.1515/itit-2016-0007

Abstract

Traditional clustering algorithms merely considered static data. Today's various applications and research issues in big data mining have however to deal with continuous, possibly infinite streams of data, arriving at high velocity. Web traffic data, surveillance data, sensor measurements and stock trading are only some examples of these daily-increasing applications. Since the growth of data volumes is accompanied by a similar raise in their dimensionalities, clusters cannot be expected to completely appear when considering all attributes together. Subspace clustering is a general approach that solved that issue by automatically finding the hidden clusters within different subsets of the attributes rather than considering all attributes together. In this article, novel methods for an efficient subspace clustering of high-dimensional big data streams are presented. Approaches that efficiently combine the anytime clustering concept with the stream subspace clustering paradigm are discussed. Additionally, efficient and adaptive density-based clustering algorithms are presented for high-dimensional data streams. Novel open-source assessment framework and evaluation measures are additionally presented for subspace stream clustering.

Dokumententyp:	Zeitschriftenartikel
Fakultät:	Mathematik, Informatik und Statistik > Informatik
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
ISSN:	1611-2776
Sprache:	Englisch
Dokumenten ID:	47305
Datum der Veröffentlichung auf Open Access LMU:	27. Apr. 2018 08:12
Letzte Änderungen:	13. Aug. 2024 12:54

Dokument bearbeiten