MeGS: Partitioning Meaningful Subgraph Structures using Minimum Description Length

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Goebl, Sebastian; Tonch, Annika; Böhm, Christian und Plant, Claudia (2016): MeGS: Partitioning Meaningful Subgraph Structures using Minimum Description Length. 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, Catalonia, Spain, 12-15 Dec. 2016. IEEE Computer Soc.. S. 889-894

Volltext auf 'Open Access LMU' nicht verfügbar.

DOI: 10.1109/ICDM.2016.0108

Abstract

How can we fully structure a graph into pieces of meaningful information? Into structures that provide us with insights and carry a meaning beyond simple clustering. How can we also exploit these patterns to compress the graph for fast transmission and easier storage? In many applications of graph analysis like network analysis or medical information extraction we are searching for special patterns. Here, it is not sufficient to extract only parts of the relevant information in a graph, but to understand the complete underlying structure. Therefore, we propose our algorithm MeGS (Partitioning Meaningful Subgraph Structures using Minimum Description Length) to fully understand how a graph is constructed. The most common primitives (clique, hub, tree, bipartite, and sparse) serve as models to split a graph into meaningful structures. Using the principle of Minimum Description Length (MDL) structure types and counts are determined by the best fitting model. These structures achieve the best compression of the adjacency matrix. As result, every node is part of exactly one structure and has an interpretable context. No unknown areas remain in the graph. The higher a model compresses its section of the graph, the stronger its match with the corresponding structural assumption. MeGS, a fast and parameter-free split-and-merge algorithm, automatically finds the optimal structures achieving the best compression. We compare to state-of-the-art algorithms to prove MeGS' ability for interpretation and compression.

Dokumententyp:	Konferenzbeitrag (Bericht)
Fakultät:	Mathematik, Informatik und Statistik > Informatik
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
ISSN:	1550-4786
Sprache:	Englisch
Dokumenten ID:	47371
Datum der Veröffentlichung auf Open Access LMU:	27. Apr. 2018 08:12
Letzte Änderungen:	13. Aug. 2024 12:54

Dokument bearbeiten