Abstract
Fuzzy matching and ranking are two information retrieval techniques widely used in web search. Their application to structured data, however, remains an open problem. This article investigates how eigenvector-centrality can be used for approximate matching in multi-relation graphs, that is, graphs where connections of many different types may exist. Based on an extension of the PageRank matrix, eigenvectors representing the distribution of a term after propagating term weights between related data items are computed. The result is an index which takes the document structure into account and can be used with standard document retrieval techniques. As the scheme takes the shape of an index transformation, all necessary calculations are performed during index time
Item Type: | Journal article |
---|---|
Form of publication: | Postprint |
Keywords: | approximate matching; structured data; eigenvector centrality; indexing methods; keyword search; social networks; semantic web; PageRank; multi-relation networks; fuzzy matching; fuzzy ranking; information retrieval; web search; structured data; multi-relation graphs; eigenvectors; term distribution; document structure; document retrieval |
Faculties: | Mathematics, Computer Science and Statistics > Computer Science |
Subjects: | 000 Computer science, information and general works > 004 Data processing computer science |
URN: | urn:nbn:de:bvb:19-epub-14913-1 |
ISSN: | 1757-8493 |
Language: | English |
Item ID: | 14913 |
Date Deposited: | 22. Apr 2013, 09:21 |
Last Modified: | 13. Aug 2024, 12:51 |