UMBC ebiquity

Topic Modeling for RDF Graphs

Authors: Jennifer Sleeman, Tim Finin, and Anupam Joshi

Book Title: 3rd International Workshop on Linked Data for Information Extraction, 14th International Semantic Web Conference

Date: October 12, 2015

Abstract: Topic models are widely used to thematically describe a collection of text documents and have become an important technique for systems that measure document similarity for classification, clustering, segmentation, entity linking and more. While they have been applied to some non-text domains, their use for semi-structured graph data, such as RDF, has been less explored. We present a framework for applying topic modeling to RDF graph data and describe how it can be used in a number of linked data tasks. Since topic modeling builds abstract topics using the co-occurrence of document terms, sparse documents can be problematic, presenting challenges for RDF data. We outline techniques to overcome this problem and the results of experiments in using them. Finally, we show preliminary results of using Latent Dirichlet Allocation generative topic modeling for several linked data use cases.

Type: InProceedings

Publisher: CEUR Workshop Proceedings

Pages: 48-62

Volume: 1267

Tags: rdf, semantic web, topic modeling, lda, coreference resolution, entity disambiguation, entity linking, entity type recognition, ontology mapping, community detection

Google Scholar: search

Number of downloads: 422


Available for download as

size: 1653914 bytes