UMBC ebiquity

Entity Disambiguation for Wild Big Data Using Multi-Level Clustering

Authors: Jennifer Sleeman

Book Title: Doctoral Consortium, 14th International Semantic Web Conference

Date: October 12, 2015

Abstract: When RDF instances represent the same entity, they are said to corefer. For example, two nodes from different RDF graphs may both refer to the same individual, the musical artist James Brown. Disambiguating entities is essential for knowledge base population and other tasks that result in the integration or linking of data. Often, however, entity instance data originates from different sources and can be represented using different schemas or ontologies. In the age of Big Data, data can have other characteristics, such as originating from sources that are schema-less or without ontological structure. Our work involves researching new ways to process this type of data in order to perform entity disambiguation. Our approach uses multi-level clustering and includes fine-grained entity type recognition, contextualization of entities, and online processing, which can be supported by a parallel architecture.

Type: InProceedings

Publisher: ISWC 2015 Doctoral Consortium CEUR Proceedings

Tags: entity disambiguation, coreference, entity type recognition, clustering, lda, topic modeling



