UMBC ebiquity

T2LD - An automatic framework for extracting, interpreting and representing tables as Linked Data

Authors: Varish Mulwad

Date: August 02, 2010

Abstract: We present an automatic framework for extracting, interpreting and generating linked data from tables. In the process of representing tables as linked data, we assign every column header a class label from an appropriate ontology, link table cells (where appropriate) to entities from the Linked Open Data cloud, and identify relations between the columns of the table, which helps us build an overall interpretation of the table. Using the limited evidence a table provides in the form of its headers and the data in its rows and columns, we adopt a novel approach of querying existing knowledge bases such as Wikitology and DBpedia to determine the class labels for table headers. For entity linking, besides querying knowledge bases, we use machine learning algorithms such as support vector machines, together with algorithms that learn to rank the entities in a given candidate set, to link a table cell to an entity. We further use the class labels, the linked entities and information from the knowledge bases to identify relations between columns. We prototyped a system to evaluate our approach against tables obtained from Google Squared, Wikipedia and a set of tables from a dataset that Google shared with us.
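As an illustration of the header-labeling idea in the abstract, the following is a minimal sketch and not the thesis implementation. It assumes a hypothetical `lookup_types` function that returns candidate class names for a cell value (standing in for a query to a knowledge base such as Wikitology or DBpedia) and combines the results with a simple majority vote. The voting rule, the function names and the toy data are all assumptions made for this example.

```python
from collections import Counter
from typing import Callable, Iterable, List, Optional


def label_column(
    cells: Iterable[str],
    lookup_types: Callable[[str], List[str]],
) -> Optional[str]:
    """Return the class label that the most cells in a column vote for.

    `lookup_types` is a hypothetical stand-in for a knowledge-base query;
    the majority vote is an assumption, not the method from the thesis.
    """
    votes: Counter = Counter()
    for cell in cells:
        # Each cell votes once per distinct candidate type returned for it.
        votes.update(set(lookup_types(cell)))
    if not votes:
        return None
    # Highest vote count wins; ties are broken alphabetically so the
    # result is deterministic.
    label, _count = min(votes.items(), key=lambda kv: (-kv[1], kv[0]))
    return label


# Toy data standing in for a real knowledge base (hypothetical values).
TOY_KB = {
    "Baltimore": ["City", "PopulatedPlace"],
    "Boston": ["City"],
    "Philadelphia": ["City", "PopulatedPlace"],
}


def toy_lookup(cell: str) -> List[str]:
    """Return candidate types for a cell, or an empty list if unknown."""
    return TOY_KB.get(cell, [])


column_label = label_column(["Baltimore", "Boston", "Philadelphia"], toy_lookup)
```

In this toy column all three cells vote for `City` and only two vote for `PopulatedPlace`, so `City` is chosen. A real system would also need to handle ambiguous cell strings and weigh more specific classes against more general ones.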

Type: MastersThesis

Publisher: UMBC

Tags: semantic web, linked data, human language technology, entity linking, information retrieval


Number of downloads: 1320

 

 

Related Projects:

Active Project

 Tables to Linked Data.

 

Assertions:

  1. (Publication) T2LD - An automatic framework for extracting, interpreting and representing tables as Linked Data is related to event (Event) T2LD – An automatic framework for extracting, interpreting and representing tables as linked data
