UMBC ebiquity research group Building intelligent systems in open, heterogeneous, dynamic, distributed environments
Large RDF and OWL documents on the Semantic Web

Tim Finin, 12:23pm 26 January 2006

Recently Cláudio Fernandes asked on several semantic web mailing lists:

“Can someone point me to some huge owl/rdf files? I’m writing a owl parser with different tools, and I’d like to benchmark them all with some really really big files.”

I just ran some queries over Swoogle’s collection of 850K RDF documents gathered from the web. Here are the 100 largest RDF documents and the 100 largest OWL documents, respectively. Document size was measured by the number of triples. For this query, a document was considered to be an OWL document if it used a namespace that contained the string OWL.
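To make the two measures concrete, here is a minimal sketch in Python of counting triples and applying the "namespace contains OWL" test to a single document. It is not Swoogle's actual query. The function names, the representation of a document as a list of (subject, predicate, object) string tuples, the way a namespace is cut out of a URI, and the case-insensitive match are all assumptions made for illustration.

```python
def namespace_of(uri):
    """Hypothetical helper: everything up to and including the last '#' or '/'."""
    cut = max(uri.rfind("#"), uri.rfind("/"))
    return uri[: cut + 1] if cut >= 0 else uri


def summarize(triples):
    """Return (triple_count, looks_like_owl) for one document.

    Assumption: `triples` is a list of (subject, predicate, object) strings,
    and only terms starting with "http" are treated as URIs.
    """
    namespaces = {
        namespace_of(term)
        for triple in triples
        for term in triple
        if term.startswith("http")
    }
    # Assumption: the substring match ignores case.
    looks_like_owl = any("owl" in ns.lower() for ns in namespaces)
    return len(triples), looks_like_owl


owl_doc = [
    ("http://example.org/zoo#Animal",
     "http://www.w3.org/1999/02/22-rdf-syntax-ns#type",
     "http://www.w3.org/2002/07/owl#Class"),
    ("http://example.org/zoo#Animal",
     "http://www.w3.org/2000/01/rdf-schema#label",
     "Animal"),
]
plain_doc = [
    ("http://example.org/people/alice",
     "http://xmlns.com/foaf/0.1/name",
     "Alice"),
]

print(summarize(owl_doc))
print(summarize(plain_doc))
```

A plain substring test like this is cheap but loose: it would also flag a document whose only match is a namespace with a word such as "knowledge" in it.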

Currently, the version of Swoogle you get by going to http://swoogle.umbc.edu/ is Swoogle 2. Its database has been trapped in amber since last summer, when it was corrupted, preventing us from adding new data. We put our efforts into a reimplementation, Swoogle 3, which will be released early next week. The data reported here is from Swoogle 3’s database.

2 Responses to “Large RDF and OWL documents on the Semantic Web”

  1. Bob DuCharme Says:

    I gathered up pointers to non-trivially sized RDF files on the web for a while, and even got a domain name for it, but stopped looking last April. It was taking more and more searching to turn up things I hadn’t found before, which were often three-year-old homework projects. What I did find is listed at http://www.rdfdata.org.

    There’s a general problem with semantic web work that people are more interested in building ontologies than in accumulating data. Imagine if everyone devoted their energy in the early days of XML to writing DTDs without creating actual XML data.

    Bob

  2. Blog comment spam with plagiarized text: hard to spot Says:

    [...] Today I noticed that someone tried to post a comment on an ebiquity post on large RDF documents: [...]
