Large RDF and OWL documents on the Semantic Web
Tim Finin, 12:23pm 26 January 2006Recently Cláudio Fernandes asked on several semantic web mailing lists
“Can someone point me to some huge owl/rdf files? I’m writing a owl parser with different tools, and I’d like to benchmark them all with some really really big files.”
I just ran some queries over Swoogle’s collection of 850K RDF documents collected from the web. Here are the 100 largest RDF documents and OWL documents, respectively. Document size was measured in terms of the number of triples. For this query, a document was considered to be an OWL document if it used a namespace that contained the string OWL.
Curently, the version of Swoogle you get by going to http://swoogle.umbc.edu/ is Swoogle 2. Its database has been trapped in amber since last summer, when it was corrupted, preventing us from adding new data. We put our efforts into a reimplementation, Swoogle 3, which will be released early next week. The data reported here is from Swoogle 3’s database.

January 27th, 2006 at 10:09 am
I gathered up pointers to non-trivially sized RDF files on the web for a while, and even got a domain name for it, but stopped looking last April. It was taking more and more searching to turn up things I hadn’t found before, which were often three-year-old homework projects. What I did find is listed at http://www.rdfdata.org.
There’s a general problem with semantic web work that people are more interested in building ontologies than in accumulating data. Imagine if everyone devoted their energy in the early days of XML to writing DTDs without creating actual XML data.
Bob
December 18th, 2008 at 1:00 pm
[...] Today I noticed that someone tried to post a comment on an ebiquity post on large RDF documents: [...]