The LSDIS lab at the University of Georgia has released a new version of the SwetoDblp dataset. It contains about 11M triples that capture the data in DBLP, enriched with other datasets that add relationships to entities such as publishers, companies and universities. Rather than being a simple mapping of DBLP’s flat XML rendering, it’s based on a well-structured ontology with many classes and individuals.
B. Aleman-Meza, F. Hakimpour, I.B. Arpinar, A.P. Sheth: SwetoDblp Ontology of Computer Science Publications, Web Semantics: Science, Services and Agents on the World Wide Web, 2007 (in press).
It’s a great resource, and one we have used in our collaboration with LSDIS, supported by a joint NSF ITR project (ITR-IIS-0325464, ITR-IIS-0325172).
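To make the contrast between a flat DBLP-style record and ontology-backed triples a bit more concrete, here is a minimal sketch in plain Python. Everything in it is an assumption made for illustration: the `ex:` prefix, the class and property names (`ex:Article`, `ex:publishedBy` and so on) and the sample record are hypothetical and are not taken from the actual SwetoDblp ontology or data.

```python
# Hypothetical sample record, shaped like a flat bibliographic entry.
# The names and values are invented; they are not real DBLP data.
FLAT_RECORD = {
    "key": "journals/example/Doe07",
    "author": ["Jane Doe"],
    "title": "An Example Article",
    "publisher": "Example Press",
}


def record_to_triples(record):
    """Turn one flat record into (subject, predicate, object) triples.

    Authors and publishers become typed individuals of their own, so they
    can be linked to from many publications. The ex: vocabulary used here
    is hypothetical, not the SwetoDblp schema.
    """
    pub = "ex:pub/" + record["key"]
    triples = [
        (pub, "rdf:type", "ex:Article"),
        (pub, "ex:title", record["title"]),
    ]
    for name in record["author"]:
        person = "ex:person/" + name.replace(" ", "_")
        triples.append((person, "rdf:type", "ex:Person"))
        triples.append((pub, "ex:author", person))
    publisher = "ex:publisher/" + record["publisher"].replace(" ", "_")
    triples.append((publisher, "rdf:type", "ex:Publisher"))
    triples.append((pub, "ex:publishedBy", publisher))
    return triples


def individuals_by_class(triples):
    """Group subjects by the class named in their rdf:type triples."""
    classes = {}
    for subject, predicate, obj in triples:
        if predicate == "rdf:type":
            classes.setdefault(obj, set()).add(subject)
    return classes


triples = record_to_triples(FLAT_RECORD)
classes = individuals_by_class(triples)
for class_name in sorted(classes):
    print(class_name, sorted(classes[class_name]))
```

In the flat record the publisher is just a string; in the triple form it is an individual with its own identifier and class, which is what lets a dataset attach further relationships to it.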