Proceedings of the 5th International Semantic Web Conference

Characterizing the Semantic Web on the Web


Semantic Web languages are being used to represent, encode and exchange semantic data in many contexts beyond the Web: in databases, multiagent systems, mobile computing, and ad hoc networking environments. The core paradigm, however, remains what we call the "Web aspect" of the Semantic Web, that is, its use by independent and distributed agents who publish and consume data on the World Wide Web. To better understand this central use case, we have harvested and analyzed a collection of Semantic Web documents from an estimated ten million available on the Web. Using a corpus of more than 1.7 million documents comprising over 300 million RDF triples, we describe a number of global metrics, properties and usage patterns. Most of the metrics, such as the size of Semantic Web documents and the usage frequency of Semantic Web terms, were found to follow a power law distribution.
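As a rough illustration of what a power law claim involves, the sketch below estimates the exponent of a distribution p(x) ~ x^(-alpha) from a list of observed values, using the standard continuous maximum-likelihood estimator. It is not the method used in the paper, which this abstract does not describe. The function names, the cutoff x_min, and the synthetic "document sizes" are all assumptions made for the example; no data from the corpus is used.

```python
import math
import random


def estimate_power_law_exponent(values, x_min):
    """Maximum-likelihood estimate of alpha for p(x) ~ x^(-alpha), x >= x_min.

    Uses the continuous estimator alpha = 1 + n / sum(ln(x_i / x_min)).
    """
    tail = [v for v in values if v >= x_min]
    if not tail:
        raise ValueError("no observations at or above x_min")
    log_sum = sum(math.log(v / x_min) for v in tail)
    if log_sum == 0:
        raise ValueError("all observations equal x_min; exponent is undefined")
    return 1.0 + len(tail) / log_sum


def sample_power_law(n, alpha, x_min, rng):
    """Draw n synthetic values by inverse-transform sampling (hypothetical data)."""
    return [
        x_min * (1.0 - rng.random()) ** (-1.0 / (alpha - 1.0))
        for _ in range(n)
    ]


# Synthetic stand-in for measured document sizes; the true exponent is 2.5.
rng = random.Random(0)
synthetic_sizes = sample_power_law(50_000, alpha=2.5, x_min=1.0, rng=rng)
alpha_hat = estimate_power_law_exponent(synthetic_sizes, x_min=1.0)
print(f"estimated exponent: {alpha_hat:.2f}")
```

With real measurements, the choice of x_min matters: the power law typically holds only in the tail, so the estimate should be checked for several cutoffs rather than trusted at a single value.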



information retrieval, owl, rdf, semantic web, swoogle

InProceedings


Google Scholar Citations: 82 citations

UMBC ebiquity