Proceedings of the 5th International Semantic Web Conference
Characterizing the Semantic Web on the Web
November 5, 2006
Semantic Web languages are being used to represent, encode and exchange semantic data in many contexts beyond the Web -- in databases, multiagent systems, mobile computing, and ad hoc networking environments. The core paradigm, however, remains what we call the {em Web aspect} of the Semantic Web -- its use by independent and distributed agents who publish and consume data on the World Wide Web. To better understand this central use case, we have harvested and analyzed a collection of Semantic Web documents from an estimated ten million available on the Web. Using a corpus of more than 1.7 million documents comprising over 300 million RDF triples, we describe a number of global metrics, properties and usage patterns. Most of the metrics, such as the size of Semantic Web documents and the use frequency of Semantic Web terms, were found to follow a power law distribution.
InProceedings
Downloads: 6929 downloads
Google Scholar Citations: 82 citations