Proceedings of the Tenth Workshop on Web Mining and Web Usage Analysis (WebKDD)

Detecting Commmunities via Simultaneous Clustering of Graphs and Folksonomies

, , and

We present a simple technique for detecting communities by utilizing both the link structure and folksonomy (or tag) information that is readily available in most social media systems. A simple way to describe our approach is by defining a community as a set of nodes in a graph that link more frequently to within this set than outside it and they share similar tags. Our technique is based on the Normalized Cut (NCut) algorithm and can be easily and efficiently implemented. We validate our method by using a real network of blogs and tag information obtained from a social bookmarking site. We also verify our results on a citation network for which we have access to ground truth cluster information. Our method, Simultaneous Cut (SimCut), has the advantage that it can group related tags and cluster the nodes simultaneously.


  • 625665 bytes

  • 10895360 bytes

blog, clustering, folksonomy, social media

InProceedings

ACM

(Held in conjunction with The 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2008))

Downloads: 6999 downloads

Google Scholar Citations: 5 citations

UMBC ebiquity