Detecting Spam Blogs: A Machine Learning Approach

Description: Weblogs, or blogs, are an important new way to publish information, engage in discussions, and form communities on the Internet. The Blogosphere has unfortunately been infected by several varieties of spam-like content. Blog search engines, for example, are inundated by posts from splogs – false blogs with machine-generated or hijacked content whose sole purpose is to host ads or raise the PageRank of target sites. We discuss how SVM models based on local and link-based features can be used to detect splogs. We present an evaluation of learned models and their utility to blog search engines, systems that employ techniques differing from those of conventional web search engines. We evaluate the effectiveness of a combination of features, and finally report our informal analysis of a blog search engine index. (AAAI-06 Poster)

Type: Poster
Authors: Pranam Kolari
Date: July 16, 2006
Tags: blog, blogosphere, splog, web spam
Format: HTML
Number of downloads: 421
Access Control: Publicly Available
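The description mentions SVM models trained on local features of a blog. As a rough illustration only, and not the author's actual pipeline, the sketch below trains a linear SVM on bag-of-words counts using plain subgradient descent on the hinge loss. The toy posts, labels, hyperparameters, and helper names (`featurize`, `train_linear_svm`, `predict`) are all hypothetical, and the link-based features are left out entirely.

```python
from collections import Counter

# Hypothetical toy data: +1 marks a splog post, -1 an authentic blog post.
TRAIN = [
    ("cheap pills buy now cheap deals", 1),
    ("today i wrote about my trip to the lake", -1),
    ("buy cheap loans now best deals", 1),
    ("notes on my reading group discussion today", -1),
]

def featurize(text, vocab):
    """Map text to a bag-of-words count vector over a fixed vocabulary.

    Words outside the vocabulary are ignored.
    """
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocab]

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Subgradient descent on the L2-regularised hinge loss.

    Assumption: no bias term, to keep the sketch short.
    """
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * sum(wj * xj for wj, xj in zip(w, xi))
            # Shrink step that comes from the L2 regulariser.
            w = [wj * (1.0 - lr * lam) for wj in w]
            # Hinge-loss step, taken only when the margin is violated.
            if margin < 1.0:
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
    return w

def predict(w, x):
    """Return +1 (splog) for a positive score, otherwise -1."""
    score = sum(wj * xj for wj, xj in zip(w, x))
    return 1 if score > 0 else -1

# Build the vocabulary from the training posts, then fit the model.
vocab = sorted({word for text, _ in TRAIN for word in text.split()})
X = [featurize(text, vocab) for text, _ in TRAIN]
y = [label for _, label in TRAIN]
w = train_linear_svm(X, y)
```

Calling `predict(w, featurize(post, vocab))` on a new post returns `1` for a predicted splog and `-1` otherwise. A realistic detector would need a labelled corpus far larger than these four made-up posts.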