<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>UMBC ebiquity &#187; Pranam Kolari</title>
	<atom:link href="http://ebiquity.umbc.edu/blogger/author/pranam-kolari/feed/" rel="self" type="application/rss+xml" />
	<link>http://ebiquity.umbc.edu/blogger</link>
	<description>EBB is the ebiquity research group\\\'s blog at the University of Maryland, Baltimore County (UMBC).  We focus on technologies that facilitate the design, implementation and control of distributed, intelligent information systems -- mobile and pervasive computing, ad hoc networking, multiagent systems, knowledge representation and reasoning, and the semantic web.  As the tides of technology ebb and flow, we hope the good ideas wash up on our beach and the bad ones drift back out to sea.</description>
	<lastBuildDate>Fri, 20 Nov 2009 13:50:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Twitter, twitter, twitter..</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/04/26/twitter-twitter-twitter/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/04/26/twitter-twitter-twitter/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1147</guid>
		<description><![CDATA[Just couldn&#8217;t resist reposting this &#8211;

GapingVoid, via kellypuffs.
]]></description>
			<content:encoded><![CDATA[<p>Just couldn&#8217;t resist reposting this &#8211;</p>
<div style="text-align: center"><img src="http://www.gapingvoid.com/history76156-thumb.jpg" /></div>
<p><a href="http://www.gapingvoid.com/Moveable_Type/archives/003881.html">GapingVoid</a>, via <a href="http://kellypuffs.wordpress.com/2007/04/20/history-of-my-blog/">kellypuffs</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/04/26/twitter-twitter-twitter/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Spam in Blogs and Social Media</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/04/03/spam-in-blogs-and-social-media/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/04/03/spam-in-blogs-and-social-media/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1111</guid>
		<description><![CDATA[We recently presented a tutorial on Spam in Blogs and Social Media at ICWSM.
Spam on the Internet dates back over a decade, with its earliest known appearance as an email about the infamous MAKE.MONEY.FAST. campaign. Spam has co-evolved with Internet applications and is now quite common on the World-Wide Web.
As social media systems such as [...]]]></description>
			<content:encoded><![CDATA[<p>We recently presented a <a href="http://www.icwsm.org/tutorials.html#tut2">tutorial</a> on <a href="http://ebiquity.umbc.edu/resource/html/id/216/Spam-in-Blogs-and-Social-Media">Spam in Blogs and Social Media</a> at ICWSM.</p>
<blockquote><p>Spam on the Internet dates back over a decade, with its earliest known appearance as an email about the infamous MAKE.MONEY.FAST. campaign. Spam has co-evolved with Internet applications and is now quite common on the World-Wide Web.</p>
<p>As social media systems such as blogs, wikis and bookmark sharing sites have emerged, spammers have quickly developed techniques to infect them as well. The very characteristics underlying the Web, be it version 1.0, 2.0 or 3.0, also enable new varieties of spam.</p>
<p>This tutorial will detail the problem of spam in social media, with an emphasis on spam in blogs. We will discuss different types of spam, the motivation for spammers and the seriousness and nature of the problem. We will then share some of our discussions with groups facing and tackling this issue. The second half of the tutorial will present an overview of spam detection and elimination efforts with an extensive survey of detecting blog spam. The tutorial will help researchers understand the unique aspects of spam in social media, recognize how it affects analytics and information extraction, and identify current research challenges. For practitioners, it will provide strategies for controlling the problem, and identify areas of collaboration across communities.</p>
</blockquote>
<p>We have made the slides <a href="http://ebiquity.umbc.edu/resource/html/id/216/Spam-in-Blogs-and-Social-Media">available online</a> in PDF. As always, we welcome suggestions and/or comments.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/04/03/spam-in-blogs-and-social-media/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hitwise on Fast Growing Social Networks &#8211; Implications</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/03/14/hitwise-on-fast-growing-social-networks-implications/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/03/14/hitwise-on-fast-growing-social-networks-implications/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1066</guid>
		<description><![CDATA[Hitwise is reporting numbers on social network usage among Web users. This is what stood out:
The market share of visits to the custom category of the top 20 social networking sites increased by 11.5% from January 2007 to February 2007. Year-over-year (February 2006 &#8211; February 2007) category traffic was up 87%.

This leads to an interesting [...]]]></description>
			<content:encoded><![CDATA[<p>Hitwise is <a href="http://weblogs.hitwise.com/leeann-prescott/2007/03/buzznet_and_imeem_fast_growing.html">reporting numbers</a> on social network usage among Web users. This is what stood out:</p>
<blockquote><p>The market share of visits to the custom category of the top 20 social networking sites increased by 11.5% from January 2007 to February 2007. Year-over-year (February 2006 &#8211; February 2007) category traffic was up 87%.</p>
</blockquote>
<p>This leads to an interesting question on evolving behavior of Web users. At any given point of time, consider user attention to be at one of these categories of content:</p>
<ol>
<li>Social Networking Sites</li>
<li>Commerce Sites</li>
<li>Feed Readers</li>
<li>Social Content (Blogs, Wikipedia etc..)</li>
<li>Contextual Advertisements</li>
<li>Organic Search Results</li>
<li>Rest of the Web</li>
</ol>
<p>It&#8217;s well known that traffic to the first five categories is either growing or stable.  So which of the last two categories is this growth biting into? Either case, we might soon see a headliner from Hitwise that goes &#8212; &#8220;<strong>Less users searching on the Web&#8221;, </strong>or something similar.</p>
<p>So what does this mean to Google et al &#8212; of course less revenue from self-hosted ads, and consequently reduced margins. Solution &#8212; Buy Social Networking Sites and offer new services, a trend that will (and better) continue.</p>
<p>(Via <a href="http://www.micropersuasion.com/2007/03/social_nets_acc.html">Micropersuasion</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/03/14/hitwise-on-fast-growing-social-networks-implications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>eBiquity now on Twitter</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/03/06/ebiquity-now-on-twitter/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/03/06/ebiquity-now-on-twitter/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1038</guid>
		<description><![CDATA[The growth of Twitter has been phenomenal over the last couple of months. While its utility is argued by some, current traction suggests this could be another Web 2.0 winner.
  Though we were initially circumspect (as were many others), we decided to take the plunge last week. See what we are upto now on [...]]]></description>
			<content:encoded><![CDATA[<p>The growth of <a href="http://twitter.com/ebiquity/with_friends">Twitter</a> has been phenomenal over the last couple of months. While its utility is argued by some, current traction suggests this could be another Web 2.0 winner.</p>
<p><center> <img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads//2007/03/twitter.png" /> </center>Though we were initially circumspect (as were many others), we decided to take the plunge last week. See what we are upto now on our blog sidebar, or <a href="http://twitter.com/ebiquity/with_friends">follow us</a> directly at twitter.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/03/06/ebiquity-now-on-twitter/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The I&#8217;s of the Blogosphere</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/02/25/the-is-of-the-blogosphere/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/02/25/the-is-of-the-blogosphere/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=1009</guid>
		<description><![CDATA[The token &#8220;I&#8221; (1, 2) can provide interesting cues on the Blogosphere, other than signifying the obvious personal nature of blog posts. &#8220;I&#8221;  sometimes use it to study the growth of the blogosphere (between David Sifry reports ofcourse), or just for fun to see how frequently indices of blog search engines are updated and [...]]]></description>
			<content:encoded><![CDATA[<p>The token &#8220;I&#8221; (<a href="http://blogpulse.com/search?query=I">1</a>, <a href="http://technorati.com/search/I">2</a>) can provide interesting cues on the Blogosphere, other than signifying the obvious personal nature of blog posts. &#8220;I&#8221;  sometimes use it to study the growth of the blogosphere (between David Sifry reports ofcourse), or just for fun to see how frequently indices of blog search engines are updated and if any of them are in a &#8220;breather&#8221; mode.</p>
<p>Two charts on the distribution of &#8220;I&#8221; in blog posts, one from BlogPulse and the other from Technorati.</p>
<p><center><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads//2007/02/blogpulsei.png" /></center></p>
<div style="text-align: center"><img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads//2007/02/technoratii.png" /></div>
<p>BlogPulse reports that around 45% of all postings feature an &#8220;I&#8221;. Technorati indexes around 400000 posts featuring &#8220;I&#8221; per day. Merging the two data points Technorati indexes around 900000 posts per day, or rather around 40000 posts per hour, a number which has seen no change for almost a year. Nothing new here, the English blogosphere has plateaued. What&#8217;s confusing is that this analysis does not correlate with David SIfry&#8217;s <a href="http://www.sifry.com/alerts/Slide0005-12-tm.png">number from October 2006</a>, with around 1.3 Million postings per day, putting off my analysis by around 50%. What am I missing here?</p>
<p>As an aside this brings to question the growth of blogs in non-US English speaking geographies, <a href="http://ebiquity.umbc.edu/blogger/2006/12/14/blogging-to-peak-in-2007-they-must-be-right/">India for instance</a>.</p>
<p>Of course the same analysis can be done with other keywords, but neither of them give the coverage , nor are they as temporally independent as &#8220;I&#8221;.  Any other interesting uses of buzz charts?</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/02/25/the-is-of-the-blogosphere/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>AIRWeb 2007: Papers Due February 23</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/02/11/airweb-2007-papers-due-february-23/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/02/11/airweb-2007-papers-due-february-23/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=981</guid>
		<description><![CDATA[AIRWeb 2007 is third in a series of workshops on Adversarial Information Retrieval on the Web. This year the workshop also features a web spam challenge.
AIRWeb is a series of international workshops focusing on Adversarial Information Retrieval on the Web that brings together both researchers and industry practitioners, to present and discuss advances in the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://airweb.cse.lehigh.edu/2007/">AIRWeb 2007</a> is third in a series of workshops on Adversarial Information Retrieval on the Web. This year the workshop also features a <a href="http://webspam.lip6.fr/wiki/pmwiki.php">web spam challenge</a>.</p>
<blockquote><p>AIRWeb is a series of international workshops focusing on Adversarial Information Retrieval on the Web that brings together both researchers and industry practitioners, to present and discuss advances in the state of the art. This year, AIRWeb&#8217;2007 will be co-located with the WWW&#8217;07 conference in Banff, Canada. The workshop will include a Web Spam challenge that will test different spam detection techniques on a shared reference collection.</p>
</blockquote>
<p>The <a href="http://airweb.cse.lehigh.edu/2007/cfp.html">call for papers</a> lists a interesting set of problems, including new one&#8217;s like malicious tagging.</p>
<blockquote><p>* Link spam<br />* Content spam<br />* Cloaking<br />* Comment spam<br />* Spam-oriented blogging<br />* Click fraud detection<br />* Reverse engineering of ranking algorithms<br />* Web content filtering<br />* Advertisement blocking<br />* Stealth crawling<br />* Malicious tagging</p>
</blockquote>
<p>Web spam is one area where research is highly influenced by discussions with practitioners. With <a href="http://labs.live.com/Live+Labs+Supports+AIRWeb+2007+Workshop+Colocated+With+WWW+2007.aspx">sponsorship</a> and <a href="http://www.mattcutts.com/blog/call-for-papers-airweb-2007/">involvement</a> from industry leaders, this should be a great venue to seek inputs. We plan to submit a paper on our <a href="http://ebiquity.umbc.edu/tag/splog">continuing work on splogs</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/02/11/airweb-2007-papers-due-february-23/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pings, Spings, Splogs and the Splogosphere: 2007 Updates</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/02/01/pings-spings-splogs-and-the-splogosphere-2007-updates/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/02/01/pings-spings-splogs-and-the-splogosphere-2007-updates/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=960</guid>
		<description><![CDATA[We present some updates on the Splogosphere as seen at a pingserver (weblogs.com). This follows our study from a year earlier which reported on splogs in the English speaking blogosphere. Our current update is based on 8.8 million pings on weblogs.com between January 23rd and January 26th. Though not fully representative, it does give a [...]]]></description>
			<content:encoded><![CDATA[<p>We present some updates on the Splogosphere as seen at a pingserver (<a title="weblogs.com" href="http://weblogs.com">weblogs.com</a>). This follows our <a href="http://ebiquity.umbc.edu/blogger/?p=429">study</a> from a year earlier which reported on splogs in the English speaking blogosphere. Our current update is based on 8.8 million pings on weblogs.com between January 23rd and January 26th. Though not fully representative, it does give a good sense of spam in the indexed blogosphere.</p>
<p><strong> (i) 53% of all pings is spam, 64% of all pings from blogs in English is spam. </strong> <a href="http://ebiquity.umbc.edu/blogger/?p=429">A year earlier</a> we found that close to 75% of all pings from English blogs are spings. Dave Sifry reported on seeing 70% spings in his <a href="http://www.sifry.com/alerts/archives/000443.html">last report</a>. Clearly the growth of spings has plateaued, one less thing to worry about.</p>
<p><center>  <img alt="ping_all" src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2007/02/ping_all.png" /> </center><strong>(ii) 56% of all pinging blogs are spam. </strong> By collapsing these pings to their respective blogs, we chart the distribution of authentic blogs against splogs. These numbers have seen no change, <a href="http://www.wired.com/wired/archive/14.09/splogs.html">56% of all pinging blogs are splogs</a>. <center> <img alt="ping_unique" src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2007/02/ping_unique.png" />  </center><strong> (iii) MySpace is now the biggest contributor to the blogosphere</strong>. The other key driver LiveJournal and blogs managed by SixApart (as seen at their <a title="update stream" href="http://updates.sixapart.com">update stream</a>) contribute only 50-60% of what MySpace does. The growth of MySpace blogs has in fact dwarfed the growth of splogs! Further if MySpace is discounted in our analysis close to 84% of all pings are spings! Though MySpace is relatively splog free, we are beginning to <a href="http://ebiquity.umbc.edu/blogger/2006/11/27/sploggers-compromising-myspace/">notice splogs</a>, something blog harvesters should keep an eye on. [Note that not all blogspot blogs ping weblogs.com] <center> <img alt="ping_host" src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2007/02/ping_hosts.png" /> </center><strong> (iv) Blogspot continues to be heavily spammed</strong>. Most of this spam however is now detected by blog search engines, a point also shared by <a href="http://www.mattcutts.com/blog/update-on-blog-spam/">Matt Cutts</a> and <a href="http://www.kbcafe.com/spam/?guid=20061201203459">Randy Morin</a>. In all of the pings we processed, 51% blogspot blogs were spam!</p>
<p><strong>(v) Most spam blogs are still hosted in the US.</strong> We ranked IPs associated with  spam blogs based on their frequency of pings, and located them using <a href="http://ws.arin.net/cgi-bin/whois.pl">ARIN</a>. <center></p>
<table>
<tr>
<td style="text-align: left">1.</td>
<td style="text-align: left">Mountain View, CA</td>
</tr>
<tr>
<td style="text-align: left">2.</td>
<td style="text-align: left">Washington DC</td>
</tr>
<tr>
<td style="text-align: left">3.</td>
<td style="text-align: left">San Francisco, CA</td>
</tr>
<tr>
<td style="text-align: left">4.</td>
<td style="text-align: left">Orlando, FL</td>
</tr>
<tr>
<td style="text-align: left">5.</td>
<td style="text-align: left">Lansing, MI</td>
</tr>
</table>
<p></center>Blogspot hosts the highest number of splogs, but we also found that most of the other top hosts where physically hosted in the US. Perhaps <a href="http://www.plagiarismtoday.com/">Jonathan Bailey</a> knows more about the legal ramifications.</p>
<p><strong>(vi) Content on .info domain continues to be a problem. </strong>99.75% of all blogs hosted on these domains are spam. In other words 1.65 Million blogs were spam as opposed to only around 4K authentic blogs! As long these domains are cheap and keyword rich this trend is likely to continue. Sploggers are also exploiting private domain registration services (<a title="see this" href="http://ebiquity.umbc.edu/blogger/2006/10/23/spam-on-anti-spam-services-private-domain-registration/">see here</a>).</p>
<p><strong>(vii) High PPC contexts remain the primary motivation to spam. </strong> We identified the top keywords associated with spam blogs and generated a tag cloud using keyword frequency.</p>
<p><center></p>
<table width="450">
<tr>
<td><font size="2"><a href="http://del.icio.us/tag/adult">*****</a></font>  <font size="2"><a href="http://del.icio.us/tag/auto">auto</a></font>  <font size="3"><a href="http://del.icio.us/tag/big">big</a></font>  <font size="2"><a href="http://del.icio.us/tag/buy">buy</a></font>  <font size="2"><a href="http://del.icio.us/tag/california">california</a></font>  <font size="2"><a href="http://del.icio.us/tag/cancer">cancer</a></font>  <font size="3"><a href="http://del.icio.us/tag/card">card</a></font>  <font size="2"><a href="http://del.icio.us/tag/casino">casino</a></font>  <font size="2"><a href="http://del.icio.us/tag/cheap">cheap</a></font>  <font size="2"><a href="http://del.icio.us/tag/college">college</a></font>  <font size="2"><a href="http://del.icio.us/tag/consolidation">consolidation</a></font>  <font size="3"><a href="http://del.icio.us/tag/credit">credit</a></font>  <font size="2"><a href="http://del.icio.us/tag/debt">debt</a></font>  <font size="2"><a href="http://del.icio.us/tag/diet">diet</a></font>  <font size="2"><a href="http://del.icio.us/tag/digital">digital</a></font>  <font size="2"><a href="http://del.icio.us/tag/discount">discount</a></font>  <font size="2"><a href="http://del.icio.us/tag/dvd">dvd</a></font>  <font size="2"><a href="http://del.icio.us/tag/equipment">equipment</a></font>  <font size="3"><a href="http://del.icio.us/tag/estate">estate</a></font>  <font size="2"><a href="http://del.icio.us/tag/finance">finance</a></font>  <font size="2"><a href="http://del.icio.us/tag/florida">florida</a></font>  <font size="2"><a href="http://del.icio.us/tag/forex">forex</a></font>  <font size="6"><a href="http://del.icio.us/tag/free">free</a></font>  <font size="2"><a href="http://del.icio.us/tag/furniture">furniture</a></font>  <font size="2"><a href="http://del.icio.us/tag/gift">gift</a></font>  <font size="2"><a href="http://del.icio.us/tag/girls">girls</a></font>  <font size="2"><a href="http://del.icio.us/tag/golf">golf</a></font>  <font size="3"><a href="http://del.icio.us/tag/health">health</a></font>  <font size="2"><a href="http://del.icio.us/tag/hotel">hotel</a></font>  <font size="3"><a href="http://del.icio.us/tag/info">info</a></font>  <font size="4"><a href="http://del.icio.us/tag/insurance">insurance</a></font>  <font size="2"><a href="http://del.icio.us/tag/jewelry">jewelry</a></font>  <font size="2"><a href="http://del.icio.us/tag/lawyer">lawyer</a></font>  <font size="3"><a href="http://del.icio.us/tag/loan">loan</a></font>  <font size="2"><a href="http://del.icio.us/tag/loans">loans</a></font>  <font size="2"><a href="http://del.icio.us/tag/medical">medical</a></font>  <font size="3"><a href="http://del.icio.us/tag/money">money</a></font>  <font size="3"><a href="http://del.icio.us/tag/mortgage">mortgage</a></font>  <font size="11"><a href="http://del.icio.us/tag/new">new</a></font>  <font size="5"><a href="http://del.icio.us/tag/online">online</a></font>  <font size="3"><a href="http://del.icio.us/tag/phone">phone</a></font>  <font size="2"><a href="http://del.icio.us/tag/poker">poker</a></font>  <font size="2"><a href="http://del.icio.us/tag/rental">rental</a></font>  <font size="2"><a href="http://del.icio.us/tag/sale">sale</a></font>  <font size="2"><a href="http://del.icio.us/tag/school">school</a></font>  <font size="3"><a href="http://del.icio.us/tag/sex">***</a></font>  <font size="2"><a href="http://del.icio.us/tag/small">small</a></font>  <font size="3"><a href="http://del.icio.us/tag/software">software</a></font>  <font size="2"><a href="http://del.icio.us/tag/texas">texas</a></font>  <font size="2"><a href="http://del.icio.us/tag/tits">****</a></font>  <font size="2"><a href="http://del.icio.us/tag/trading">trading</a></font>  <font size="3"><a href="http://del.icio.us/tag/travel">travel</a></font>  <font size="2"><a href="http://del.icio.us/tag/used">used</a></font>  <font size="2"><a href="http://del.icio.us/tag/vacation">vacation</a></font>  <font size="3"><a href="http://del.icio.us/tag/video">video</a></font>  <font size="2"><a href="http://del.icio.us/tag/wedding">wedding</a></font></td>
</tr>
</table>
<p></center> We link these keywords to del.icio.us to depict an emerging problem that is quickly becoming serious. We <a title="posted on this earlier" href="http://ebiquity.umbc.edu/blogger/2007/01/24/tag-spam-on-the-rise/">posted on this recently</a>, though references date to quite a while back. [See related tag spam notes on <a href="http://www.micropersuasion.com/2005/07/yahoo_myweb_bec.html">MyWeb</a>, <a href="http://gigaom.com/2005/08/02/the-dark-side-of-technorati-tags/">Technorati</a> and <a href="http://many.corante.com/archives/2005/02/01/tags_run_amok.php">del.icio.us</a>]</p>
<p>We will continue our effort on tackling spam. Our ongoing research on spam is catalogued in our <a href="http://ebiquity.umbc.edu/tag/splog">tagged splog resources</a>, or better still check out our <a href="http://www.icwsm.org/tutorials.html#spam">tutorial at ICWSM</a> this March!</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/02/01/pings-spings-splogs-and-the-splogosphere-2007-updates/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Tag Spam on the Rise</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/01/24/tag-spam-on-the-rise/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/01/24/tag-spam-on-the-rise/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=950</guid>
		<description><![CDATA[I subscribe and follow keywords of interest through RSS feeds, both on blog search engines and bookmarking tools. Though splogs have always been a problem, lately I have noticed increasing spam in bookmarking tools. What do we call it &#8212; b00kmarks? (read zero, zero)
In the more popular one&#8217;s (like del.icio.us) the LONG TAIL is highly [...]]]></description>
			<content:encoded><![CDATA[<p>I subscribe and follow keywords of interest through RSS feeds, both on blog search engines and bookmarking tools. Though splogs have always been a problem, lately I have noticed increasing spam in bookmarking tools. What do we call it &#8212; b00kmarks? (read zero, zero)</p>
<p>In the more popular one&#8217;s (like del.icio.us) the LONG TAIL is highly compromised (<a href="http://del.icio.us/tag/flickr">1</a>, <a href="http://del.icio.us/tag/mesothelioma">2</a>), while in the less popular even the HEAD seems to have problems. The availability of many ready to use <a href="http://www.google.com/search?hl=en&#038;lr=&#038;q=%22tag+and+ping%22&#038;btnG=Search">&#8220;tag and ping&#8221;</a> tools is making things worse.</p>
<p><center> <img src="http://ebiquity.umbc.edu/blogger/wp-content/uploads/2007/02/furl_spam.bmp" /> </center>While my immediate response is to unsubscribe, being researchers we will of course investigate this further.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/01/24/tag-spam-on-the-rise/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Splogs in the Non-English Blogosphere</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/01/20/splogs-in-the-non-english-blogosphere/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/01/20/splogs-in-the-non-english-blogosphere/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=947</guid>
		<description><![CDATA[We are conducting research on the nature and seriousness of the splog problem in the non-English   blogosphere. As contextual advertisements and affiliate marketing become more profitable in these other   languages, splogs are bound to infiltrate and pollute them. We suspect its already beginning to, in a limited way, and are interested [...]]]></description>
			<content:encoded><![CDATA[<p>We are conducting research on the nature and seriousness of the splog problem in the non-English   blogosphere. As contextual advertisements and affiliate marketing become more profitable in these other   languages, splogs are bound to infiltrate and pollute them. We suspect its already beginning to, in a limited way, and are interested in studying them.</p>
<p>From the research community, the only work related to non-English splogs is:</p>
<blockquote><p><a href="http://www.blogpulse.com/www2006-workshop/papers/detecting-blog-spam.pdf">Detecting Blog Spams using   the Vocabulary Size of All Substrings in Their Copies</a>, by Kazuyuki Narisawa, Yasuhiro Yamada, Daisuke Ikeda   and Masayuki Takeda<br />Most of this work is however based on synthetic data, not actual splogs.</p>
</blockquote>
<p>We have also made attempts to see the existence of splogs by querying blog search engines using <a href="http://google.com/translate_t">translated</a> spammy(profitable) advertising contexts like <strong>insurance</strong>, <strong>vacations</strong>, <strong>loans</strong> etc. Cultural differences indicate that this might not be the way to go. True, we   haven&#8217;t come across many splogs.</p>
<p>This is what prompts us to seek suggestions from the blogging and research community. If you know of this problem, have seen splogs in other languages, know of spammy non-English advertising contexts, and   would like to contribute or collaborate please send <a href="http://ebiquity.umbc.edu/person/html/Pranam/Kolari/">either</a> of <a href="http://ebiquity.umbc.edu/person/html/Tim/Finin/">us</a> a note or comment below.</p>
<p>UPDATE: For our readers, Is this a splog in Japanese? <a rel="nofollow" href="http://diet.newstanding.com/hcm/vpb/">http://diet.newstanding.com/hcm/vpb/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/01/20/splogs-in-the-non-english-blogosphere/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pew on Social Network Usage</title>
		<link>http://ebiquity.umbc.edu/blogger/2007/01/07/pew-on-social-network-usage/</link>
		<comments>http://ebiquity.umbc.edu/blogger/2007/01/07/pew-on-social-network-usage/#comments</comments>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<dc:creator>Pranam Kolari</dc:creator>
		
		<guid isPermaLink="false">http://ebiquity.umbc.edu/blogger/?p=932</guid>
		<description><![CDATA[Pew Internet just released a survey on social network usage among American teens. While confirming the known,  (i) MySpace is the most visited site (ii) girls are more active than boys, the report goes on to say that &#8220;74% of respondents post comments on friend&#8217;s blog&#8221;. I would have loved to see numbers on [...]]]></description>
			<content:encoded><![CDATA[<p>Pew Internet just released a <a href="http://www.pewinternet.org/PPF/r/198/report_display.asp">survey</a> on social network usage among American teens. While confirming the known,  (i) MySpace is the most visited site (ii) girls are more active than boys, the report goes on to say that &#8220;74% of respondents post comments on friend&#8217;s blog&#8221;. I would have loved to see numbers on &#8220;x% of respondents have created or plan to create blog posts&#8221;.</p>
<p>MySpace has been a major contributor to the growth of blogs recently, and now forms a major chunk of a blog search result. The last we checked MySpace blogs were contributing 15% to 20%  of all pings to <a href="http://weblogs.com/">weblogs.com</a>, with numbers rising by the day.</p>
]]></content:encoded>
			<wfw:commentRss>http://ebiquity.umbc.edu/blogger/2007/01/07/pew-on-social-network-usage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
