UMBC ebiquity research group Building intelligent systems in open, heterogeneous, dynamic, distributed environments
16 May 2008, 07:03:26 EDT  
An SEO dreams of billions and billions of spam pages

An SEO dreams of billions and billions of spam pages

By Tim Finin on Sunday, June 18th, 2006 at 1:00 pm.

A post from an SEO site claims that web spammers are using techniques to quickly get billions of new pages indexed by Google. I find it hard to believe. Google’s estimated number of results is often much higher than the actual number. This was discussed last Fall when search engines were arguing about who was bigger. While Google no longer reports on the size of their index, educated guesses from last Fall were around 25M. So if a spam site was able over the course of 18 days to add 5B pages, all from the same domain, I think Google would have noticed.

I did notice two interesting this from this post. First, this SEO site’s guess about the technique suggests using blog comment spam as the method to get external links to the web spam pages. Second, the site discussed in the post eiqz2q.org seems to generate content for all of its sub-domains via a simple call to its own specialized search engine at t1ps2see.com.

Related posts: • The State of Blogger (is it Splogger?);  • No spam on Twitter?!;  • BigOWLIM reasons over billions of RDF triples;  

 

 

Leave a Reply

Recent posts

  • Students: brand yourself with a blog
  • Social Data on the Web workshop at ISWC 2008
  • Petrini: Streaming Applications on the Cell BE Processor, 3pm 5/13 UMBC
  • Gossip-Based Outlier Detection for Mobile Ad Hoc Networks
  • Int. Conf. Semantic Web deadlines this week and next (ISWC 2008)

  • Ebiquity community

  • Fieldmarking data blog
  • Geospatial Semantic Web
  • Harry Chen thinks aloud
  • Planet social media research
  • Social media research blog
  • TrackForward by Kolari
  • UMBC GAIM

  • UMBC