UMBC ebiquity research group Building intelligent systems in open, heterogeneous, dynamic, distributed environments
05 July 2008, 09:35:46 EDT  
AOL research releases Web search engine datasets

AOL research releases Web search engine datasets

By Tim Finin on Saturday, August 5th, 2006 at 1:00 pm.

AOL Research has released some interesting data collections, including:

  • 20K hand labeled, classified queries
  • 3.5M web Q/A queries (who, what, where, when …)
  • Query streams for 500K users over three months (20M queries)
  • Query arrival rates for queuing analysis
  • 2M queries against US Government domains

Additional datasets are promised in the future.

A paper describing some measurements over this (or related?) data is available: A Picture of Search by G. Pass, A. Chawdry and C. Torgeson.

Related posts: • Does AOL search data compromise privacy?;  • Analyzing AOL search data shows click through rates for search rank;  • An RDF crawler;  

 

 

Leave a Reply






UMBC