UMBC ebiquity
AOL research releases Web search engine datasets

Tim Finin, 1:00pm 5 August 2006

AOL Research has released some interesting data collections, including:

  • 20K hand-labeled, classified queries
  • 3.5M web Q/A queries (who, what, where, when …)
  • Query streams for 500K users over three months (20M queries)
  • Query arrival rates for queuing analysis
  • 2M queries against US Government domains

Additional datasets are promised in the future.

A paper reporting some measurements over this (or related?) data is available: "A Picture of Search" by G. Pass, A. Chowdhury and C. Torgeson.
