AOL Research releases Web search engine datasets
By Tim Finin on Saturday, August 5th, 2006 at 1:00 pm

AOL Research has released some interesting data collections, including:
- 20K hand-labeled, classified queries
- 3.5M web Q/A queries (who, what, where, when …)
- Query streams for 500K users over three months (20M queries)
- Query arrival rates for queuing analysis
- 2M queries against US Government domains
Additional datasets are promised in the future.
A paper describing some measurements over this (or related?) data is available: A Picture of Search by G. Pass, A. Chowdhury and C. Torgeson.
