AOL Hits Rock Bottom AND Pisses Off Google

August 7, 2006 · 17 comments

Usually after hitting rock bottom once, a company learns from its actions and fixes things. Apparently this is not the case with AOL. Earlier yesterday they released a 439MB file (~2GB uncompressed) of approximately 20 million search queries collected from about 650k unique users over roughly 3 months. The AOL research site (the relevant page of which has been taken down) claimed that the data was to be used to..

The goal of this collection is to provide real query log data that is based on real users. It could be used for personalization, query reformulation or other types of search research.

Now the story gets interesting when you realize that AOL search is simply rebranded Google search. You can imagine what kind of a hissy-fit Google will get into now that sensitive information such as their most powerful keywords are thriving on P2P communities at this very moment. With the aforementioned file in hand, an SEO expert can find out which keywords work best and pay well for services like Google AdSense and Google AdWords. Now when a spammer studies this file, they will be having a field day. As one site put it, “Google is gonna get mega spammed.” Follow the rest of the blogosphere’s reaction on Techmeme.

The utter stupidity of this is staggering. AOL has released very private data about its users without their permission. While the AOL username has been changed to a random ID number, the abilitiy to analyze all searches by a single user will often lead people to easily determine who the user is, and what they are up to. The data includes personal names, addresses, social security numbers and everything else someone might type into a search box.
Techcrunch

{ 5 trackbacks }

AOL Gate: Search Query Data Scandal by Elliott Back
August 7, 2006 at 4:12 am
Cadeautje van AOL aan spammers | KennethVerburg.nl - Information Engineer in het Wild
August 7, 2006 at 4:24 am
seoluv / seoluv told you so
August 9, 2006 at 10:59 am
NhanTran » Blog Archive » AOL Gate: Search Query Data Scandal
August 17, 2006 at 4:03 pm
Will Sullivan's Journerdism - Online journalism, multimedia, web design, media changes and all things nerdy.
July 26, 2007 at 2:10 am

{ 12 comments… read them below or add one }

1 MJ August 7, 2006 at 6:08 am

What can you say? AOL is and will always stay my most hated ISP.
Did you know AOL dishes out US-IPs in Germany?
I guess I’ve got some SEO to do on my site.

Reply

2 Chris Morrell August 7, 2006 at 6:36 am

When I woke up and saw this hitting the headlines I couldn’t believe what I was reading. Just another reason to boycott AOL and switch to a proper broadband service.

Reply   More from author

3 Frother August 7, 2006 at 10:15 am

I couldn’t believe what I was reading too…

Reply   More from author

4 AOLSearchLogs.com August 7, 2006 at 10:44 am

AOL really broke the trust with its members.

Reply   More from author

5 Alan August 7, 2006 at 10:57 am

Is AOL on drugs again? They’ve really done it now. Looks like I got some SEO myself to do. :P Somebody’s gotta take advantage of these.

Reply   More from author

6 Brian Pinard August 7, 2006 at 5:06 pm

How low can you go….

Reply   More from author

7 Brian Pinard August 7, 2006 at 5:15 pm

Why is AOL still around, anyway?

Reply   More from author

8 Guillaumeb August 7, 2006 at 5:25 pm

Nice write-up, did you puposedly not link to a backup of the file? I’m just wondering as other tech sites, especially Techcrunch, were called irreponsible by the blogsphere.
Also you might be able by this other AOL (bad, really shockng ) news…from STLToday.com http://tinyurl.com/z2nku

Reply   More from author

9 Paul Stamatiou August 7, 2006 at 5:28 pm

I didn’t link to it for the same reason I don’t link to piracy and bittorrent sites. In case something goes down, I don’t want to have linked to it. =)

Reply   More from author

10 bird_flu_bob August 7, 2006 at 7:14 pm

I was able to grab this download. Does anyone have any good ideas or even websites that have popped up on the best ways to analyze this data? So far I have just come up with simple things like filtering by url and looking at search terms resulting in clicks for that url.

Reply

11 Huffers August 8, 2006 at 6:48 am

I think datamining just went mainstream

Reply

12 Montoya August 9, 2006 at 1:27 pm

Lesson to be learned: when you give your money to stupid companies, you enable them to do stupid things.

I’m not going to touch this log file; it’s unethical and anyone who does look at it should be ashamed of themselves.

Reply   More from author

Leave a Comment

You can use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Previous post: 123MacBook.com is LIVE!

Next post: WWDC – Apple Unveils Mac Pro, Leopard Sneak Peek