AOL search data
Encyclopedia
The AOL search data leak was the release of detailed search logs by AOL
AOL
AOL Inc. is an American global Internet services and media company. AOL is headquartered at 770 Broadway in New York. Founded in 1983 as Control Video Corporation, it has franchised its services to companies in several nations around the world or set up international versions of its services...

 of a large number of AOL users. The release was intentional and intended for a research purposes; however, the public release meant that the entire Internet could see the results, rather than a select number of academics. AOL did not redact any information, causing privacy concerns since users could potentially be identified by their searches.

Background

On August 4, 2006, AOL Research, headed by Dr. Abdur Chowdhury, released a compressed text file on one of its websites containing twenty million search keywords for over 650,000 users over a 3-month period, intended for research purposes. AOL pulled the file from public access by the 7th, but not before it had been mirrored and distributed on the Internet.

AOL themselves did not identify users in the report, however; personally identifiable information
Personally identifiable information
Personally Identifiable Information , as used in information security, is information that can be used to uniquely identify, contact, or locate a single person or can be used with other sources to uniquely identify a single individual...

 was present in many of the queries, and as the queries were attributed by AOL to particular user accounts, identified numerically, an individual could be identified and matched to their account and search history by such information.
The New York Times was able to locate an individual from the released and anonymized search records by cross referencing them with phonebook listings.
Consequently, the ethical implications of using this data for research are under debate.

AOL acknowledged it was a mistake and removed the data, although the files can still be downloaded from mirror sites.

Lawsuits

In September 2006, a class action lawsuit was filed against AOL in the U.S. District Court for the Northern District of California.
"The lawsuit accuses AOL of violating the Electronic Communications Privacy Act and of fraudulent and deceptive business practices, among other claims, and seeks at least $5,000 for every person whose search data was exposed."

Notable users

Although the searchers were only identified by a numeric ID, some people's search results have become notable due to various reasons.

Thelma Arnold

The New York Times successfully discovered the identity of several searchers, and with her permission, exposed search number 4417749 as Thelma Arnold, a 62-year-old Georgian widow. This privacy breach was widely reported, and led to the resignation of AOL's CTO, Maureen Govern
Maureen Govern
Maureen Govern was Chief Technology Officer at AOL at the time of their search data release. The researcher who released the data and his supervisor were reportedly fired....

, on August 21, 2006. The media quoted an insider as saying that two employees had been fired: the researcher who released the data, and his immediate supervisor, who reported to Govern.

User 927

One product of the AOL scandal was the proliferation of blog entries examining the exposed data. Certain users' search logs were identified as humorous, disturbing, or even dangerous.

Consumer watchdog website The Consumerist
Consumerist (blog)
The Consumerist is a consumer affairs blog owned by Consumer Reports with posts provided by regular daily contributors. The blog's focus is on consumerism and consumers' experiences and issues with companies and corporations, concentrating mostly on U.S. consumers...

posted a blog entry by editor Ben Popken identifying the anonymous user number 927 as having an especially chilling search history.
The blog posting has since been viewed nearly 4,000 times and referenced on a number of other high-profile sites. In addition to sparking the interest of the Internet community, User 927 inspired a theatrical production, written by Katharine Clark Gray in Philadelphia. The play, also named User 927, has since been cited on several of the same blogs that originally discovered the real user's existence.

User 711391

As time has passed, more artistic renderings of individual user logs have appeared. A series of movies on the web site Minimovies.org called "I Love Alaska" puts to voice and imagery to user 711391 which the authors have labeled as "an episodic documentary"

Criticism in the Media

In January 2007, Business 2.0 Magazine on CNNMoney ranked the release of the search data #57 in a segment called "101 Dumbest Moments in Business."

External links

  • http://www.gregsadetsky.com/aol-data/ — List of sites where the complete data can be downloaded.
  • http://www.aolstalker.com/ — Search keywords and users. Tag users and search tags, also features funniest users list.
  • http://aolscandal.com/ - Analysis and discussion of leaked AOL search data.
The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK