All Topics  
Web search engine

 

   Email Print
   Bookmark   Link






 

Web search engine



 
 
A Web search engine is a tool designed to search for information on the World Wide Web
World Wide Web

The World Wide Web is a very large set of interlinked hypertext documents accessed via the Internet. With a Web browser, one can view Web pages that may contain writing, s, videos, and other multimedia and navigate between them using hyperlinks....
. The search results are usually presented in a list and are commonly called hits. The information may consist of web page
Web page

A web page or webpage is a resource of information that is suitable for the World Wide Web and can be accessed through a web browser.This information is usually in HyperText Markup Language or eXtensible HyperText Markup Language format, and may provide Navigation bar to other web pages via hypertext Hyperlink....
s, images, information and other types of files. Some search engines also mine data available in newsbooks, databases, or open directories
Web directory

A web directory or link directory is a directory on the World Wide Web. It specializes in hyperlink to other web sites and Categorization those links....
. Unlike Web directories, which are maintained by human editors, search engines operate algorithmically or are a mixture of algorithmic and human input.

le class="bordered infobox">
Timeline (full list
List of search engines

This is a list of Wikipedia articles about search engines, including web search engines, selection-based search engines, metasearch engines, desktop search tools, and web portals and vertical market websites that have a search facility for online databases....
)
YearEngineEvent
1993 Aliweb
Aliweb

ALIWEB can be considered the first Web search engine, as its predecessors were either built with different purposes or were literally just indexers ....
Launch
JumpStation
JumpStation

JumpStation was the first Web search engine that behaved, and appeared to the user, the way current web search engines do. It started indexing on Sunday 12th December 1993 and was announced on the Mosaic "What's New" webpage on 21st December 1993....
Launch
1994 WebCrawler
WebCrawler

WebCrawler is a metasearch engine that blends the top search results from Google, Yahoo!, Live Search , Ask.com, About.com, MIVA, LookSmart and other popular search engines....
Launch
Infoseek
Infoseek

Infoseek was a very popular search engine founded in 1994 by Steve Kirsch. It was also known as "big yellow".It was bought by The Walt Disney Company in 1998, and the technology was merged with that of the Disney-acquired Starwave to form the Go.com network....
Launch
Lycos
Lycos

Lycos is a Web search engine and web portal with broadband entertainment content....
Launch
1995 AltaVista
AltaVista

AltaVista is an Internet search engine company , and that company's search engine product....
Launch
Open Text
Open text

In semiotic analysis, an open text is a text that allows multiple or mediated Hermeneutics by the readers. In contrast, a closed text leads the reader to one intended interpretation....
 Web Index
Launch
Magellan
Magellan

Magellan may refer to:People*Ferdinand Magellan, Portuguese explorer who led the first expedition around the world.Geography*The Strait of Magellan....
Launch
Excite
Excite

Excite is an Internet Web portal, and as one of the "Dot-com companys" of the 1990s , it was once one of the most recognized brands on the Internet....
Launch
SAPO
SAPO

SAPO , Servidor de Apontadores Portugueses, is a brand and subsidiary company of Portugal Telecom Group. It is a Portugal internet service provider that started being a search engine when founded in 1995....
Launch
1996 Dogpile
Dogpile

Dogpile is a metasearch engine that fetches results from Google, Yahoo!, Live Search, Ask.com, About.com, MIVA, LookSmart and several other popular search engines, including those from audio and video content providers....
Launch
Inktomi
Inktomi

Inktomi Corporation was a California company that provided software for Internet service providers. It was founded in 1996 by UC Berkeley professor Eric Brewer and graduate student Paul Gauthier ....
Founded
HotBot
HotBot

HotBot is one of the early Internet search engines and was launched in May 1996 as a service of Wired Magazine. It was launched using a "new links" strategy of marketing, claiming to update its search database more often than its competitors....
Founded
Ask Jeeves Founded
1997 Northern Light
Northern Light Group

Northern Light Group, LLC is a company specializing in strategic research portals, enterprise search technology, and text analytics solutions. The company provides custom, hosted, turnkey solutions for its clients....
Launch
Yandex
Yandex

Yandex is a Russian search engine and the largest Russian Web portal. Yandex was launched in 1997. Its name can be explained as "Yet Another iNDEXer" or "????????? Index"....
Launch
1998 Google
Google search

Google search is a Web search engine owned by Google, and is the most used search engine on the World Wide Web. Google receives several hundred million queries each day through its various services....
Launch
1999 AlltheWeb
AlltheWeb

AlltheWeb is an Internet search engine that debuted in mid-1999. It grew out from FTP Search, Tor Egge's doctorate thesis at the Norwegian University of Science and Technology, which he started on in 1994, which in turn resulted in the formation of Fast Search and Transfer established on July 16, 1997....
Launch
GenieKnows
GenieKnows

GenieKnows is a division of IT Interactive Services Inc., a privately owned vertical search search engine company based in Halifax Regional Municipality, Nova Scotia....
Founded
Naver Launch
Teoma
Teoma

Teomo, pronounced tay-a-mo, was an Internet search engine founded in 2000 by Professor Apostolos Gerasoulis and his colleagues at Rutgers University in New Jersey....
Founded
Vivisimo
Vivísimo

Viv?simo is a privately held enterprise search software company in Pittsburgh, Pennsylvania that develops and sells software products to improve search on the web and in enterprises....
Founded
2000 Baidu
Baidu

Baidu is the leading Chinese language search engine for websites, audio files, and images. Baidu offers 57 search and community services including an online collaboratively-built encyclopedia , and a searchable keyword-based discussion forum....
Founded
2003 Info.com
Info.com

Info.com is a metasearch engine which provides results from leading search engines and pay-per-click directories, including Google, Yahoo!, MSN Search, Ask.com , LookSmart, About.com and Open Directory Project ....
Launch
2004 Yahoo! Search
Yahoo! Search

Yahoo! Search is a search engine, owned by Yahoo! and is currently the second largest search engine on the web, after its competitor Google. Originally Yahoo! Search started as a web directory of other websites, organized in a hierarchy, as opposed to a searchable index of pages....
Final launch
A9.com
A9.com

A9.com is a subsidiary of Amazon.com based in Palo Alto, California that develops search engine technology. A9 currently has over 100 employees in offices its Palo Alto, Bangalore, and Dublin offices....
Launch
Sogou
Sogou

Sogou is a Chinese language search engine which can search text, images, music, and maps. It was launched 4 August 2004 and is owned by Sohu, SoGou means "Search Dog" in Chinese....
Launch
2005 MSN Search Final launch
Ask.com
Ask.com

Ask.com is a web search engine started in 1996 by Garrett Gruener and David Warthen in Berkeley, California. The original software was implemented by Gary Chevsky from his own design....
Launch
GoodSearch
GoodSearch

GoodSearch is a Yahoo-powered search engine that donates 50% of its revenue, about a penny per search, to listed American charities and schools designated by its users....
Launch
2006 wikiseek
Wikiseek

Wikiseek was a search engine that indexed Wikipedia pages and pages that were linked to from Wikipedia articles. The search engine was founded by Palo Alto based internet startup SearchMe and was officially launched on January 17, 2007....
Founded
Quaero
Quaero

Quaero is a European research and development program which has the goal of developing multimedia and multilingual indexing and management tools for professional and general public applications ....
Founded
Ask.com
Ask.com

Ask.com is a web search engine started in 1996 by Garrett Gruener and David Warthen in Berkeley, California. The original software was implemented by Gary Chevsky from his own design....
Launch
Live Search Launch
ChaCha
ChaCha (search engine)

ChaCha is a mobile answering service which uses a technique known as social searching. ChaCha was created by Scott A. Jones and Brad Bostic. The company is based in Carmel, Indiana, a suburb of Indianapolis....
Beta Launch
Guruji.com
Guruji.com

Guruji.com is an Indian Internet search engine that is focused on providing better search results to Indian consumers, by leveraging proprietary algorithms and data in the Indian context....
Beta Launch
2007 wikiseek
Wikiseek

Wikiseek was a search engine that indexed Wikipedia pages and pages that were linked to from Wikipedia articles. The search engine was founded by Palo Alto based internet startup SearchMe and was officially launched on January 17, 2007....
Launched
Wikia Search
Wikia Search

Wikia Search is a free content and open-source Web search engine and a part of Wikia operated by Wikia, Inc., a for-profit company founded in late 2004 by Jimmy Wales and Angela Beesley....
Launched
2008 Powerset
Powerset (company)

Powerset is a company based in San Francisco, California that is developing a natural language search engine for the Internet.Powerset is working on building a natural language search engine that can find targeted answers to user questions ....
Launched
Viewzi
Viewzi

Viewzi is a Web search engine company based in Dallas, Texas that is developing a highly visual experience that tailors the way users look at information based on what they are looking for ....
Launched
Cuil
Cuil

Cuil is a search engine that organizes web pages by content and displays relatively long entries along with thumbnail pictures for many results....
Launched
Boogami
Boogami

Boogami is a search engine that was developed by James Wildish, a sixteen year old college student from Kent in United Kingdom. It combines a search engine with a pixel advertising grid that appears every time someone uses Boogami to search the Internet, and for the fact that it offers free pixel advertising to charitable organisation....
Launched
LeapFish
LeapFish

LeapFish is a search aggregator that retrieves results from other portals and search engines, including Google, Yahoo, Live Search, Blogs, Videos etc.......
Beta Launch
Musu
Musu

MUSU may refer to* University of Melbourne Student Union, one of several student organisations at the University of Melbourne, Australia* University of Manchester Students' Union, student organisation of the University of Manchester, England...
Beta Launch
VADLO
VADLO

VADLO is a life sciences search engine, privately owned by Life in Research, LLC., based in Chicago, USA. VADLO caters to life sciences and biomedical researchers, educators, students, clinicians and reference librarians....
Launch


Before there were web search engines there was a complete list of all webservers.






Discussion
Ask a question about 'Web search engine'
Start a new discussion about 'Web search engine'
Answer questions from other users
Full Discussion Forum



Encyclopedia


A Web search engine is a tool designed to search for information on the World Wide Web
World Wide Web

The World Wide Web is a very large set of interlinked hypertext documents accessed via the Internet. With a Web browser, one can view Web pages that may contain writing, s, videos, and other multimedia and navigate between them using hyperlinks....
. The search results are usually presented in a list and are commonly called hits. The information may consist of web page
Web page

A web page or webpage is a resource of information that is suitable for the World Wide Web and can be accessed through a web browser.This information is usually in HyperText Markup Language or eXtensible HyperText Markup Language format, and may provide Navigation bar to other web pages via hypertext Hyperlink....
s, images, information and other types of files. Some search engines also mine data available in newsbooks, databases, or open directories
Web directory

A web directory or link directory is a directory on the World Wide Web. It specializes in hyperlink to other web sites and Categorization those links....
. Unlike Web directories, which are maintained by human editors, search engines operate algorithmically or are a mixture of algorithmic and human input.

History

Timeline (full list
List of search engines

This is a list of Wikipedia articles about search engines, including web search engines, selection-based search engines, metasearch engines, desktop search tools, and web portals and vertical market websites that have a search facility for online databases....
)
YearEngineEvent
1993 Aliweb
Aliweb

ALIWEB can be considered the first Web search engine, as its predecessors were either built with different purposes or were literally just indexers ....
Launch
JumpStation
JumpStation

JumpStation was the first Web search engine that behaved, and appeared to the user, the way current web search engines do. It started indexing on Sunday 12th December 1993 and was announced on the Mosaic "What's New" webpage on 21st December 1993....
Launch
1994 WebCrawler
WebCrawler

WebCrawler is a metasearch engine that blends the top search results from Google, Yahoo!, Live Search , Ask.com, About.com, MIVA, LookSmart and other popular search engines....
Launch
Infoseek
Infoseek

Infoseek was a very popular search engine founded in 1994 by Steve Kirsch. It was also known as "big yellow".It was bought by The Walt Disney Company in 1998, and the technology was merged with that of the Disney-acquired Starwave to form the Go.com network....
Launch
Lycos
Lycos

Lycos is a Web search engine and web portal with broadband entertainment content....
Launch
1995 AltaVista
AltaVista

AltaVista is an Internet search engine company , and that company's search engine product....
Launch
Open Text
Open text

In semiotic analysis, an open text is a text that allows multiple or mediated Hermeneutics by the readers. In contrast, a closed text leads the reader to one intended interpretation....
 Web Index
Launch
Magellan
Magellan

Magellan may refer to:People*Ferdinand Magellan, Portuguese explorer who led the first expedition around the world.Geography*The Strait of Magellan....
Launch
Excite
Excite

Excite is an Internet Web portal, and as one of the "Dot-com companys" of the 1990s , it was once one of the most recognized brands on the Internet....
Launch
SAPO
SAPO

SAPO , Servidor de Apontadores Portugueses, is a brand and subsidiary company of Portugal Telecom Group. It is a Portugal internet service provider that started being a search engine when founded in 1995....
Launch
1996 Dogpile
Dogpile

Dogpile is a metasearch engine that fetches results from Google, Yahoo!, Live Search, Ask.com, About.com, MIVA, LookSmart and several other popular search engines, including those from audio and video content providers....
Launch
Inktomi
Inktomi

Inktomi Corporation was a California company that provided software for Internet service providers. It was founded in 1996 by UC Berkeley professor Eric Brewer and graduate student Paul Gauthier ....
Founded
HotBot
HotBot

HotBot is one of the early Internet search engines and was launched in May 1996 as a service of Wired Magazine. It was launched using a "new links" strategy of marketing, claiming to update its search database more often than its competitors....
Founded
Ask Jeeves Founded
1997 Northern Light
Northern Light Group

Northern Light Group, LLC is a company specializing in strategic research portals, enterprise search technology, and text analytics solutions. The company provides custom, hosted, turnkey solutions for its clients....
Launch
Yandex
Yandex

Yandex is a Russian search engine and the largest Russian Web portal. Yandex was launched in 1997. Its name can be explained as "Yet Another iNDEXer" or "????????? Index"....
Launch
1998 Google
Google search

Google search is a Web search engine owned by Google, and is the most used search engine on the World Wide Web. Google receives several hundred million queries each day through its various services....
Launch
1999 AlltheWeb
AlltheWeb

AlltheWeb is an Internet search engine that debuted in mid-1999. It grew out from FTP Search, Tor Egge's doctorate thesis at the Norwegian University of Science and Technology, which he started on in 1994, which in turn resulted in the formation of Fast Search and Transfer established on July 16, 1997....
Launch
GenieKnows
GenieKnows

GenieKnows is a division of IT Interactive Services Inc., a privately owned vertical search search engine company based in Halifax Regional Municipality, Nova Scotia....
Founded
Naver Launch
Teoma
Teoma

Teomo, pronounced tay-a-mo, was an Internet search engine founded in 2000 by Professor Apostolos Gerasoulis and his colleagues at Rutgers University in New Jersey....
Founded
Vivisimo
Vivísimo

Viv?simo is a privately held enterprise search software company in Pittsburgh, Pennsylvania that develops and sells software products to improve search on the web and in enterprises....
Founded
2000 Baidu
Baidu

Baidu is the leading Chinese language search engine for websites, audio files, and images. Baidu offers 57 search and community services including an online collaboratively-built encyclopedia , and a searchable keyword-based discussion forum....
Founded
2003 Info.com
Info.com

Info.com is a metasearch engine which provides results from leading search engines and pay-per-click directories, including Google, Yahoo!, MSN Search, Ask.com , LookSmart, About.com and Open Directory Project ....
Launch
2004 Yahoo! Search
Yahoo! Search

Yahoo! Search is a search engine, owned by Yahoo! and is currently the second largest search engine on the web, after its competitor Google. Originally Yahoo! Search started as a web directory of other websites, organized in a hierarchy, as opposed to a searchable index of pages....
Final launch
A9.com
A9.com

A9.com is a subsidiary of Amazon.com based in Palo Alto, California that develops search engine technology. A9 currently has over 100 employees in offices its Palo Alto, Bangalore, and Dublin offices....
Launch
Sogou
Sogou

Sogou is a Chinese language search engine which can search text, images, music, and maps. It was launched 4 August 2004 and is owned by Sohu, SoGou means "Search Dog" in Chinese....
Launch
2005 MSN Search Final launch
Ask.com
Ask.com

Ask.com is a web search engine started in 1996 by Garrett Gruener and David Warthen in Berkeley, California. The original software was implemented by Gary Chevsky from his own design....
Launch
GoodSearch
GoodSearch

GoodSearch is a Yahoo-powered search engine that donates 50% of its revenue, about a penny per search, to listed American charities and schools designated by its users....
Launch
2006 wikiseek
Wikiseek

Wikiseek was a search engine that indexed Wikipedia pages and pages that were linked to from Wikipedia articles. The search engine was founded by Palo Alto based internet startup SearchMe and was officially launched on January 17, 2007....
Founded
Quaero
Quaero

Quaero is a European research and development program which has the goal of developing multimedia and multilingual indexing and management tools for professional and general public applications ....
Founded
Ask.com
Ask.com

Ask.com is a web search engine started in 1996 by Garrett Gruener and David Warthen in Berkeley, California. The original software was implemented by Gary Chevsky from his own design....
Launch
Live Search Launch
ChaCha
ChaCha (search engine)

ChaCha is a mobile answering service which uses a technique known as social searching. ChaCha was created by Scott A. Jones and Brad Bostic. The company is based in Carmel, Indiana, a suburb of Indianapolis....
Beta Launch
Guruji.com
Guruji.com

Guruji.com is an Indian Internet search engine that is focused on providing better search results to Indian consumers, by leveraging proprietary algorithms and data in the Indian context....
Beta Launch
2007 wikiseek
Wikiseek

Wikiseek was a search engine that indexed Wikipedia pages and pages that were linked to from Wikipedia articles. The search engine was founded by Palo Alto based internet startup SearchMe and was officially launched on January 17, 2007....
Launched
Wikia Search
Wikia Search

Wikia Search is a free content and open-source Web search engine and a part of Wikia operated by Wikia, Inc., a for-profit company founded in late 2004 by Jimmy Wales and Angela Beesley....
Launched
2008 Powerset
Powerset (company)

Powerset is a company based in San Francisco, California that is developing a natural language search engine for the Internet.Powerset is working on building a natural language search engine that can find targeted answers to user questions ....
Launched
Viewzi
Viewzi

Viewzi is a Web search engine company based in Dallas, Texas that is developing a highly visual experience that tailors the way users look at information based on what they are looking for ....
Launched
Cuil
Cuil

Cuil is a search engine that organizes web pages by content and displays relatively long entries along with thumbnail pictures for many results....
Launched
Boogami
Boogami

Boogami is a search engine that was developed by James Wildish, a sixteen year old college student from Kent in United Kingdom. It combines a search engine with a pixel advertising grid that appears every time someone uses Boogami to search the Internet, and for the fact that it offers free pixel advertising to charitable organisation....
Launched
LeapFish
LeapFish

LeapFish is a search aggregator that retrieves results from other portals and search engines, including Google, Yahoo, Live Search, Blogs, Videos etc.......
Beta Launch
Musu
Musu

MUSU may refer to* University of Melbourne Student Union, one of several student organisations at the University of Melbourne, Australia* University of Manchester Students' Union, student organisation of the University of Manchester, England...
Beta Launch
VADLO
VADLO

VADLO is a life sciences search engine, privately owned by Life in Research, LLC., based in Chicago, USA. VADLO caters to life sciences and biomedical researchers, educators, students, clinicians and reference librarians....
Launch


Before there were web search engines there was a complete list of all webservers. The list was edited by Tim Berners-Lee
Tim Berners-Lee

Sir Timothy John Berners-Lee, Order of Merit, Order of the British Empire, Royal Society, Royal Academy of Engineering, Royal Society of Arts is an English people computer scientist and MIT professor credited with inventing the World Wide Web....
 and hosted on the CERN webserver. One historical snapshot from 1992 remains. As more and more webservers went online the central list could not keep up. On the NCSA Site new servers were announced under the title "What's New!" but no complete listing existed any more.

The very first tool used for searching on the (pre-web) Internet was Archie
Archie search engine

Archie is a tool for indexing File Transfer Protocol archives, allowing people to find specific files. It is considered to be the first Internet Search engine ....
. The name stands for "archive" without the "v." It was created in 1990 by Alan Emtage
Alan Emtage

Alan Emtage conceived of and implemented the first version of Archie search engine, a pre-Web internet search engine for locating material in public File Transfer Protocol archives....
, a student at McGill University
McGill University

McGill University is a Public university#Canada located in Montreal, Quebec, Canada. It bears the name of James McGill, a prominent Montreal merchant from Scotland, whose bequest formed the beginning of the university....
 in Montreal. The program downloaded the directory listings of all the files located on public anonymous FTP (File Transfer Protocol
File Transfer Protocol

File Transfer Protocol is a network protocol used to transfer data from one computer to another through a network such as the Internet.FTP is a file transfer protocol for exchanging and manipulating files over a Transmission Control Protocol computer network....
) sites, creating a searchable database of file names; however, Archie did not index the contents of these sites.

The rise of Gopher (created in 1991 by Mark McCahill at the University of Minnesota
University of Minnesota

The University of Minnesota, Twin Cities is a public university research university located in Minneapolis and St. Paul, Minnesota, Minnesota, United States....
) led to two new search programs, Veronica
Veronica (computer)

Veronica is a Search engine system for the Gopher , developed in 1992 by Steven Foster and Fred Barrie at the University of Nevada, Reno.Veronica is a constantly updated database of the names of almost every menu item on thousands of Gopher servers....
 and Jughead
Jughead (computer)

Jughead is a search engine system for the Gopher . It is distinct from Veronica in that it searches a single Server at a time.Jughead is officially an acronym for Jonzy's Universal Gopher Hierarchy Excavation And Display, though it was originally chosen to match that of the File Transfer Protocol search service known as Archie search...
. Like Archie, they searched the file names and titles stored in Gopher index systems. Veronica (Very Easy Rodent-Oriented Net-wide Index to Computerized Archives) provided a keyword search of most Gopher menu titles in the entire Gopher listings. Jughead (Jonzy's Universal Gopher Hierarchy Excavation And Display) was a tool for obtaining menu information from specific Gopher servers. While the name of the search engine "Archie
Archie search engine

Archie is a tool for indexing File Transfer Protocol archives, allowing people to find specific files. It is considered to be the first Internet Search engine ....
" was not a reference to the Archie comic book
Archie Comics

Archie Comics is an United States of America comic book publisher, known for its many series featuring the fictional teenager Archie Andrews , Betty Cooper, Veronica Lodge, Reggie Mantle and Jughead Jones characters by publisher/editor John L....
 series, "Veronica
Veronica Lodge

Veronica "Ronnie" Lodge is an adolescent fictional character in the Archie Comics books series. Since the Archie characters are ageless, Lodge remains a high-school teenager after 66 years....
" and "Jughead
Jughead Jones

Forsythe Pendleton "Jughead" Jones III is a fictional character in Archie Comics, first appearing in December 1941. He is the son of Forsythe II....
" are characters in the series, thus referencing their predecessor.

The first Web search engine was an index collected in 1993 by the World Wide Web Wanderer
World Wide Web Wanderer

The World Wide Web Wanderer, also referred to as just the Wanderer, was a Perl-based web crawler that was first deployed in June 1993 to measure the size of the World Wide Web....
 called 'Wandex', Wanderer was a web crawler
Web crawler

A Web crawler is a computer program that browses the World Wide Web in a methodical, automated manner. Other terms for Web crawlers are ants, automatic indexers, bots, and worms or Web spider, Web robot, or?especially in the FOAF community?Web scutter....
 developed by Matthew Gray at MIT
Massachusetts Institute of Technology

The Massachusetts Institute of Technology is a private university research university located in Cambridge, Massachusetts, Massachusetts, United States....
 to measure the size of the World Wide Web. Another very early search engine, Aliweb
Aliweb

ALIWEB can be considered the first Web search engine, as its predecessors were either built with different purposes or were literally just indexers ....
, also appeared in 1993. JumpStation
JumpStation

JumpStation was the first Web search engine that behaved, and appeared to the user, the way current web search engines do. It started indexing on Sunday 12th December 1993 and was announced on the Mosaic "What's New" webpage on 21st December 1993....
 (released in December 1993) used a crawler to find web pages for searching and used a web form as the interface to its query program, but search was limited to the titles and headings of web pages. One of the first "full text" crawler-based search engines was WebCrawler
WebCrawler

WebCrawler is a metasearch engine that blends the top search results from Google, Yahoo!, Live Search , Ask.com, About.com, MIVA, LookSmart and other popular search engines....
, which came out in 1994. Unlike its predecessors, it let users search for any word in any webpage, which became the standard for all major search engines since. It was also the first one to be widely known by the public. Also in 1994 Lycos
Lycos

Lycos is a Web search engine and web portal with broadband entertainment content....
 (which started at Carnegie Mellon University
Carnegie Mellon University

Carnegie Mellon University is a top private university research university in Pittsburgh. Since its inception, Carnegie Mellon has grown into a world-renowned institution, with numerous programs that are frequently college and university rankings among the best in the world....
) was launched, and became a major commercial endeavor.

Soon after, many search engines appeared and vied for popularity. These included Magellan
Magellan

Magellan may refer to:People*Ferdinand Magellan, Portuguese explorer who led the first expedition around the world.Geography*The Strait of Magellan....
, Excite
Excite

Excite is an Internet Web portal, and as one of the "Dot-com companys" of the 1990s , it was once one of the most recognized brands on the Internet....
, Infoseek
Infoseek

Infoseek was a very popular search engine founded in 1994 by Steve Kirsch. It was also known as "big yellow".It was bought by The Walt Disney Company in 1998, and the technology was merged with that of the Disney-acquired Starwave to form the Go.com network....
, Inktomi
Inktomi

Inktomi Corporation was a California company that provided software for Internet service providers. It was founded in 1996 by UC Berkeley professor Eric Brewer and graduate student Paul Gauthier ....
, Northern Light
Northern Light Group

Northern Light Group, LLC is a company specializing in strategic research portals, enterprise search technology, and text analytics solutions. The company provides custom, hosted, turnkey solutions for its clients....
, and AltaVista
AltaVista

AltaVista is an Internet search engine company , and that company's search engine product....
. Yahoo!
Yahoo!

Yahoo! Inc. is an United States public company corporation with headquarters in Sunnyvale, California, , and provides Internet services worldwide....
 was among the most popular ways for people to find web pages of interest, but its search function operated on its web directory
Web directory

A web directory or link directory is a directory on the World Wide Web. It specializes in hyperlink to other web sites and Categorization those links....
, rather than full-text copies of web pages. Information seekers could also browse the directory instead of doing a keyword-based search.

In 1996, Netscape
Netscape

Netscape Communications is a United States computer services company, best known for its web browser. The browser was once dominant in terms of Usage share of web browsers, but lost most of that share to Internet Explorer during the browser wars....
 was looking to give a single search engine an exclusive deal to be their featured search engine. There was so much interest that instead a deal was struck with Netscape by 5 of the major search engines, where for $5Million per year each search engine would be in a rotation on the Netscape search engine page. These five engines were: Yahoo!
Yahoo!

Yahoo! Inc. is an United States public company corporation with headquarters in Sunnyvale, California, , and provides Internet services worldwide....
, Magellan
Magellan

Magellan may refer to:People*Ferdinand Magellan, Portuguese explorer who led the first expedition around the world.Geography*The Strait of Magellan....
, Lycos
Lycos

Lycos is a Web search engine and web portal with broadband entertainment content....
, Infoseek
Infoseek

Infoseek was a very popular search engine founded in 1994 by Steve Kirsch. It was also known as "big yellow".It was bought by The Walt Disney Company in 1998, and the technology was merged with that of the Disney-acquired Starwave to form the Go.com network....
 and Excite
Excite

Excite is an Internet Web portal, and as one of the "Dot-com companys" of the 1990s , it was once one of the most recognized brands on the Internet....
.

Search engines were also known as some of the brightest stars in the Internet investing frenzy that occurred in the late 1990s. Several companies entered the market spectacularly, receiving record gains during their initial public offering
Initial public offering

Initial public offering , also referred to simply as a "public offering" or "flotation," is when a company issues common stock or Share to the public for the first time....
s. Some have taken down their public search engine, and are marketing enterprise-only editions, such as Northern Light. Many search engine companies were caught up in the dot-com bubble
Dot-com bubble

The "dot-com bubble" was a economic bubble covering roughly 1995?2001 during which stock markets in Western world saw their value increase rapidly from growth in the new quaternary sector of industry and related fields....
, a speculation-driven market boom that peaked in 1999 and ended in 2001.

Around 2000, the Google search engine
Google search

Google search is a Web search engine owned by Google, and is the most used search engine on the World Wide Web. Google receives several hundred million queries each day through its various services....
 rose to prominence. The company achieved better results for many searches with an innovation called PageRank
PageRank

PageRank is a Network theory#link analysis algorithm used by the Google Internet search engine that assigns a numerical weighting to each element of a hyperlinked set of documents, such as the World Wide Web, with the purpose of "measuring" its relative importance within the set....
. This iterative algorithm ranks web pages based on the number and PageRank of other web sites and pages that link there, on the premise that good or desirable pages are linked to more than others. Google also maintained a minimalist interface to its search engine. In contrast, many of its competitors embedded a search engine in a web portal
Web portal

A web portal presents information from diverse sources in a unified way. Apart from the search engine standard, web portals offer other services such as e-mail, news, stock prices, infotainment, and other features....
.

By 2000, Yahoo was providing search services based on Inktomi
Inktomi

Inktomi Corporation was a California company that provided software for Internet service providers. It was founded in 1996 by UC Berkeley professor Eric Brewer and graduate student Paul Gauthier ....
's search engine. Yahoo! acquired Inktomi
Inktomi

Inktomi Corporation was a California company that provided software for Internet service providers. It was founded in 1996 by UC Berkeley professor Eric Brewer and graduate student Paul Gauthier ....
 in 2002, and Overture
Overture

Overture in music is the instrumental introduction to a dramatic, choir or, occasionally, Musical composition. During the early Romantic era, composers such as Ludwig van Beethoven and Felix Mendelssohn began to use the term to refer to instrumental, programmatic works that presaged genres such as the symphonic poem....
 (which owned AlltheWeb
AlltheWeb

AlltheWeb is an Internet search engine that debuted in mid-1999. It grew out from FTP Search, Tor Egge's doctorate thesis at the Norwegian University of Science and Technology, which he started on in 1994, which in turn resulted in the formation of Fast Search and Transfer established on July 16, 1997....
 and AltaVista
AltaVista

AltaVista is an Internet search engine company , and that company's search engine product....
) in 2003. Yahoo! switched to Google's search engine until 2004, when it launched its own search engine based on the combined technologies of its acquisitions.

Microsoft first launched MSN Search (since re-branded Live Search) in the fall of 1998 using search results from Inktomi
Inktomi

Inktomi Corporation was a California company that provided software for Internet service providers. It was founded in 1996 by UC Berkeley professor Eric Brewer and graduate student Paul Gauthier ....
. In early 1999 the site began to display listings from Looksmart
LookSmart

LookSmart is a search advertising network and management solutions company based in San Francisco. LookSmart provides search advertising products and services to text advertisers, as well as targeted pay-per-click search and contextual advertising via its Search Advertising Network....
 blended with results from Inktomi
Inktomi

Inktomi Corporation was a California company that provided software for Internet service providers. It was founded in 1996 by UC Berkeley professor Eric Brewer and graduate student Paul Gauthier ....
 except for a short time in 1999 when results from AltaVista
AltaVista

AltaVista is an Internet search engine company , and that company's search engine product....
 were used instead. In 2004, Microsoft began a transition to its own search technology, powered by its own web crawler
Web crawler

A Web crawler is a computer program that browses the World Wide Web in a methodical, automated manner. Other terms for Web crawlers are ants, automatic indexers, bots, and worms or Web spider, Web robot, or?especially in the FOAF community?Web scutter....
 (called msnbot
Msnbot

msnbot is a web-crawling robot , deployed by Microsoft to supply Live Search. It collects documents from the web to build a searchable index for the MSN Search, which went into beta in 2004, and had full public release in 2005....
).

As of late 2007, Google was by far the most popular Web search engine worldwide. A number of country-specific search engine companies have become prominent; for example Baidu
Baidu

Baidu is the leading Chinese language search engine for websites, audio files, and images. Baidu offers 57 search and community services including an online collaboratively-built encyclopedia , and a searchable keyword-based discussion forum....
 is the most popular search engine in the People's Republic of China
People's Republic of China

The People's Republic of China , commonly known as China, is the largest country in East Asia and the List of countries by population in the world with over 1.3 billion people, approximately a fifth of the world's population....
 and guruji.com
Guruji.com

Guruji.com is an Indian Internet search engine that is focused on providing better search results to Indian consumers, by leveraging proprietary algorithms and data in the Indian context....
 in India
India

India, officially the Republic of India , is a country in South Asia. It is the List of countries and outlying territories by total area country by geographical area, the List of countries by population country, and the most populous liberal democracy in the world....
.

How Web search engines work

A search engine operates, in the following order
  1. Web crawling
  2. Indexing
    Index (search engine)

    Search engine index collects, parses, and stores data to facilitate fast and accurate information retrieval. Index design incorporates interdisciplinary concepts from linguistics, cognitive psychology, mathematics, informatics, physics and computer science....
  3. Searching
    Web search query

    A web search query is a query that a user enters into web search engine to satisfy his or her information needs. Web search queries are distinctive in that they are unstructured and often ambiguous; they vary greatly from standard query languages which are governed by strict syntax rules....


Web search engines work by storing information about many web pages, which they retrieve from the WWW itself. These pages are retrieved by a Web crawler
Web crawler

A Web crawler is a computer program that browses the World Wide Web in a methodical, automated manner. Other terms for Web crawlers are ants, automatic indexers, bots, and worms or Web spider, Web robot, or?especially in the FOAF community?Web scutter....
 (sometimes also known as a spider) — an automated Web browser which follows every link it sees. Exclusions can be made by the use of robots.txt. The contents of each page are then analyzed to determine how it should be indexed (for example, words are extracted from the titles, headings, or special fields called meta tags). Data about web pages are stored in an index database for use in later queries. Some search engines, such as Google
Google

Google Inc. is an United States public company, earning revenue from AdWords related to its Google search, Gmail, Google Maps, Google Apps, Orkut, and YouTube services as well as selling advertising-free versions of the Google Search Appliance....
, store all or part of the source page (referred to as a cache
Web cache

Web caching is the Cache of web documents in order to reduce Bandwidth usage, web server load, and perceived lag. A web cache stores copies of documents passing through it; subsequent requests may be satisfied from the cache if certain conditions are met....
) as well as information about the web pages, whereas others, such as AltaVista
AltaVista

AltaVista is an Internet search engine company , and that company's search engine product....
, store every word of every page they find. This cached page always holds the actual search text since it is the one that was actually indexed, so it can be very useful when the content of the current page has been updated and the search terms are no longer in it. This problem might be considered to be a mild form of linkrot, and Google's handling of it increases usability
Usability

Usability is a term used to denote the ease with which people can employ a particular tool or other human-made object in order to achieve a particular goal....
 by satisfying user expectations
User expectations

User expectations refers to the consistency that users expect from products. Interaction design is very concerned with this topic. For example, our user expectations for traffic behavior is one of the more consistent ones because it is governed by traffic laws that are enforced....
 that the search terms will be on the returned webpage. This satisfies the principle of least astonishment
Principle of least astonishment

In user interface design, programming language design, and ergonomics, the principle of least astonishment states that, when two elements of an interface conflict, or are ambiguous, the behaviour should be that which will least surprise the human User or programmer at the time the conflict arises....
 since the user normally expects the search terms to be on the returned pages. Increased search relevance makes these cached pages very useful, even beyond the fact that they may contain data that may no longer be available elsewhere.

When a user enters a query
Web search query

A web search query is a query that a user enters into web search engine to satisfy his or her information needs. Web search queries are distinctive in that they are unstructured and often ambiguous; they vary greatly from standard query languages which are governed by strict syntax rules....
 into a search engine (typically by using key word
Keyword (Internet search)

An index term, subject term, subject heading, or descriptor, in information retrieval, is a term that captures the essence of the topic of a document....
s), the engine examines its index
Inverted index

In information technology, an inverted index is an Index storing a mapping from content, such as words or numbers, to its locations in a Table , or in a document or a set of documents, in this case allowing full text search....
 and provides a listing of best-matching web pages according to its criteria, usually with a short summary containing the document's title and sometimes parts of the text. Most search engines support the use of the boolean operators AND, OR and NOT to further specify the search query
Web search query

A web search query is a query that a user enters into web search engine to satisfy his or her information needs. Web search queries are distinctive in that they are unstructured and often ambiguous; they vary greatly from standard query languages which are governed by strict syntax rules....
. Some search engines provide an advanced feature called proximity search which allows users to define the distance between keywords.

The usefulness of a search engine depends on the relevance
Relevance (information retrieval)

In the context of information science and information retrieval, relevance denotes how well a retrieved set of documents meets the information need of the user....
 of the result set it gives back. While there may be millions of webpages that include a particular word or phrase, some pages may be more relevant, popular, or authoritative than others. Most search engines employ methods to rank the results to provide the "best" results first. How a search engine decides which pages are the best matches, and what order the results should be shown in, varies widely from one engine to another. The methods also change over time as Internet usage changes and new techniques evolve. Most Web search engines are commercial ventures supported by advertising
Advertising

Advertising is a form of communication that typically attempts to persuade potential customers to Purchasing or to consume more of a particular brand of Product or Service ....
 revenue and, as a result, some employ the practice of allowing advertisers to pay money to have their listings ranked
Paid inclusion

Paid inclusion is a search engine marketing product where the search engine company charges fees related to inclusion of websites in their search index....
 higher in search results. Those search engines which do not accept money for their search engine results make money by running search related ads
Contextual advertising

Contextual advertising is a form of targeted advertising for advertisements appearing on websites or other media, such as content displayed in mobile browser....
 alongside the regular search engine results. The search engines make money every time someone clicks on one of these ads.

Revenue in the web search portals industry is projected to grow in 2008 by 13.4 percent, with broadband connections expected to rise by 15.1 percent. Between 2008 and 2012, industry revenue is projected to rise by 56 percent as Internet penetration still has some way to go to reach full saturation in American households. Furthermore, broadband services are projected to account for an ever increasing share of domestic Internet users, rising to 118.7 million by 2012, with an increasing share accounted for by fiber-optic and high speed cable lines.

See also


Bibliography

  • For a more detailed history of early search engines, see (from Search Engine Watch
    Search Engine Watch

    Search Engine Watch is a website that provides news and information about search engines and search engine marketing. Search Engine Watch was started by Danny Sullivan in 1996....
    ), Chris Sherman, September 2003.
ISBN 978-0-910965-76-7*

External links