Zettair
Zettair is a compact text search engine for indexing and searching HTML (or TREC) collections. It is open-source software developed by a group of researchers at RMIT University.

Its primary feature is the ability to handle large document collections (100 GB and more). It has a single executable, which performs both on-the-fly indexing and searching through a command-line interface.
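
As a rough illustration of what indexing and searching mean in general, the sketch below builds a tiny in-memory inverted index and answers conjunctive (all terms must match) queries. It is not Zettair's implementation or interface: the function names `tokenize`, `build_index`, and `search`, and the sample documents, are hypothetical and chosen only for this example.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each term to the set of document ids that contain it.

    `docs` is a dict of {doc_id: text}. This is a toy in-memory
    structure, not the on-disk format a real engine would use.
    """
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every query term."""
    terms = tokenize(query)
    if not terms:
        return set()
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings)


if __name__ == "__main__":
    # Hypothetical sample collection, for illustration only.
    docs = {
        "a": "Compact text search engine",
        "b": "Text retrieval conference",
        "c": "Search engine indexing",
    }
    index = build_index(docs)
    print(sorted(search(index, "search engine")))
```

An engine built for collections of 100 GB or more cannot keep such a structure entirely in memory as Python sets; the sketch only shows the term-to-documents mapping that indexing produces and that searching consults.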

It is licensed under the terms of the BSD license.

See also

  • Information retrieval (IR)
  • Zetta

The source of this article is Wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.