The Collection of Computer Science Bibliographies
Encyclopedia
The Collection of Computer Science Bibliographies is one of the oldest (if not the oldest) bibliography collections freely accessible on the Internet
Internet
The Internet is a global system of interconnected computer networks that use the standard Internet protocol suite to serve billions of users worldwide...

. It is a collection of bibliographies of scientific literature in computer science
Computer science
Computer science or computing science is the study of the theoretical foundations of information and computation and of practical techniques for their implementation and application in computer systems...

 and (computational) mathematics
Mathematics
Mathematics is the study of quantity, space, structure, and change. Mathematicians seek out patterns and formulate new conjectures. Mathematicians resolve the truth or falsity of conjectures by mathematical proofs, which are arguments sufficient to convince other mathematicians of their validity...

 from various sources, covering most aspects of computer science. The bibliographies are updated weekly from their original locations.

As of 2009 the collection contains more than 2.8 million unique references (mostly to journal articles, conference papers and technical reports), clustered in about 1700 bibliographies, and consists of more than 4.4 Gb
Gigabyte
The gigabyte is a multiple of the unit byte for digital information storage. The prefix giga means 109 in the International System of Units , therefore 1 gigabyte is...

 (950 Mb
Megabyte
The megabyte is a multiple of the unit byte for digital information storage or transmission with two different values depending on context: bytes generally for computer memory; and one million bytes generally for computer storage. The IEEE Standards Board has decided that "Mega will mean 1 000...

 gzip
Gzip
Gzip is any of several software applications used for file compression and decompression. The term usually refers to the GNU Project's implementation, "gzip" standing for GNU zip. It is based on the DEFLATE algorithm, which is a combination of Lempel-Ziv and Huffman coding...

ped) of BibTeX
BibTeX
BibTeX is reference management software for formatting lists of references. The BibTeX tool is typically used together with the LaTeX document preparation system...

 entries. More than 600,000 references contain cross-references to citing or cited publications.

More than 1 million references contain URL
Uniform Resource Locator
In computing, a uniform resource locator or universal resource locator is a specific character string that constitutes a reference to an Internet resource....

s to an online version of the paper. Abstracts are available for more than 1 million entries. There are more than 2,000 links to other sites carrying bibliographic information.

Duplicates and links

As the Collection of Computer Science Bibliographies consists of many subcollections there is a substantial overlap (roughly 1/3). At the end of 2008 there were more than 4.2 million records which represent about 2.8 million unique (in terms of normalized title and authors' last names) bibliographic entries.

The number of duplicates may be seen as a feature, because there is a greater chance for finding a freely available full text PDF of a searched publication. Publications are clustered by title and last names of authors, so it is possible to find an extended version (e.g. Technical Report
Technical report
A technical report is a document that describes the process, progress, or results of technical or scientific research or the state of a technical or scientific research problem. It might also include recommendations and conclusions of the research...

 or Thesis
Thesis
A dissertation or thesis is a document submitted in support of candidature for an academic degree or professional qualification presenting the author's research and findings...

) of an article.

There are also generated links to Google Scholar
Google Scholar
Google Scholar is a freely accessible web search engine that indexes the full text of scholarly literature across an array of publishing formats and disciplines. Released in beta in November 2004, the Google Scholar index includes most peer-reviewed online journals of Europe and America's largest...

 and IEEE Xplore
IEEE Xplore
IEEE Xplore is a scholarly research database that indexes, abstracts, and provides full-text for articles and papers on computer science, electrical engineering and electronics. The database mainly covers material from IEEE and IET. The IEEE Xplore database contains over two million...

 in the case no full text link was available directly. Almost every bibliographic query may be served in RSS
RSS (file format)
RSS is a family of web feed formats used to publish frequently updated works—such as blog entries, news headlines, audio, and video—in a standardized format...

 format.

Major subcollections


History

The collection was started in 1993 by Alf-Christian Achilles with a simple email-based interface and limited number of entries. One year later the first web interface has been made available. Since then the Collection was maintained by Achilles in his spare time. At the end of 2002 the maintenance has been handed over to Paul Ortyl.

External links

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK