All Topics  
Distributed Proofreaders

 

   Email Print
   Bookmark   Link






 

Distributed Proofreaders



 
 
Distributed Proofreaders (commonly abbreviated as DP or PGDP) is a web-based project that supports the development of e-text
E-text

An e-text is, generally, any text-based information that is available in a digitally encoded human-readable format and read by electronic means, but more specifically it refers to files in the ASCII character encoding....
s for Project Gutenberg
Project Gutenberg

Project Gutenberg, abbreviated as PG, is a volunteer effort to digitize, archive and distribute cultural works, as founder Michael Hart said "To encourage the creation and distribution of eBooks."....
 by allowing many people to work together in proofreading drafts of e-texts for errors.

History
Distributed Proofreaders was founded by Charles Franks in 2000 as an independent site to assist Project Gutenberg. Distributed Proofreaders became an official Project Gutenberg site in 2002.

On 8 November 2002, Distributed Proofreaders was slashdotted
Slashdot effect

The Slashdot effect, also known as slashdotting, is the phenomenon of a popular website linking to a smaller site, causing the smaller site to slow down or even temporarily close due to the increased traffic....
, and more than 4,000 new members joined in one day, causing an influx of new proofreaders and software developers, which helped to greatly increase the quantity and quality of e-text production.






Discussion
Ask a question about 'Distributed Proofreaders'
Start a new discussion about 'Distributed Proofreaders'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Distributed Proofreaders (commonly abbreviated as DP or PGDP) is a web-based project that supports the development of e-text
E-text

An e-text is, generally, any text-based information that is available in a digitally encoded human-readable format and read by electronic means, but more specifically it refers to files in the ASCII character encoding....
s for Project Gutenberg
Project Gutenberg

Project Gutenberg, abbreviated as PG, is a volunteer effort to digitize, archive and distribute cultural works, as founder Michael Hart said "To encourage the creation and distribution of eBooks."....
 by allowing many people to work together in proofreading drafts of e-texts for errors.

History


Distributed Proofreaders was founded by Charles Franks in 2000 as an independent site to assist Project Gutenberg. Distributed Proofreaders became an official Project Gutenberg site in 2002.

On 8 November 2002, Distributed Proofreaders was slashdotted
Slashdot effect

The Slashdot effect, also known as slashdotting, is the phenomenon of a popular website linking to a smaller site, causing the smaller site to slow down or even temporarily close due to the increased traffic....
, and more than 4,000 new members joined in one day, causing an influx of new proofreaders and software developers, which helped to greatly increase the quantity and quality of e-text production. Distributed Proofreaders posted their 5,000th text to Project Gutenberg in August 2004, and in March 2007, the 10,000th DP-produced e-text was posted to Project Gutenberg. the 11,000+ DP-contributed e-texts comprised almost half of works in Project Gutenberg.

On 31 July, 2006, the Distributed Proofreaders Foundation was formed to provide Distributed Proofreaders with its own legal entity and not-for-profit
Non-profit organization

A nonprofit organization is any organization that does not aim to make a profit, and which is not a public body....
 status. IRS
Internal Revenue Service

The Internal Revenue Service is the Federal government of the United States agency that collects taxes and enforces the tax law. It is an agency within the U.S....
 approval of section 501(c)(3)
501(c)

501 is a provision of the United States Internal Revenue Code , listing 26 types of non-profit organizations Tax exemption from some Taxation in the United States Income tax in the United States....
 status was granted retroactive to 7 April, 2006.

Proofreading process


Public domain
Public domain

File:PD-icon.svgThe public domain is a range of abstract materials?commonly referred to as intellectual property?which are not owned or controlled by anyone....
 works, typically books with expired copyright, are scanned by volunteers or culled from digitalization projects, and the images are run through optical character recognition
Optical character recognition

Optical character recognition, usually abbreviated to OCR, is the mechanical or Electronics translation of s of handwritten, typewritten or printed text into machine-editable text....
 (OCR) software. Since OCR software is far from perfect, often a large number of errors appear in the resulting text. To correct them, pages are made available to volunteers via the Internet; the original page image and the recognized text appear side by side. This process thereby distributes the time-consuming error-correction process, akin to distributed computing
Distributed computing

Distributed computing deals with hardware and software systems containing more than one processing element or Computer data storage element, Concurrent computing processes, or multiple programs, running under a loosely or tightly controlled regime....
.

Each page is proofread and formatted many times, and then a post-processor combines the pages and prepares the text for uploading to Project Gutenberg.

Besides custom software created to support the project, DP also runs a forum and a wiki for project coordinators and participants.

Related Projects


DP Europe

In January 2004, Distributed Proofreaders Europe started, hosted by Project Rastko
Project Rastko

Project Rastko - Internet Library of Serb Culture is a non-profit and non-governmental publishing, cultural and educational project dedicated to Serbs and Serb-related arts and humanities....
. This site has the ability to process text in Unicode
Unicode

Unicode is a computing industry standard allowing computers to consistently represent and manipulate Character expressed in most of the world's writing systems....
 UTF-8
UTF-8

UTF-8 is a Variable-width encoding character encoding for Unicode. It is able to represent any character in the Unicode standard, yet the initial encoding of byte codes and character assignments for UTF-8 is backward compatibility with ASCII....
 encoding. Books proofread are centered mainly on European culture, with a large proportion of non-English texts including Hebrew, Arabic, Urdu and many others. , DP Europe had produced over 480 e-texts.

DP is sometimes referred to as "DP International" by members of DP Europe. However, DP servers are located in the United States
United States

The United States of America is a Federal government constitutional republic comprising U.S. state and a federal district. The country is situated mostly in central North America, where its Contiguous United States and Washington, D.C., the Capital districts and territories, lie between the Pacific Ocean and Atlantic Oceans, Borders of the U...
, and therefore works must be cleared by Project Gutenberg as being in the public domain
Public domain

File:PD-icon.svgThe public domain is a range of abstract materials?commonly referred to as intellectual property?which are not owned or controlled by anyone....
 according to U.S. copyright
Copyright

Copyright is a form of intellectual property which gives the creator of an original work exclusive rights for a certain time period in relation to that work, including its publication, distribution and adaptation; after which time the work is said to enter the public domain....
 law before they can be proofread at DP.

DP Canada

On 1 December 2007, Distributed Proofreaders Canada launched to support the production of e-books for Project Gutenberg Canada
Project Gutenberg Canada

Project Gutenberg Canada began on Canada Day 2007. Canadian citizens will be able to create e-texts and download many books that are not yet in the Public Domain of some other countries....
 and take advantage of shorter Canadian copyright
Copyright

Copyright is a form of intellectual property which gives the creator of an original work exclusive rights for a certain time period in relation to that work, including its publication, distribution and adaptation; after which time the work is said to enter the public domain....
 terms. Although it was established by members of the original Distributed Proofreaders site, it is a separate entity. All of its projects are posted to Project Gutenberg Canada, which launched on Canada Day
Canada Day

Canada Day , formerly Dominion Day , is Canada's National Day, a Public holidays in Canada, celebrating the anniversary of the July 1, 1867 enactment of the Constitution Act, 1867, which united Canada as a single country of four provinces....
 2007.

In addition to preserving Canadiana, DP Canada is notable because it is the first major effort to take advantage of Canada's copyright laws which may allow more works to be preserved. Like copyright law in most other countries, Canada has a "life plus 50" copyright term. This means that works by authors who died more than fifty years ago may be preserved in Canada, whereas in other parts of the world those works may not be distributed because they are still copyright.

Notable authors whose works may be preserved in Canada but not other parts of the world include A. A. Milne
A. A. Milne

Alan Alexander Milne was an England author, best known for his books about the teddy bear Winnie-the-Pooh and for various children's poems. Milne was a noted writer, primarily as a playwright, before the huge success of Pooh overshadowed all his previous work....
, Walter de la Mare
Walter de la Mare

Walter John de la Mare , Order of Merit Order of the Companions of Honour was an British poetry, short story writer and British literature, probably best remembered for his works for children and "The Listeners"....
, Sheila Kaye-Smith
Sheila Kaye-Smith

Sheila Kaye-Smith was an English writer, known for her many novels set in the borderlands of Sussex and Kent in the English regional tradition....
 and Amy Carmichael
Amy Carmichael

Amy Wilson Carmichael was a Protestant Christian missionary in India, who opened an orphanage and founded a mission in Tamil Nadu. She served in India for fifty-five years without furlough and authored many books about the missionary work there....
.

10,000th E-book


On 9 March 2007, Distributed Proofreaders announced completing more than 10,000 titles. In celebration, a block of 15 titles was published:
  • by the U.S. Work Projects Administration (English)
  • edited by John Wesley Powell
    John Wesley Powell

    John Wesley Powell was a United States soldier, geology, and explorer of the American West. He is famous for the 1869 Powell Geographic Expedition of 1869, a three-month river trip down the Green River and Colorado River rivers that included the first passage through the Grand Canyon....
     (English)
  • by Randolph Caldecott
    Randolph Caldecott

    Randolph Caldecott was a British artist and illustrator, born in Chester. He was the eponym of the Caldecott Medal.He exercised his art chiefly in book illustrations....
     [Illustrator] (English)
  • by Serpa Pinto (Portuguese)
  • by E. E. "Doc" Smith
    E. E. Smith

    E. E. Smith, also Edward Elmer Smith, Ph.D., E.E. "Doc" Smith, Doc Smith, "Skylark" Smith, and Ted was a Food engineering and early science fiction author who wrote the Lensman series and the Skylark series, among others....
     (English)
  • by Johanna Spyri
    Johanna Spyri

    Johanna Spyri was an author of children's stories, and is best known for Heidi. Born Johanna Louise Heusser in the rural area of Hirzel, Switzerland, as a child she spent several summers in the area around Chur in Graub?nden, the setting she later would use in her novels....
     (English)
  • by Johanna Spyri (German)
  • of Punch
    Punch (magazine)

    'Punch' was a Great Britain weekly magazine of humour and satire published from 1841 to 1992 and from 1996 to 2002. Punch material was also collected in book formats as early as the 1800s, including Pick of the Punch annuals with cartoons and text features, Punch and the War a 1941 collection of WWII-related cartoons, and A B...
     (English)
  • by John Evelyn
    John Evelyn

    John Evelyn was an England writer, gardener and diarist.Evelyn's diary or Memoirs are largely contemporaneous with those of the other noted diarist of the time, Samuel Pepys, and cast considerable light on the art, culture and politics of the time ....
     (English)
  • by Therese de Dillmont (English)
  • by Francisco Ernantez Arana (fl. 1582), translated and edited by Daniel G. Brinton
    Daniel Garrison Brinton

    Daniel Garrison Brinton , was an American archaeologist and ethnologist....
     (1837–1899) (English with Central American Indian)
  • by Richard Runciman Terry
    Richard Runciman Terry

    Sir Richard Runciman Terry was an English organist, choir director and musicologist. He is noted for his pioneering revival of Tudor era liturgical music....
     (1864–1938) (English)
  • by William Shakespeare
    William Shakespeare

    William Shakespeare was an English people poet and playwright, widely regarded as the greatest writer in the English language and the world's preeminent dramatist....
    , translated by François Guizot
    François Guizot

    Fran?ois Pierre Guillaume Guizot was a France historian, orator, and statesman. Guizot was a dominant figure in French politics prior to the Revolution of 1848, actively opposing as a liberal the reactionary King Charles X before his overthrow in the July Revolution of 1830, then in government service to the "citizen king" Louis-Philippe of...
     (French)
  • by Charles William Burkett (English)
  • by Carolus Linnaeus
    Carolus Linnaeus

    Carl Linnaeus was a Sweden botanist, physician, and zoologist, who laid the foundations for the modern scheme of binomial nomenclature. He is known as the father of modern alpha taxonomy, and is also considered one of the fathers of modern ecology....
     (Carl von Linné) (Latin)


See also



External links

  • at SourceForge
    SourceForge

    SourceForge Enterprise Edition is a collaborative revision control and software development management system. It provides a front-end to a range of software development lifecycle services and integrates with a number of free software / open source software applications ....