Bank of English
Encyclopedia
The Bank of English is the name of the COBUILD
COBUILD
COBUILD, an acronym for Collins Birmingham University International Language Database, is a British research facility set up at the University of Birmingham in 1980 and funded by Collins publishers.The facility was led by Professor John Sinclair...

 corpus
Text corpus
In linguistics, a corpus or text corpus is a large and structured set of texts...

, a collection of English texts. These are mainly British, but American and Australian data are also included.

The majority of the texts are from written English, but there is also a large component of spoken data. The corpus totals 525 million running words as of 2005. Copies of the corpus are held both at HarperCollins
HarperCollins
HarperCollins is a publishing company owned by News Corporation. It is the combination of the publishers William Collins, Sons and Co Ltd, a British company, and Harper & Row, an American company, itself the result of an earlier merger of Harper & Brothers and Row, Peterson & Company. The worldwide...

 Publishers and the University of Birmingham
University of Birmingham
The University of Birmingham is a British Redbrick university located in the city of Birmingham, England. It received its royal charter in 1900 as a successor to Birmingham Medical School and Mason Science College . Birmingham was the first Redbrick university to gain a charter and thus...

. The version at Birmingham can be accessed for academic research.

The Bank of English forms part of the Collins Word Web together with the French, German and Spanish corpora.

See also

  • Corpus of Contemporary American English
    Corpus of Contemporary American English
    The freely-searchable 425 million word Corpus of Contemporary American English is the largest corpus of American English currently available, and the only publicly-available corpus of American English to contain a wide array of texts from a number of genres.It was created by Mark Davies, Professor...

     (COCA) 385 million words, 1990-present. Freely searchable online.
  • British National Corpus
    British National Corpus
    The British National Corpus is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. It was compiled as a general corpus in the field of corpus linguistics...

  • Corpus linguistics
    Corpus linguistics
    Corpus linguistics is the study of language as expressed in samples or "real world" text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language. Originally done by hand, corpora are now largely...

  • COBUILD
    COBUILD
    COBUILD, an acronym for Collins Birmingham University International Language Database, is a British research facility set up at the University of Birmingham in 1980 and funded by Collins publishers.The facility was led by Professor John Sinclair...

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK