All Topics  
Base pair

 

   Email Print
   Bookmark   Link

 

Base pair


 
 

In molecular biologyMolecular biology

Molecular biology is the study of biology at a molecular level....
, two nucleotideNucleotide

A nucleotide is a chemical compound that consists of a heterocyclic base, a sugar, and one or more phosphate groups....
s on opposite complementaryComplementarity (molecular biology)

In genetics, a double-stranded DNA or RNA strand consists of two complementary strands of base pairs, which are non-covalent...
 DNADNA

Deoxyribonucleic acid is a nucleic acid that contains the genetic instructions for the biological development of a cellu...
 or RNARNA

Ribonucleic acid is a nucleic acid polymer consisting of nucleotide monomers....
 strands that are connected via hydrogen bondFacts About Hydrogen bond

In chemistry, a hydrogen bond is a type of attractive intermolecular force that exists between two partial electric charges ...
s are called a base pair (often abbreviated bp). In the canonical Watson-Crick base pairing, adenineAdenine

Adenine is one of the two purine nucleobases used in forming nucleotides of the nucleic acids DNA and RNA....
 (A) forms a base pair with thymineThymine Summary

Thymine is one of the four bases in the nucleic acid of DNA that make up the letters ATGC....
 (T), as does guanineGuanine

Guanine is one of the five main nucleobases found in the nucleic acids DNA and RNA; the others being adenine, cytosine, thym...
 (G) with cytosineCytosine

Cytosine is one of the 5 main nucleobases used in storing and transporting genetic information within a cell in the nucleic ...
 (C) in DNA. In RNA, thymineThymine

Thymine is one of the four bases in the nucleic acid of DNA that make up the letters ATGC....
 is replaced by uracilUracil

Uracil is a common naturally occurring pyrimidine....
 (U). Non-Watson-Crick base pairing with alternate hydrogen bonding patterns also occur, especially in RNA; common such patterns are Hoogsteen base pairHoogsteen base pair

A Hoogsteen base pair is a minor variation of base-pairing in nucleic acids such as the AT pair shown in the figure....
s.
Pairing is also the mechanism by which codons on messenger RNAMessenger RNA

Messenger Ribonucleic Acid is RNA that encodes and carries information from DNA during transcription to sites of protein sy...
 molecules are recognized by anticodons on transfer RNATransfer RNA

Transfer RNA is a small RNA chain that transfers a specific amino acid to a growing polypeptide chain at the ribosomal site of pro...
 during protein translationTranslation (genetics)

Translation is the second process of protein biosynthesis....
. Some DNA- or RNA-binding enzymes can recognize specific base pairing patterns that identify particular regulatory regions of genes.

The size of an individual geneGene

A gene is the unit of heredity in living organisms....
 or an organism's entire genomeFacts About Genome

In biology the genome of an organism is its whole hereditary information and is encoded in the DNA ....
 is often measured in base pairs because DNA is usually double-stranded. Hence, the number of total base pairs is equal to the number of nucleotideNucleotide

A nucleotide is a chemical compound that consists of a heterocyclic base, a sugar, and one or more phosphate groups....
s in one of the strands (with the exception of non-coding single-stranded regions of telomereTelomere

A telomere is a region of highly repetitive DNA at the end of a linear chromosome that functions as a disposable buffer....
s). The haploid human genomeHuman genome

The human genome is the genome of Homo sapiens, which is composed of 24 distinct chromosomes with a total of approximate...
 (23 chromosomes) is estimated to be about 3 billion base pairs long and to contain 20,000-25,000 distinct genes.

A Kilobase is a unit of measurement in molecular biologyMolecular biology

Molecular biology is the study of biology at a molecular level....
 denoting 1000 base pairs of DNADNA Overview

Deoxyribonucleic acid is a nucleic acid that contains the genetic instructions for the biological development of a cellu...
 or RNARNA

Ribonucleic acid is a nucleic acid polymer consisting of nucleotide monomers....
.

Examples

The following DNA sequences illustrate pair double-stranded patterns. By convention, the top strand is written from the 5' end to the 3' end; thus the bottom strand is written 3' to 5'.

A base-paired DNA sequence:

ATCGAT
TAGCTA
The corresponding base-paired RNA sequence, in which uracilUracil

Uracil is a common naturally occurring pyrimidine....
 is substituted for thymine:

AUCGAU
UAGCUA

Length measurements

The following abbreviations are commonly used to describe the length of a DNA/RNA molecule:
  • bp = base pair(s)—one bp corresponds to ca 3.4 Å of length along the strand
  • kb (= kbp) = kilo base pairs = 1,000 bp
  • Mb = mega base pairs = 1,000,000 bp
  • Gb = giga base pairs = 1,000,000,000 bp


In case of single stranded DNA/RNA we talk about nucleotideNucleotide

A nucleotide is a chemical compound that consists of a heterocyclic base, a sugar, and one or more phosphate groups....
s, abbreviated nt (or knt, Mnt, Gnt), rather than base pairs, as they are not paired.
For distinction between units of computer storageComputer storage

Computer storage, computer memory, and often casually memory refer to computer components, devices and recording...
 and bases kbp, Mbp, Gbp etc may be used for disambiguation.

The CentimorganCentimorgan

In genetics, a centimorgan is a unit of recombinant frequency for measuring genetic linkage....
 is also often used to imply distance along a chromosome, but the number of base-pairs it corresponds to varies widely. In the Human genome, it is about 1 million base pairs . .

Hydrogen bonding and stability



Hydrogen bondHydrogen bond

In chemistry, a hydrogen bond is a type of attractive intermolecular force that exists between two partial electric charges ...
ing is the chemical mechanism that underlies the base-pairing rules described above. Appropriate geometrical correspondence of hydrogen bond donors and acceptors allows only the "right" pairs to form stably. The GC base pair has three hydrogen bonds, whereas the AT base pair has only two; as a consequence, the GC pair is more stable.

The larger nucleic acids, adenine and guanine, are members of a class of doubly-ringed chemical structures called purinePurine

Purine is a heterocyclic aromatic organic compound, consisting of a pyrimidine ring fused to an imidazole ring....
s; the smaller nucleic acids, cytosine and thymine (and uracil), are members of a class of singly-ringed chemical structures called pyrimidinePyrimidine

Pyrimidine is a heterocyclic aromatic organic compound similar to benzene and pyridine, containing two nitrogen atoms at pos...
s. Purines are only complementary with pyrimidines: pyrimidine-pyrimidine pairings are energetically unfavorable because the molecules are too far apart for hydrogen bonding to be established; purine-purine pairings are energetically unfavorable because the molecules are too close, leading to electrostatic repulsion. The only other possible pairings are GT and AC; these pairings are mismatches because the pattern of hydrogen donors and acceptors do not correspond. (It should be noted that the GU pairing, with two hydrogen bonds, does occur fairly often in RNARNA Summary

Ribonucleic acid is a nucleic acid polymer consisting of nucleotide monomers....
 but rarely in DNADNA

Deoxyribonucleic acid is a nucleic acid that contains the genetic instructions for the biological development of a cellu...
.)

Paired DNA and RNA molecules are comparatively stable at room temperature but the two nucleotide strands will separate above a melting point that is determined by the length of the molecules, the extent of mispairing (if any), and the GC content. Higher GC content results in higher melting temperatures; it is therefore unsurprising that the genomes of extremophileExtremophile

An extremophile is an organism, usually unicellular, which thrives in or requires 'extreme' conditions that would exceed opt...
 organisms such as Thermus thermophilusThermus thermophilus

Thermus thermophilus is a gram negative eubacterium used in a range of biotechnological applications, including as a model o...
are particularly GC-rich. Conversely, regions of a genome that need to separate frequently - for example, the promoter regions for often-transcribedTranscription (genetics)

Transcription is the process through which a DNA sequence is enzymatically copied by an RNA polymerase to produce a complem...
 genes - are comparatively GC-poor (for example, see TATA boxFacts About TATA box

A TATA box is a DNA sequence found in the promoter region of most genes....
). GC content and melting temperature must also be taken into account when designing primersPrimer (molecular biology)

A primer is a nucleic acid strand, or a related molecule that serves as a starting point for DNA replication....
 for PCR reactions.

Base stacking

Base stackingStacking (chemistry)

Stacking in supramolecular chemistry refers to a stacked arrangement of aromatic molecules, which interact through aromatic ...
 interactions between the pi orbitalsPi bond

In chemistry, pi bonds are chemical bonds of the covalent type, where two lobes of one involved electron orbital overl...
 of the bases' aromatic rings also contribute to stability, and again GC stacking interactions with adjacent bases tend to be more favorable. (Note, though, that a GC stacking interaction with the next base pair is geometrically different from a CG interaction.) Base stacking effects are especially important in the secondary structure of RNA; for example, RNA stem-loopStem-loop

Stem-loop intramolecular base pairing is a pattern that can occur in single-stranded DNA or, more commonly, in RNA....
 structures are stabilized by base stacking in the loop region.

Base analogs and intercalators

Chemical analogs of nucleotides can take the place of proper nucleotides and establish non-canonical base-pairing, leading to errors (mostly point mutationPoint mutation

A point mutation, or substitution, is a type of mutation that causes the replacement of a single base nucleotide with ...
s) in DNA replicationDNA replication

DNA replication or DNA synthesis is the process of copying a double-stranded DNA strand in a cell, prior to cell divis...
 and DNA transcriptionTranscription (genetics)

Transcription is the process through which a DNA sequence is enzymatically copied by an RNA polymerase to produce a complem...
. One common mutagenic base analog is 5-bromouracilFacts About 5-Bromouracil

5-Bromouracil-pyrimidinedione or 5-BrU or 5-BU) is a component of 5-bromo-2-deoxy-uridine....
, which resembles thymine but can base-pair to guanine in its enolEnol

Enol is an alkene with hydroxyl group on one of the carbon atoms of the double bond....
 form.

Other chemicals, known as DNA intercalators, fit into the gap between adjacent bases on a single strand and induce frameshift mutationFrameshift mutation

A frameshift mutation is a genetic mutation that inserts or deletes a number of nucleotides that is not evenly divisible by ...
s by "masquerading" as a base, causing the DNA replication machinery to skip or insert additional nucleotides at the intercalated site. Most intercalators are large polyaromatic compounds and are known or suspected carcinogenCarcinogen

In pathology, a carcinogen is any substance or agent that promotes cancer....
s. Examples include ethidium bromideEthidium bromide

Ethidium bromide is an intercalating agent commonly used as a nucleic acid stain in molecular biology laboratories for techn...
 and acridineAcridine

Acridine, C13H9N, is an organic compound and a nitrogen heterocycle....
.

See also

  • DNADNA

    Deoxyribonucleic acid is a nucleic acid that contains the genetic instructions for the biological development of a cellu...
  • NucleobaseNucleobase

    align="center" valign="bottom"! Nucleobase...
  • Wobble base pairWobble base pair

    ...
  • Hoogsteen base pairHoogsteen base pair

    A Hoogsteen base pair is a minor variation of base-pairing in nucleic acids such as the AT pair shown in the figure....
  • List of binary polymorphisms

External links

  • - webserver version of the EMBOSS tool for calculating melting temperatures