All Topics  
Nucleic acid analogues

 

   Email Print
   Bookmark   Link






 

Nucleic acid analogues



 
 
Nucleic acid analogues are compounds structurally similar (analog
Analog (chemistry)

In chemistry, analogs or analogues are chemical compound in which one or more individual atoms have been replaced, either with a different atom, or with a different functional group....
) to naturally occurring RNA and DNA, used in medicine and in molecular biology research. Nucleic acid
Nucleic acid

A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
s are chains of nucleotides, which are composed of three parts: a phosphate
Phosphate

A phosphate, an inorganic chemical, is a Salt of phosphoric acid. Inorganic phosphates are mining to obtain phosphorus for use in agriculture and industry....
 backbone, a pucker-shaped pentose sugar, either ribose
Ribose

Ribose, primarily occurring as D-ribose, is an organic compound that occurs widely in nature. It is an aldopentose, that is a monosaccharide containing five carbon atoms that, in its acyclic form, has an aldehyde functional group at one end....
 or deoxyribose
Deoxyribose

Deoxyribose, also known as D-Deoxyribose and 2-deoxyribose, is an aldopentose — a monosaccharide containing five carbon atoms, and including an aldehyde functional group in its linear structure....
, and one of four nucleobase
Nucleobase

Nucleobases are the parts of DNA and RNA that may be involved in pairing . The main ones are cytosine, guanine, adenine , thymine and uracil , abbreviated as C, G, A, T, and U, respectively....
s. An analogue may have any of these altered, typically the analogue nucleobases confer, among other things, different base pairing and base stacking proprieties such as universal bases, which can pair with all four canon bases, while the phosphate-sugar backbone analogues affect the properties of the chain, such as PNA, the secondary structure of which differs significantly from DNA, and may form a triplex (a triple stranded helix).






Discussion
Ask a question about 'Nucleic acid analogues'
Start a new discussion about 'Nucleic acid analogues'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Na Comparedto Dna Thymineanduracilcorrected
Nucleic acid analogues are compounds structurally similar (analog
Analog (chemistry)

In chemistry, analogs or analogues are chemical compound in which one or more individual atoms have been replaced, either with a different atom, or with a different functional group....
) to naturally occurring RNA and DNA, used in medicine and in molecular biology research. Nucleic acid
Nucleic acid

A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
s are chains of nucleotides, which are composed of three parts: a phosphate
Phosphate

A phosphate, an inorganic chemical, is a Salt of phosphoric acid. Inorganic phosphates are mining to obtain phosphorus for use in agriculture and industry....
 backbone, a pucker-shaped pentose sugar, either ribose
Ribose

Ribose, primarily occurring as D-ribose, is an organic compound that occurs widely in nature. It is an aldopentose, that is a monosaccharide containing five carbon atoms that, in its acyclic form, has an aldehyde functional group at one end....
 or deoxyribose
Deoxyribose

Deoxyribose, also known as D-Deoxyribose and 2-deoxyribose, is an aldopentose — a monosaccharide containing five carbon atoms, and including an aldehyde functional group in its linear structure....
, and one of four nucleobase
Nucleobase

Nucleobases are the parts of DNA and RNA that may be involved in pairing . The main ones are cytosine, guanine, adenine , thymine and uracil , abbreviated as C, G, A, T, and U, respectively....
s. An analogue may have any of these altered, typically the analogue nucleobases confer, among other things, different base pairing and base stacking proprieties such as universal bases, which can pair with all four canon bases, while the phosphate-sugar backbone analogues affect the properties of the chain, such as PNA, the secondary structure of which differs significantly from DNA, and may form a triplex (a triple stranded helix).

Medicine

Several nucleoside analogues are used as antiviral or anticancer agents. The viral polymerase incorporates these compounds with non-canon bases. These compounds are activated in the cells by being converted into nucleotides, they are administered as nucleosides since charged nucleotides cannot easily cross cell membranes.

Molecular biology

Nucleic acid analogues are used in molecular biology for several purposes:
  • As a tool to detect particular sequences
  • As a tool with resistance to RNA hydrolysis
    RNA

    Ribonucleic acid is a type of molecule that consists of a long chain of nucleotide units. Each nucleotide consists of a nucleobase, a ribose sugar, and a phosphate....
  • As a tool for another purpose, such as sequencing
    DNA sequencing

    The term DNA sequencing refers to methods for determining the order of the nucleotide bases, adenine, guanine, cytosine, and thymine, in a molecule of DNA....
  • Naturally occurring, such as in tRNA
  • Investigation of the mechanisms used by enzyme, such as an Enzyme inhibitor
    Enzyme inhibitor

    Enzyme inhibitors are molecules that bind to enzymes and decrease their enzyme activity. Since blocking an enzyme's activity can kill a pathogen or correct a metabolism imbalance, many drugs are enzyme inhibitors....
  • Investigation of possible scenarios of the origin of life
  • Investigation of the structural features of nucleic acids
  • Investigation of the possible alternatives to the natural system in Synthetic biology
    Synthetic biology

    Synthetic biology is a new area of biology research that combines science and engineering in order to design and build novel biological functions and systems....


Backbone analogues


Hydrolysis resistant RNA-analogues

To overcome the fact that ribose
Ribose

Ribose, primarily occurring as D-ribose, is an organic compound that occurs widely in nature. It is an aldopentose, that is a monosaccharide containing five carbon atoms that, in its acyclic form, has an aldehyde functional group at one end....
's 2' hydroxy group that reacts with the phosphate linked 3' hydroxy group (RNA is too unstable to be used or synthesised reliably), a ribose anologue is used. The most common RNA analogues are locked nucleic acid (LNA
Locked nucleic acid

A locked nucleic acid , often referred to as inaccessible RNA, is a modified RNA nucleotide. The ribose moiety of an LNA nucleotide is modified with an extra bridge connecting the 2' and 4' carbons....
), morpholino
Morpholino

In molecular biology, a Morpholino is a molecule used to modify gene expression. Morpholino oligomers are an antisense technology used to block access of other molecules to specific sequences within nucleic acid....
, peptide nucleic acid (PNA). These oligonucleotides differ as they have a different backbone sugar but still bind according to Watson and Crick pairing with RNA or DNA, but are immune to nuclease activity (They generally cannot be enzymatically synthesised and can only be produced synthetically
Oligonucleotide synthesis

Oligonucleotide synthesis is the non-biological, chemical synthesis of defined short sequences of nucleic acids. It is extremely useful in laboratory practice covering a wide range of molecular biology applications and in pharmaceutical research....
.

Other notable analogues used as tools

In sequencing
Sequencing

In genetics and biochemistry, sequencing means to determine the primary structure of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succinctly summarizes much of the atomic-level structure of the sequenced molecule....
 dideoxynucleotides
Dideoxynucleotides

Dideoxynucleotides, or ddNTPs, are nucleotides lacking a 3'-hydroxyl group on their deoxyribose sugar. Since deoxyribose already lacks a 2'-OH, dideoxyribose lacks hydroxyl groups at both its 2' and 3' carbons....
 are used. These nucleotide triphosphates possess a non-canon sugar, dideoxyribose which lacks 3' hydroxyl group (which accepts the phosphate) and therefore cannot bond with the next base, terminating the chain as the DNA polymerases mistake it for a regular deoxyribonucleotide. The nucleoside analogue with a ribose lacking both 2' and 3' is called cordycepin
Cordycepin

Cordycepin, or 3-deoxyadenosine, is a Derivative of the nucleoside adenosine, differing from the latter by the absence of oxygen in the Nucleic acid nomenclature position of its ribose part....
, an anticancer drug. Another analogue in sequencing is a nucleobase analogue, 7-deaza-GTP and is used to sequence CG rich regions, instead 7-deaza-ATP is called tubercidin, an antibiotic.

Precursors to the RNA world

RNA may be too complex to be the first nucleic acid, so before the RNA world
RNA world hypothesis

The RNA world hypothesis proposes that a world filled with life based on ribonucleic acid predated current life based on deoxyribonucleic acid ....
 there may have been one of these several candidate original nucleic acids which differ in the backbone, such as TNA
TNA (nucleic acid)

Threose nucleic acid is a polymer similar to DNA or RNA but differing in the composition of its "backbone". TNA is not known to occur naturally....
 and GNA
GNA (nucleic acid)

Glycerol nucleic acid is a polymer similar to DNA or RNA but differing in the composition of its "backbone". GNA is not known to occur naturally....
 and PNA.

Base analogues


Nucleobase structure and nomeclature

Natural bases are divided into two classes depending on their structure: pyrimidine
Pyrimidine

Pyrimidine is a heterocyclic aromatic organic compound similar to benzene and pyridine, containing two nitrogen atoms at positions 1 and 3 of the six-member ring....
 (an heterocyclic aromatic six-membered ring with nitrogen atoms in position 1 and 3) and purine
Purine

Purine is a heterocyclic compound aromatic organic compound, consisting of a pyrimidine ring fused to an imidazole ring. Purines, including substituted purines and their tautomers, are the most widely distributed kind of nitrogen-containing heterocycle in nature....
 (a pyrimidine (numeration inverted) fused with an imidazole ring, a five-membered ring with 2 nitrogen atoms separated by one carbon (meta), 7,9). Their main proprieties are base pairing, resulting form 2 or 3 hydrogen bonds between keto and amino functional groups, and base stacking
Stacking (chemistry)

Stacking in supramolecular chemistry refers to a stacked arrangement of aromatic molecules, which interact through aromatic interactions. The most popular example of a stacked system is found for consecutive base pairs in DNA....
, caused by the attraction of the delocalized pi electron clouds of the aromatic ring structure.

Purine
Purine

Purine is a heterocyclic compound aromatic organic compound, consisting of a pyrimidine ring fused to an imidazole ring. Purines, including substituted purines and their tautomers, are the most widely distributed kind of nitrogen-containing heterocycle in nature....
Pyrimidine
Pyrimidine

Pyrimidine is a heterocyclic aromatic organic compound similar to benzene and pyridine, containing two nitrogen atoms at positions 1 and 3 of the six-member ring....

Fluorophores

Commonly fluorophore
Fluorophore

A fluorophore, in analogy to a chromophore, is a component of a molecule which causes a molecule to be fluorescent. It is a functional group in a molecule which will absorb energy of a specific wavelength and re-emit energy at a different wavelength....
s (such as rhodamine
Rhodamine

Rhodamine is a family of related chemical compounds, fluorone dyes. Examples are Rhodamine 6G and Rhodamine B. They are used as a dye and as a dye laser gain medium....
 or fluorescein
Fluorescein

Fluorescein is a fluorophore commonly used in microscopy, in a type of dye laser as the gain medium, in forensics and serology to detect latent blood stains, and in dye tracing....
) are linked to the ring linked to the sugar (in para) via a flexible arm, presumably extruding form the major groove of the helix. Due to taq polymerases low processivity of the nucleotides linked to bulky adducts such as florophores, the sequence is typically copied using a nucleotide with an arm and later coupled with a reactive fluorophore (indirect labelling):
  • amine reactive: Aminoallyl nucleotide
    Aminoallyl nucleotide

    Aminoallyl nucleotides are used in post-labeling of nucleic acid to be used in microarrays. These nucleotides are formally known as 5--nucleotides since the aminoallyl group is usually attached to carbon 5 of the pyrimidine ring of uracil and cytosine....
     contain a primary amine group on a linker that reacts with the amino-reactive dye such as a cyanine
    Cyanine

    Cyanine is a International Union of Pure and Applied Chemistry nomenclature name of a synthetic dye family belonging to polymethine group. Cyanines have many uses as fluorescent dyes, particularly in biomedical imaging....
     or Alexa Fluor dyes, which contain a reactive leaving group, such as a succinimidyl ester (NHS). (base pairing amino groups are not affected).
  • thiol reactive: thiol containing nucleotides reacts with the fluorophore linked to a reactive leaving group, such as a maleimide.
  • biotin
    Biotin

    Biotin, also known as vitamin H or B7, has the chemical formula C10H16N2O3S , is a water-soluble B-complex vitamin which is composed of an ureido ring fused with a tetrahydrothiophene ring....
     linked nucleotides rely on the same indirect labelling principle (+ fluorescent streptavidin) and are used in Affymetrix
    Affymetrix

    Affymetrix is a manufacturer of DNA microarrays, based in Santa Clara, California, United States. The company was co-founded by Dr. Stephen Fodor in 1992....
     DNAchips.
Fluorophores find a variety of uses
Fluorescence

Fluorescence is a luminescence that is mostly found as an optical phenomenon in cold bodies, in which the molecular absorption of a photon triggers the emission of a photon with a longer wavelength....
 in medicine and biochemistry.

Natural non-canon bases

In a cell, there are several noncanon bases present: CpG islands in DNA (are often methylated), all eukaryotic mRNA (capped with a methyl-7-guanosine), and several bases of rRNAs (are methylated). Often, tRNAs are heavily modified postranscriptionally in order to improve their conformation or base pairing in particular in/near the anticodon: inosine can base pair with C, U, and even with A, whereas thiouridine (with A) is more specific than uracil (with a purine). Other common tRNA base modifications are pseudorindine (which gives its name to the T?C loop), dihydrouridine (which does not stack as it is not aromatic), queosine, wyosine and so forth. Nevertheless these are all modifications to normal bases and are not placed by a polymerase.

Base-pairing

Canonical bases may have either a keto or amino group on the carbons surrounding the nitrogen atom furthest away from the glycosidic bond, which allows them to base pair
Base pair

In molecular biology, two nucleotides on opposite complementarity DNA or RNA strands that are connected via hydrogen bonds are called a base pair ....
 (Watson-Crick base pairing) via hydrogen bonds (amine with keto, purine with pyrimidine). A and T are amine only and keto only, while C and G are mixed (inverted in respect to each other).

Natural basepairs
A GC basepair: purine keto/amine forms three intermolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s with pyrimidine amine/keto
An AT basepair: purine amine/- forms two intermolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s with pyrimidine keto/keto


The precise reason why there are only four nucleotides is debated, but there are several unused possibilities. Furthermore adenine is not the most stable choice for base pairing: in Cyanophage S-2L diaminopurine (DAP) is used instead of adenine (host evasion
Innate immune system

The innate immune system comprises the cells and mechanisms that defend the host from infection by other organisms, in a non-specific manner. This means that the cells of the innate system recognize and respond to pathogens in a generic way, but unlike the adaptive immune system, it does not confer long-lasting or protective immunity to the h...
). Diaminopurine basepairs perfectly with thymine as it is identical to adenine but has an amine group at position 2 forming 3 intramolecular hydrogen bonds, eliminating the major difference between the two types of basepairs (Weak:A-T and Strong:C-G). This improved stability affects protein binding ineractions which rely on those differences. Other combination include,
  • isoguanosine and isocytosine, which are have their amino and keto inverted, (not used probably as tautomers are problematic for base pairing, but isoG and isoG can be amplified correctly with PCR even in the presence of the 4 canon bases)
  • diaminopyrimidine and a xanthine (not used as xanthine is a deamination product)


Unused basepair arrangements
A DAP-T base: purine amine/amine forms three intermolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s with pyrimidine keto/keto
An X-DAY base: purine keto/keto forms three intermolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s with pyrimidine amine/amine
A iG-iC base: purine amine/keto forms three intermolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s with pyrimidine keto/amine


However correct DNA structure can form even when the bases are not paired via hydrogen bonding, as studies have shown using DNA isostere
Isostere

Isosteres are molecules or ions with the same number of atoms and the same number of valence electrons. As a result, they can exhibit similar pharmacokinetic and pharmacodynamic properties....
s (analogues with same number of atoms), such as the thymine analogue 2,4-difluorotoluene (F) or the adenine analogue 4-methylbenzimidazole (Z).

Other noteworthy basepairs:
  • Several fluorescent bases have also been made, such as the 2-amino-6-(2-thienyl)purine and pyrrole-2-carbaldehyde base pair.
  • Metal coordinated bases, such as two 2,6-bis(ethylthiomethyl)pyridine (SPy) with a silver ion or pyridine-2,6-dicarboxamide (Dipam) and a mondentate pyridine (Py) with a copper ion .
  • Universal bases may pair indiscriminately with any other base, but generally lower the melting temperature of the sequence considerably, examples include 2'-deoxyinosine derivatives, nitroazole analogues and hydrophobic aromatic non-hydrogen bonding bases (strong stacking effects). These are used as proof of concept and are not generally utilised in degenerate primers (which are a mixture of primers).
  • The numbers of possible base pairs is doubled when xDNA
    XDNA

    xDNA is a modified form of DNA with 8 nucleobases: the four natural bases adenine, cytosine, guanine, and thymine, and four artificial modifications of these made longer by the addition of an extra benzene ring: xA, xC, xG, and xT....
     is considered. xDNA contains expanded bases, in which a benzine ring has been added, which may pair with canon bases, resulting in four possible base-pairs (8 bases:xA-T,xT-A,xC-G,xG-C, 16 bases if the unused arrangements are used). Another form of benzine added bases is yDNA, in which the base is widened by the benzine.


Novel basepairs with special proprieties
 
A F-Z base: methylbenzimidazole does not form intermolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s with toluene F/F
An S-Pa base: purine thienyl/amine forms three intermolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s with pyrrole -/carbaldehyde
An xA-T base: same bonding as A-T 


See also

  • Molecular biology
    Molecular biology

    Molecular biology is the study of biology at a molecule level. The field overlaps with other areas of biology and chemistry, particularly genetics and biochemistry....
  • Genetics
    Genetics

    Genetics , a discipline of biology, is the science of heredity and Genetic variation in living organisms. The fact that living things inherit traits from their parents has been used since prehistoric times to improve crop plants and animals through selective breeding....
  • Synthetic biology
    Synthetic biology

    Synthetic biology is a new area of biology research that combines science and engineering in order to design and build novel biological functions and systems....
  • Oligonucleotide synthesis
    Oligonucleotide synthesis

    Oligonucleotide synthesis is the non-biological, chemical synthesis of defined short sequences of nucleic acids. It is extremely useful in laboratory practice covering a wide range of molecular biology applications and in pharmaceutical research....
  • Nucleobase
    Nucleobase

    Nucleobases are the parts of DNA and RNA that may be involved in pairing . The main ones are cytosine, guanine, adenine , thymine and uracil , abbreviated as C, G, A, T, and U, respectively....
    , nucleoside
    Nucleoside

    Nucleosides are glycosylamines consisting of a nucleobase bound to a ribose or deoxyribose sugar. Examples of these include cytidine, uridine, adenosine, guanosine, thymidine and inosine....
    , nucleotide
    Nucleotide

    Nucleotides are molecules that comprise the structural units of RNA and DNA. Additionally, nucleotides play central roles in metabolism. In that capacity, they serve as sources of chemical energy , participate in cell signaling , and are incorporated into important cofactors of enzymatic reactions ....
    , and nucleic acid
    Nucleic acid

    A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
  • Biotin
    Biotin

    Biotin, also known as vitamin H or B7, has the chemical formula C10H16N2O3S , is a water-soluble B-complex vitamin which is composed of an ureido ring fused with a tetrahydrothiophene ring....
    , fluorophore
    Fluorophore

    A fluorophore, in analogy to a chromophore, is a component of a molecule which causes a molecule to be fluorescent. It is a functional group in a molecule which will absorb energy of a specific wavelength and re-emit energy at a different wavelength....
     and dark quencher
    Dark quencher

    A Quenching_ is a substance that absorbs excitation energy from a fluorophore; a typical quencher re-emits much of this energy as light; a dark quencher dissipates the energy as heat....
  • Ribozyme
    Ribozyme

    A ribozyme is an RNA molecule that catalyzes a chemical reaction. Many natural ribozymes catalyze either the hydrolysis of one of their own phosphodiester bonds, or the hydrolysis of bonds in other RNAs, but they have also been found to catalyze the aminotransferase activity of the ribosome....