All Topics  
DNA

 

   Email Print
   Bookmark   Link






 

DNA



 
 
Deoxyribonucleic acid (DNA) is a nucleic acid
Nucleic acid

A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
 that contains the genetic
Genetics

Genetics , a discipline of biology, is the science of heredity and Genetic variation in living organisms. The fact that living things inherit traits from their parents has been used since prehistoric times to improve crop plants and animals through selective breeding....
 instructions used in the development and functioning of all known living organism
Organism

In biology, an organism is any life thing . In at least some form, all organisms are capable of response to stimulus , reproduction, growth and developmental biology, and maintenance of homeostasis as a stable whole....
s and some virus
Virus

A virus is a Optical microscope#Limitations of light microscopes infectious agent that is unable to grow or reproduce outside a host cell . Viruses infect all cellular life....
es. The main role of DNA molecule
Molecule

In chemistry, a molecule is defined as a sufficiently stable, electric charge neutral group of at least two atoms in a definite arrangement held together by very strong chemical bonds....
s is the long-term storage of information
Information

Information as a Conveyed concept has a diversity of meanings, from everyday usage to technical settings. Generally speaking, the concept of information is closely related to notions of constraint, communication, control system, data, form, instruction, knowledge, Meaning , stimulation, pattern, perception, and knowledge representation....
. DNA is often compared to a set of blueprint
Blueprint

A blueprint is a type of paper-based reproduction usually of a technical drawing, documenting an architecture or an engineering design. More generally, the term "blueprint" has come to be used to refer to any detailed plan....
s or a recipe, or a code, since it contains the instructions needed to construct other components of cells
Cell (biology)

The cell is the structural and functional unit of all known Life organisms. It is the smallest unit of an organism that is classified as living, and is often called the building bricks of life....
, such as protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
s and RNA
RNA

Ribonucleic acid is a type of molecule that consists of a long chain of nucleotide units. Each nucleotide consists of a nucleobase, a ribose sugar, and a phosphate....
 molecules.






Discussion
Ask a question about 'DNA'
Start a new discussion about 'DNA'
Answer questions from other users
Full Discussion Forum



Recent Posts









Encyclopedia


Dna Overview
Deoxyribonucleic acid (DNA) is a nucleic acid
Nucleic acid

A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
 that contains the genetic
Genetics

Genetics , a discipline of biology, is the science of heredity and Genetic variation in living organisms. The fact that living things inherit traits from their parents has been used since prehistoric times to improve crop plants and animals through selective breeding....
 instructions used in the development and functioning of all known living organism
Organism

In biology, an organism is any life thing . In at least some form, all organisms are capable of response to stimulus , reproduction, growth and developmental biology, and maintenance of homeostasis as a stable whole....
s and some virus
Virus

A virus is a Optical microscope#Limitations of light microscopes infectious agent that is unable to grow or reproduce outside a host cell . Viruses infect all cellular life....
es. The main role of DNA molecule
Molecule

In chemistry, a molecule is defined as a sufficiently stable, electric charge neutral group of at least two atoms in a definite arrangement held together by very strong chemical bonds....
s is the long-term storage of information
Information

Information as a Conveyed concept has a diversity of meanings, from everyday usage to technical settings. Generally speaking, the concept of information is closely related to notions of constraint, communication, control system, data, form, instruction, knowledge, Meaning , stimulation, pattern, perception, and knowledge representation....
. DNA is often compared to a set of blueprint
Blueprint

A blueprint is a type of paper-based reproduction usually of a technical drawing, documenting an architecture or an engineering design. More generally, the term "blueprint" has come to be used to refer to any detailed plan....
s or a recipe, or a code, since it contains the instructions needed to construct other components of cells
Cell (biology)

The cell is the structural and functional unit of all known Life organisms. It is the smallest unit of an organism that is classified as living, and is often called the building bricks of life....
, such as protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
s and RNA
RNA

Ribonucleic acid is a type of molecule that consists of a long chain of nucleotide units. Each nucleotide consists of a nucleobase, a ribose sugar, and a phosphate....
 molecules. The DNA segments that carry this genetic information are called gene
Gene

A gene is the basic unit of heredity in a living organism. All living things depend on genes. Genes hold the information to build and maintain their cell and pass genetic trait to offspring....
s, but other DNA sequences have structural purposes, or are involved in regulating the use of this genetic information.

Chemically, DNA consists of two long polymers of simple units called nucleotide
Nucleotide

Nucleotides are molecules that comprise the structural units of RNA and DNA. Additionally, nucleotides play central roles in metabolism. In that capacity, they serve as sources of chemical energy , participate in cell signaling , and are incorporated into important cofactors of enzymatic reactions ....
s, with backbones made of sugars and phosphate groups joined by ester
Ester

An ester is an often Aroma compound organic chemistry or partially organic compound formed by the reaction between an acid and an alcohol or aromatic alcohol with the elimination of water....
 bonds. These two strands run in opposite directions to each other and are therefore anti-parallel
Antiparallel (biochemistry)

In biochemistry, two molecules are antiparallel if they run side-by-side in opposite directions.In DNA, the 5' carbon is located at the top of the leading strand, and the 3' carbon is located at the lower section of the lagging strand....
. Attached to each sugar is one of four types of molecules called bases
Nucleobase

Nucleobases are the parts of DNA and RNA that may be involved in pairing . The main ones are cytosine, guanine, adenine , thymine and uracil , abbreviated as C, G, A, T, and U, respectively....
. It is the sequence of these four bases along the backbone that encodes information. This information is read using the genetic code
Genetic code

The genetic code is the set of rules by which information encoded in genetic material is Translation into proteins by living cell s. The code defines a mapping between tri-nucleotide sequences, called codons, and amino acids....
, which specifies the sequence of the amino acid
Amino acid

In chemistry, an amino acid is a molecule containing both amine and carboxyl functional groups. These molecules are particularly important in biochemistry, where this term refers to alpha-amino acids with the general formula H2NCHRCOOH, where R is an organic substituent....
s within proteins. The code is read by copying stretches of DNA into the related nucleic acid RNA, in a process called transcription
Transcription (genetics)

Transcription is the synthesis of RNA under the direction of DNA. RNA synthesis, or transcription, is the process of transcribing DNA nucleotide sequence information into RNA sequence information....
.

Within cells, DNA is organized into structures called chromosome
Chromosome

A chromosome is an organized structure of DNA and protein that is found in Cell . A chromosome is a single piece of DNA that contains many genes, regulatory sequence and other genetic sequence....
s. These chromosomes are duplicated before cells divide
Cell division

Cell division is a process by which a cell , called the parent cell, divides into two or more cells, called daughter cells. Cell division is usually a small segment of a larger cell cycle....
, in a process called DNA replication
DNA replication

DNA replication, the basis for heredity, is a fundamental process occurring in all living organisms to copy their DNA. This process is "semiconservative replication" in that each strand of the original double-stranded DNA molecule serves as template for the reproduction of the complementary strand....
. Eukaryotic organisms
Eukaryote

Animals, plants, fungus, and protists are eukaryotes , organisms whose Cell are organized into complex structures enclosed within Cell membrane....
 (animal
Animal

Animals are a major group of multicellular, eukaryotic organisms of the Kingdom Animalia or Metazoa. Their body plan eventually becomes fixed as they develop, although some undergo a process of metamorphosis later on in their life....
s, plant
Plant

Plants are Life organisms belonging to the Kingdom Plantae. They include familiar organisms such as trees, herbs, bushes, grasses, vines, ferns, mosses, and green algae....
s, fungi
Fungus

A fungus is a Eukaryote organism that is a member of the Kingdom Fungi . The fungi are a monophyletic group, also called the Eumycota , that is phylogeny distinct from the morphologically similar slime molds and water molds ....
, and protist
Protist

Protists ; eukaryote microorganisms. Historically, protists were treated as the kingdom Protista but this group is no longer recognized in modern taxonomy....
s) store their DNA inside the cell nucleus
Cell nucleus

In cell biology, the nucleus , also sometimes referred to as the "control center", is a membrane-enclosed organelle found in all eukaryote cell ....
, while in prokaryote
Prokaryote

The prokaryotes are a group of organisms that lack a cell nucleus , or any other cell membrane-bound organelles. They differ from the eukaryotes, which have a cell nucleus....
s (bacteria
Bacteria

The Bacteria are a large group of unicellular microorganisms. Typically a few micrometres in length, bacteria have a wide range of shapes, ranging from spheres to rods and spirals....
 and archaea
Archaea

The Archaea are a group of single-celled microorganisms. A single individual or species from this domain is called an archaeon . Archaea, like bacteria, are prokaryotic....
) it is found in the cell's cytoplasm
Cytoplasm

The cytoplasm is the part of a Cell that is enclosed within the plasma membrane. In eukaryote cells the cytoplasm contains organelles, such as mitochondrion, that are filled with liquid kept separate from the rest of the cytoplasm by biological membranes....
. Within the chromosomes, chromatin
Chromatin

Chromatin is the complex combination of DNA, RNA, and protein that makes up chromosomes. It is found inside the cell nucleus of Eukaryote cell , and within the nucleoid in prokaryotic cells....
 proteins such as histone
Histone

In biology, histones are the chief protein components of chromatin. They act as spools around which DNA winds, and they play a role in gene regulation....
s compact and organize DNA. These compact structures guide the interactions between DNA and other proteins, helping control which parts of the DNA are transcribed.

Properties


DNA is a long polymer
Polymer

A polymer is a large molecule composed of repeating structural units typically connected by covalent chemical bonds. While polymer in popular usage suggests plastic, the term actually refers to a large class of natural and synthetic materials with a variety of properties....
 made from repeating units called nucleotide
Nucleotide

Nucleotides are molecules that comprise the structural units of RNA and DNA. Additionally, nucleotides play central roles in metabolism. In that capacity, they serve as sources of chemical energy , participate in cell signaling , and are incorporated into important cofactors of enzymatic reactions ....
s. The DNA chain is 22 to 26 Ångström
Ångström

An ?ngstr?m or angstrom is an internationally recognized non-SI unit of length equal to 0.1 nanometre or 1 metres. It is sometimes used in expressing the sizes of atoms, lengths of chemical bonds and optical spectrum, and dimensions of parts of integrated circuits....
s wide (2.2 to 2.6 nanometre
Nanometre

A nanometre is a Units of measurement of length in the metric system, equal to one billionth of a metre .It is one of the more often used units for very small lengths, and equals ten ?ngstr?m, an internationally recognized non-International System of Units of length....
s), and one nucleotide unit is 3.3 Å (0.33 nm) long. Although each individual repeating unit is very small, DNA polymers can be very large molecules containing millions of nucleotides. For instance, the largest human chromosome, chromosome number 1, is approximately 220 million base pair
Base pair

In molecular biology, two nucleotides on opposite complementarity DNA or RNA strands that are connected via hydrogen bonds are called a base pair ....
s long.

In living organisms, DNA does not usually exist as a single molecule, but instead as a tightly-associated pair of molecules. These two long strands entwine like vines, in the shape of a double helix
Double helix

In geometry a double helix typically consists of two congruence helix with the same axis, differing by a translation along the axis, which may or may not be half-way....
. The nucleotide repeats contain both the segment of the backbone of the molecule, which holds the chain together, and a base, which interacts with the other DNA strand in the helix. In general, a base linked to a sugar is called a nucleoside
Nucleoside

Nucleosides are glycosylamines consisting of a nucleobase bound to a ribose or deoxyribose sugar. Examples of these include cytidine, uridine, adenosine, guanosine, thymidine and inosine....
 and a base linked to a sugar and one or more phosphate groups is called a nucleotide
Nucleotide

Nucleotides are molecules that comprise the structural units of RNA and DNA. Additionally, nucleotides play central roles in metabolism. In that capacity, they serve as sources of chemical energy , participate in cell signaling , and are incorporated into important cofactors of enzymatic reactions ....
. If multiple nucleotides are linked together, as in DNA, this polymer is called a polynucleotide
Polynucleotide

A polynucleotide molecule is an organic chemistry polymer molecule composed of nucleotide monomers covalently bonded in a chain. DNA and RNA are examples of polynucleotides with distinct biological function....
.

The backbone of the DNA strand is made from alternating phosphate
Phosphate

A phosphate, an inorganic chemical, is a Salt of phosphoric acid. Inorganic phosphates are mining to obtain phosphorus for use in agriculture and industry....
 and sugar
Carbohydrate

Carbohydrates or saccharides are the most abundant of the four major classes of biomolecules. They fill numerous roles in living things, such as the storage and transport of energy and structural components ....
 residues. The sugar in DNA is 2-deoxyribose, which is a pentose
Pentose

A pentose is a monosaccharide with five carbon atoms.They either have an aldehyde functional group in position 1 , or a ketone functional group in position 2 ....
 (five-carbon
Carbon

Carbon is a chemical element with chemical symbol C and atomic number 6. As a member of group 14 on the periodic table, it is nonmetallic and tetravalence?making four electrons available to form covalent bond chemical bonds....
) sugar. The sugars are joined together by phosphate groups that form phosphodiester bond
Phosphodiester bond

A phosphodiester bond is a group of strong covalent bond between the Phosphorus in a phosphate group and two other molecules over two ester bonds....
s between the third and fifth carbon atom
Atom

|-! bgcolor=gray | Properties|-||}The atom is a basic unit of matter consisting of a dense, central atomic nucleus surrounded by a electron cloud of electric charge electrons....
s of adjacent sugar rings. These asymmetric bonds
Covalent bond

A covalent bond is a form of chemical bonding that is characterized by the sharing of pairs of electrons between atoms, or between atoms and other covalent bonds....
 mean a strand of DNA has a direction. In a double helix the direction of the nucleotides in one strand is opposite to their direction in the other strand. This arrangement of DNA strands is called antiparallel. The asymmetric ends of DNA strands are referred to as the 5'
Directionality (molecular biology)

Directionality, in molecular biology, refers to the end-to-end chemical orientation of a single strand of nucleic acid. The chemical convention of naming carbon atoms in the nucleotide Furanose numerically gives rise to a 5' end and a 3' end ....
 (five prime) and 3'
Directionality (molecular biology)

Directionality, in molecular biology, refers to the end-to-end chemical orientation of a single strand of nucleic acid. The chemical convention of naming carbon atoms in the nucleotide Furanose numerically gives rise to a 5' end and a 3' end ....
 (three prime) ends, with the 5' end being that with a terminal phosphate group and the 3' end that with a terminal hydroxyl group. One of the major differences between DNA and RNA is the sugar, with 2-deoxyribose being replaced by the alternative pentose sugar ribose
Ribose

Ribose, primarily occurring as D-ribose, is an organic compound that occurs widely in nature. It is an aldopentose, that is a monosaccharide containing five carbon atoms that, in its acyclic form, has an aldehyde functional group at one end....
 in RNA.

The DNA double helix is stabilized by hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s between the bases attached to the two strands. The four bases found in DNA are adenine
Adenine

Adenine is a nucleobase with a variety of roles in biochemistry including cellular respiration, in the form of both the energy-rich adenosine triphosphate and the cofactor s nicotinamide adenine dinucleotide and flavin adenine dinucleotide , and Protein biosynthesis, as a chemical component of DNA and RNA....
 (abbreviated A), cytosine
Cytosine

Cytosine is one of the five main bases found in DNA and RNA. It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached ....
 (C), guanine
Guanine

Guanine is one of the five main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, thymine, and uracil. In DNA, guanine is paired with cytosine....
 (G) and thymine
Thymine

Thymine is one of the four bases in the nucleic acid of DNA that make up the letters GCAT. The others are adenine, guanine, and cytosine. Thymine always pairs with adenine....
 (T). These four bases are attached to the sugar/phosphate to form the complete nucleotide, as shown for adenosine monophosphate.

These bases are classified into two types; adenine and guanine are fused five- and six-membered heterocyclic compound
Heterocyclic compound

Heterocyclic compounds are organic compounds containing at least one atom of carbon, and at least one element other than carbon, such as sulfur, oxygen or nitrogen within a ring structure....
s called purine
Purine

Purine is a heterocyclic compound aromatic organic compound, consisting of a pyrimidine ring fused to an imidazole ring. Purines, including substituted purines and their tautomers, are the most widely distributed kind of nitrogen-containing heterocycle in nature....
s, while cytosine and thymine are six-membered rings called pyrimidine
Pyrimidine

Pyrimidine is a heterocyclic aromatic organic compound similar to benzene and pyridine, containing two nitrogen atoms at positions 1 and 3 of the six-member ring....
s. A fifth pyrimidine base, called uracil
Uracil

Uracil is a common and naturally occurring pyrimidine derivative. Originally discovered in 1900, it was isolated by hydrolysis of yeast nuclein that was found in bovine thymus and spleen, herring, sperm, and wheat germ....
 (U), usually takes the place of thymine in RNA and differs from thymine by lacking a methyl group
Methyl group

In chemistry, a methyl group is a hydrophobic alkyl functional group named after methane . It has the chemical formula -3 and is often abbreviated -Me....
 on its ring. Uracil is not usually found in DNA, occurring only as a breakdown product of cytosine.

Grooves


Normally, the double helix is a right-handed spiral. As the DNA strands wind around each other, they leave gaps between each set of phosphate backbones, revealing the sides of the bases inside (see animation). There are two of these grooves twisting around the surface of the double helix: one groove, the major groove, is 22 Å wide and the other, the minor groove, is 12 Å wide. The narrowness of the minor groove means that the edges of the bases are more accessible in the major groove. As a result, proteins like transcription factor
Transcription factor

In the field of molecular biology, a transcription factor is a protein that binds to specific DNA sequence and thereby controls the transfer of genetic information from DNA to RNA....
s that can bind to specific sequences in double-stranded DNA usually make contacts to the sides of the bases exposed in the major groove. This situation varies in unusual conformations of DNA within the cell (see below), but the major and minor grooves are always named to reflect the differences in size that would be seen if the DNA is twisted back into the ordinary B form.

Base pairing


Each type of base on one strand forms a bond with just one type of base on the other strand. This is called complementary base pair
Base pair

In molecular biology, two nucleotides on opposite complementarity DNA or RNA strands that are connected via hydrogen bonds are called a base pair ....
ing. Here, purines form hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s to pyrimidines, with A bonding only to T, and C bonding only to G. This arrangement of two nucleotides binding together across the double helix is called a base pair. As hydrogen bonds are not covalent
Covalent bond

A covalent bond is a form of chemical bonding that is characterized by the sharing of pairs of electrons between atoms, or between atoms and other covalent bonds....
, they can be broken and rejoined relatively easily. The two strands of DNA in a double helix can therefore be pulled apart like a zipper, either by a mechanical force or high temperature
Temperature

In physics, temperature is a physical property of a Physical system that underlies the common notions of hot and cold; something that feels hotter generally has the greater temperature....
. As a result of this complementarity, all the information in the double-stranded sequence of a DNA helix is duplicated on each strand, which is vital in DNA replication. Indeed, this reversible and specific interaction between complementary base pairs is critical for all the functions of DNA in living organisms.

The two types of base pairs form different numbers of hydrogen bonds, AT forming two hydrogen bonds, and GC forming three hydrogen bonds (see figures, left). DNA with high GC-content
GC-content

GC-content , in molecular biology, is the percentage of nitrogenous bases on a DNA molecule which are either guanine or cytosine . This may refer to a specific fragment of DNA or RNA, or that of the Genome....
 is more stable than DNA with low GC-content, but contrary to popular belief, this is not due to the extra hydrogen bond of a GC basepair but rather the contribution of stacking interactions (hydrogen bonding merely provides specificity of the pairing, not stability). As a result, it is both the percentage of GC base pairs and the overall length of a DNA double helix that determine the strength of the association between the two strands of DNA. Long DNA helices with a high GC content have stronger-interacting strands, while short helices with high AT content have weaker-interacting strands. In biology, parts of the DNA double helix that need to separate easily, such as the TATAAT Pribnow box
Pribnow box

The Pribnow box is the sequence TATAAT of six nucleotides that is an essential part of a promoter site on DNA for Transcription to occur in bacteria....
 in some promoter
Promoter

In biology, a promoter is a region of DNA that facilitates the Transcription of a particular gene. Promoters are typically located near the genes they regulate, on the same strand and Upstream and downstream ....
s, tend to have a high AT content, making the strands easier to pull apart. In the laboratory, the strength of this interaction can be measured by finding the temperature required to break the hydrogen bonds, their melting temperature (also called Tm value). When all the base pairs in a DNA double helix melt, the strands separate and exist in solution as two entirely independent molecules. These single-stranded DNA molecules have no single common shape, but some conformations are more stable than others.

Sense and antisense


A DNA sequence is called "sense" if its sequence is the same as that of a messenger RNA
Messenger RNA

Messenger ribonucleic acid is a molecule of RNA encoding a chemical "blueprint" for a protein product. mRNA is transcription from a DNA template, and carries coding information to the sites of protein synthesis: the ribosomes....
 copy that is translated into protein. The sequence on the opposite strand is called the "antisense" sequence. Both sense and antisense sequences can exist on different parts of the same strand of DNA (i.e. both strands contain both sense and antisense sequences). In both prokaryotes and eukaryotes, antisense RNA sequences are produced, but the functions of these RNAs are not entirely clear. One proposal is that antisense RNAs are involved in regulating gene expression
Gene expression

Gene expression is the process by which inheritable information from a gene, such as the DNA sequence, is made into a functional gene product, such as protein or RNA....
 through RNA-RNA base pairing.

A few DNA sequences in prokaryotes and eukaryotes, and more in plasmid
Plasmid

File:plasmid .svgA plasmid is an extra-chromosomal DNA molecule separate from the chromosome which is capable of replicating independently of the chromosomal DNA....
s and virus
Virus

A virus is a Optical microscope#Limitations of light microscopes infectious agent that is unable to grow or reproduce outside a host cell . Viruses infect all cellular life....
es, blur the distinction between sense and antisense strands by having overlapping genes. In these cases, some DNA sequences do double duty, encoding one protein when read along one strand, and a second protein when read in the opposite direction along the other strand. In bacteria
Bacteria

The Bacteria are a large group of unicellular microorganisms. Typically a few micrometres in length, bacteria have a wide range of shapes, ranging from spheres to rods and spirals....
, this overlap may be involved in the regulation of gene transcription, while in viruses, overlapping genes increase the amount of information that can be encoded within the small viral genome.

Supercoiling

DNA can be twisted like a rope in a process called DNA supercoil
DNA supercoil

In a "relaxed" double-helical segment of DNA, the two strands twist around the helical axis once every 10.4-10.5 base pairs of DNA sequence. Adding or subtracting twists, as some enzymes can do, imposes strain....
ing. With DNA in its "relaxed" state, a strand usually circles the axis of the double helix once every 10.4 base pairs, but if the DNA is twisted the strands become more tightly or more loosely wound. If the DNA is twisted in the direction of the helix, this is positive supercoiling, and the bases are held more tightly together. If they are twisted in the opposite direction, this is negative supercoiling, and the bases come apart more easily. In nature, most DNA has slight negative supercoiling that is introduced by enzyme
Enzyme

Enzymes are biomolecules that catalysis chemical reactions. Almost all enzymes are proteins. In enzymatic reactions, the molecules at the beginning of the process are called Substrate , and the enzyme converts them into different molecules, the products....
s called topoisomerase
Topoisomerase

Topoisomerases are isomerase enzymes that act on the DNA topology of DNA....
s. These enzymes are also needed to relieve the twisting stresses introduced into DNA strands during processes such as transcription
Transcription (genetics)

Transcription is the synthesis of RNA under the direction of DNA. RNA synthesis, or transcription, is the process of transcribing DNA nucleotide sequence information into RNA sequence information....
 and DNA replication
DNA replication

DNA replication, the basis for heredity, is a fundamental process occurring in all living organisms to copy their DNA. This process is "semiconservative replication" in that each strand of the original double-stranded DNA molecule serves as template for the reproduction of the complementary strand....
.

Alternative structures


DNA exists in many possible conformations
Conformational isomerism

In chemistry, conformational isomerism is a form of stereoisomerism in which molecules with the same structural formula exist as different conformational isomers or conformers in 3-D due to rotations about one or more sigma bond....
. However, only A-DNA
A-DNA

A-DNA is one of the many possible double helical structures of DNA. A-DNA is thought to be one of three biologically active double helical structures along with B-DNA and Z-DNA....
, B-DNA, and Z-DNA
Z-DNA

Z-DNA is one of the many possible double helical structures of DNA. It is a left-handed double helical structure in which the double helix winds to the left in a zig-zag pattern ....
 have been observed in organisms. Which conformation DNA adopts depends on the sequence of the DNA, the amount and direction of supercoiling, chemical modifications of the bases and also solution conditions, such as the concentration of metal
Metal

In chemistry, a metal is a chemical element whose atoms readily lose electrons to form positive ions , and form metallic bonds between other metal atoms and ionic bonds between nonmetal atoms....
 ion
Ion

An ion is an atom or molecule which has lost or gained one or more electrons, giving it a positive or negative electrical charge. According to the Bohr_model this will be from or in the outer shield 'n'....
s and polyamine
Polyamine

The polyamines are organic compounds having two or more primary amino groups - such as putrescine, cadaverine, spermidine, and spermine - that are essential molecules in both eukaryotic and prokaryotic cells....
s. Of these three conformations, the "B" form described above is most common under the conditions found in cells. The two alternative double-helical forms of DNA differ in their geometry and dimensions.

The A form is a wider right-handed spiral, with a shallow, wide minor groove and a narrower, deeper major groove. The A form occurs under non-physiological conditions in dehydrated samples of DNA, while in the cell it may be produced in hybrid pairings of DNA and RNA strands, as well as in enzyme-DNA complexes. Segments of DNA where the bases have been chemically-modified by methylation
Methylation

Methylation in the chemical sciences denotes the attachment or substitution of a methyl on various Substrate . This term is commonly used in chemistry, biochemistry, soil science and the biological sciences....
 may undergo a larger change in conformation and adopt the Z form
Z-DNA

Z-DNA is one of the many possible double helical structures of DNA. It is a left-handed double helical structure in which the double helix winds to the left in a zig-zag pattern ....
. Here, the strands turn about the helical axis in a left-handed spiral, the opposite of the more common B form. These unusual structures can be recognized by specific Z-DNA binding proteins and may be involved in the regulation of transcription.

Quadruplex structures


At the ends of the linear chromosomes are specialized regions of DNA called telomere
Telomere

A telomere is a region of repetitive DNA at the end of chromosomes, which protects the end of the chromosome from destruction. Its name is derived from the Greek nouns telos "end" and mer?s "part"....
s. The main function of these regions is to allow the cell to replicate chromosome ends using the enzyme telomerase
Telomerase

Telomerase is an enzyme that adds specific DNA sequence repeats to the 3' end of DNA strands in the telomere regions, which are found at the ends of eukaryote chromosomes....
, as the enzymes that normally replicate DNA cannot copy the extreme 3' ends of chromosomes. These specialized chromosome caps also help protect the DNA ends, and stop the DNA repair
DNA repair

DNA repair refers to a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolism activities and environmental factors such as UV light and Radiation can cause DNA damage, resulting in as many as 1 million individual molecular lesions per cell pe...
 systems in the cell from treating them as damage to be corrected. In human cells, telomeres are usually lengths of single-stranded DNA containing several thousand repeats of a simple TTAGGG sequence.

These guanine-rich sequences may stabilize chromosome ends by forming structures of stacked sets of four-base units, rather than the usual base pairs found in other DNA molecules. Here, four guanine bases form a flat plate and these flat four-base units then stack on top of each other, to form a stable G-quadruplex
G-quadruplex

Nucleic acid sequences which are rich in guanine are capable of forming four-stranded structures called G-quadruplexes . These consist of a square arrangement of guanines , stabilized by Hoogsteen base pair hydrogen bonding....
 structure. These structures are stabilized by hydrogen bonding between the edges of the bases and chelation
Chelation

Chelation is the binding or complex of a bi- or multidentate ligand. These ligands, which are often organic compounds, are called chelants, chelators, chelating agents, or sequestration....
 of a metal ion in the centre of each four-base unit. Other structures can also be formed, with the central set of four bases coming from either a single strand folded around the bases, or several different parallel strands, each contributing one base to the central structure.

In addition to these stacked structures, telomeres also form large loop structures called telomere loops, or T-loops. Here, the single-stranded DNA curls around in a long circle stabilized by telomere-binding proteins. At the very end of the T-loop, the single-stranded telomere DNA is held onto a region of double-stranded DNA by the telomere strand disrupting the double-helical DNA and base pairing to one of the two strands. This triple-stranded
Triple-stranded DNA

A triple-stranded DNA is a structure of DNA in which three oligonucleotides wind around each other and form a triple helix. In this structure, one strand binds to a DNA double helix through Hoogsteen base pair....
 structure is called a displacement loop or D-loop
D-loop

In molecular biology, a displacement loop or D-loop is a DNA structure where the two strands of a double-stranded DNA molecule are separated for a stretch and held apart by a third strand of DNA....
.

Branched DNA

In DNA fraying
DNA end

DNA end or sticky end refers to the properties of the end of a molecule of DNA. The concept is important in molecular biology, especially in Clone or when subcloning insert DNA into vector DNA....
 occurs when non-complimentary regions exist at the end of an otherwise complimentary double-strand of DNA. However, branched DNA can occur if a third strand of DNA is introduced and contains adjoining regions able to hybridize with the frayed regions of the pre-existing double-strand. Although the simplest example of branched DNA involves only three strands of DNA, complexes involving additional strands and multiple branches are also possible.

Chemical modifications


Base modifications

The expression of genes is influenced by how the DNA is packaged in chromosomes, in a structure called chromatin
Chromatin

Chromatin is the complex combination of DNA, RNA, and protein that makes up chromosomes. It is found inside the cell nucleus of Eukaryote cell , and within the nucleoid in prokaryotic cells....
. Base modifications can be involved in packaging, with regions that have low or no gene expression usually containing high levels of methylation
Methylation

Methylation in the chemical sciences denotes the attachment or substitution of a methyl on various Substrate . This term is commonly used in chemistry, biochemistry, soil science and the biological sciences....
 of cytosine
Cytosine

Cytosine is one of the five main bases found in DNA and RNA. It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached ....
 bases. For example, cytosine methylation, produces 5-methylcytosine
5-Methylcytosine

5-Methylcytosine is a methylation form of cytosine in which a methyl group is attached to carbon 5, altering its structure without altering its base-pairing properties....
, which is important for X-chromosome inactivation
X-inactivation

X-inactivation is a process by which one of the two copies of the X chromosome present in female mammals is inactivated. The inactive X chromosome is silenced by packaging into transcriptionally inactive heterochromatin....
. The average level of methylation varies between organisms - the worm Caenorhabditis elegans
Caenorhabditis elegans

'Caenorhabditis elegans' is a free-living, transparent nematode , about 1 mm in length, which lives in temperate soil environments. Research into the molecular biology and developmental biology of C....
 lacks cytosine methylation, while vertebrate
Vertebrate

Vertebrates are members of the subphylum Vertebrata, chordates with Vertebras or Vertebral columns. The grouping sometimes includes the hagfish, which have no vertebrae, but are genetically quite closely related to lampreys, which do have vertebrae....
s have higher levels, with up to 1% of their DNA containing 5-methylcytosine. Despite the importance of 5-methylcytosine, it can deaminate
Deamination

Deamination is the removal of an amine group from a molecule.In the human body, deamination takes place in the liver. Deamination is the process by which amino acids are broken down when too much protein has been taken in....
 to leave a thymine base, methylated cytosines are therefore particularly prone to mutation
Mutation

In biology, mutations are changes to the nucleotide sequence of the genetic material of an organism. Mutations can be caused by copying errors in the genetic material during cell division, by exposure to ultraviolet or ionizing radiation, chemical mutagens, or virus , or can be induced by the organism, itself, by cellular processes such as s...
s. Other base modifications include adenine methylation in bacteria and the glycosylation
Glycosylation

Glycosylation is the enzymatic process that links saccharides to produce glycans, either free or attached to proteins and lipids. This enzymatic process produces one of four fundamental components of all cells and also provides a co-translational and post-translational modification mechanism that modulates the structure and function of membr...
 of uracil to produce the "J-base" in kinetoplastid
Kinetoplastid

The kinetoplastids are a group of flagellate protozoa, including a number of parasites responsible for serious diseases in humans and other animals, as well as various forms found in soil and aquatic environments....
s.

Damage


DNA can be damaged by many different sorts of mutagen
Mutagen

In biology, a mutagen is a physical or chemical agent that changes the genetic information of an organism and thus increases the frequency of mutations above the natural background level....
s, which change the DNA sequence. Mutagens include oxidizing agent
Oxidizing agent

An oxidizing agent can be defined as either:#a chemical compound that readily transfers oxygen atoms, or#a substance that gains electrons in a redox chemical reaction...
s, alkylating agents
Alkylation

Alkylation is the transfer of an alkyl group from one molecule to another. The alkyl group may be transferred as an alkyl carbocation, a free radical, a carbanion or a carbene ....
 and also high-energy electromagnetic radiation
Electromagnetic radiation

Electromagnetic radiation takes the form of wave propagation waves in a vacuum or in matter. EM radiation has an electric field and magnetic field component which oscillate in phase perpendicular to each other and to the direction of energy Wave propagation....
 such as ultraviolet
Ultraviolet

Ultraviolet light is electromagnetic radiation with a wavelength shorter than that of visible light, but longer than x-rays, in the range 400 nanometer to 10 nm, and energies from 3 Electron volt to 124 eV....
 light and X-ray
X-ray

X-radiation is a form of electromagnetic radiation. X-rays have a wavelength in the range of 10 to 0.01 nanometers, corresponding to frequency in the range 30 Hertz to 30 Hertz and energies in the range 120 Electron volt to 120 keV....
s. The type of DNA damage produced depends on the type of mutagen. For example, UV light can damage DNA by producing thymine dimer
Thymine dimer

A thymine dimer is the covalent bonding of two adjacent thymine residues within a DNA molecule, often catalyzed by ultraviolet radiation or chemical mutagenic agents....
s, which are cross-links between pyrimidine bases. On the other hand, oxidants such as free radicals
Radical (chemistry)

In chemistry, radicals are atoms, molecules or ions with unpaired electrons on an otherwise open shell configuration. These unpaired electrons are usually highly chemical reaction, so radicals are likely to take part in chemical reactions....
 or hydrogen peroxide
Hydrogen peroxide

Hydrogen peroxide is a very pale blue liquid which appears colorless in a dilute solution, slightly more viscous than water. It is a weak acid....
 produce multiple forms of damage, including base modifications, particularly of guanosine, and double-strand breaks. In each human cell, about 500 bases suffer oxidative damage per day. Of these oxidative lesions, the most dangerous are double-strand breaks, as these are difficult to repair and can produce point mutation
Point mutation

A point mutation, or single base substitution, is a type of mutation that causes the replacement of a single base nucleotide with another nucleotide of the genetic material, DNA or RNA....
s, insertions and deletions from the DNA sequence, as well as chromosomal translocation
Chromosomal translocation

In genetics, a chromosome translocation is a chromosome abnormality caused by rearrangement of parts between nonhomologous chromosomes. A fusion gene may be created when the translocation joins two otherwise separated genes, an event which is common in cancer....
s.

Many mutagens fit into the space between two adjacent base pairs, this is called intercalating
Intercalation (chemistry)

In chemistry, intercalation is the reversible inclusion of a molecule between two other molecules . Examples include DNA intercalation and in graphite intercalation compounds....
. Most intercalators are aromatic
Aromaticity

Aromaticity is a chemical property in which a conjugated system ring of unsaturated bonds, lone pairs, or empty orbitals exhibit a stabilization stronger than would be expected by the stabilization of conjugation alone....
 and planar molecules, and include Ethidium bromide
Ethidium bromide

Ethidium bromide is an intercalation agent commonly used as a fluorescent tag in molecular biology laboratories for techniques such as agarose gel electrophoresis....
, daunomycin
Daunorubicin

Daunorubicin or daunomycin is chemotherapy of the anthracycline family that is given as a treatment for some types of cancer. It is most commonly used to treat specific types of leukaemia ....
, and doxorubicin
Doxorubicin

Doxorubicin is a pharmaceutical used in cancer chemotherapy. It is an anthracycline antibiotic, closely related to the natural product daunorubicin, and like all anthracyclines it intercalation DNA....
. In order for an intercalator to fit between base pairs, the bases must separate, distorting the DNA strands by unwinding of the double helix. This inhibits both transcription and DNA replication, causing toxicity and mutations. As a result, DNA intercalators are often carcinogen
Carcinogen

The term carcinogen refers to any substance, radionuclide or radiation that is an agent directly involved in the promotion of cancer or in the increase of its propagation....
s, and benzopyrene diol epoxide
Benzopyrene

Benzo[a]pyrene, C20H12, is a five-ring polycyclic aromatic hydrocarbon that is mutagenic and highly carcinogenic. It is a crystalline yellow solid....
, acridine
Acridine

Acridine, C13H9N, is an organic compound and a heterocyclic compound. Acridine is also used to describe compounds containing the C13N tricycle....
s, aflatoxin
Aflatoxin

Aflatoxins are naturally occurring mycotoxins that are produced by many species of Aspergillus, a fungus, most notably Aspergillus flavus and Aspergillus parasiticus....
 and ethidium bromide
Ethidium bromide

Ethidium bromide is an intercalation agent commonly used as a fluorescent tag in molecular biology laboratories for techniques such as agarose gel electrophoresis....
 are well-known examples. Nevertheless, due to their ability to inhibit DNA transcription and replication, these toxins are also used in chemotherapy
Chemotherapy

Chemotherapy, in its most general sense, refers to treatment of disease by chemicals that kill cells, specifically those of micro-organisms or cancer....
 to inhibit rapidly-growing cancer
Cancer

Cancer is a class of diseases in which a group of cell display uncontrolled growth , invasion , and sometimes metastasis . These three malignant properties of cancers differentiate them from benign tumors, which are self-limited, do not invade or metastasize....
 cells.

Biological functions

DNA usually occurs as linear chromosome
Chromosome

A chromosome is an organized structure of DNA and protein that is found in Cell . A chromosome is a single piece of DNA that contains many genes, regulatory sequence and other genetic sequence....
s in eukaryotes, and circular chromosomes in prokaryotes. The set of chromosomes in a cell makes up its genome
Genome

In classical genetics, the genome of a diploid organism including eukarya refers to a full set of chromosomes or genes in a gamete; thereby, a regular somatic cell contains two full sets of genomes....
; the human genome
Human genome

The human genome is the genome of Homo sapiens, which is stored on 23 chromosome pairs. Twenty-two of these are autosome, while the remaining pair is XY sex-determination system....
 has approximately 3 billion base pairs of DNA arranged into 46 chromosomes. The information carried by DNA is held in the sequence
DNA sequence

A DNA sequence or genetic sequence is a succession of letters representing the primary structure of a real or hypothetical DNA molecule or strand, with the capacity to carry information as described by the central dogma of molecular biology....
 of pieces of DNA called gene
Gene

A gene is the basic unit of heredity in a living organism. All living things depend on genes. Genes hold the information to build and maintain their cell and pass genetic trait to offspring....
s. Transmission
Transmission (genetics)

Genetic transmission is the transfer of genetic information from genes to another generation, or from one location in a cell to another....
 of genetic information in genes is achieved via complementary base pairing. For example, in transcription, when a cell uses the information in a gene, the DNA sequence is copied into a complementary RNA sequence through the attraction between the DNA and the correct RNA nucleotides. Usually, this RNA copy is then used to make a matching protein sequence in a process called translation which depends on the same interaction between RNA nucleotides. Alternatively, a cell may simply copy its genetic information in a process called DNA replication. The details of these functions are covered in other articles; here we focus on the interactions between DNA and other molecules that mediate the function of the genome.

Genes and genomes

Genomic DNA is located in the cell nucleus
Cell nucleus

In cell biology, the nucleus , also sometimes referred to as the "control center", is a membrane-enclosed organelle found in all eukaryote cell ....
 of eukaryotes, as well as small amounts in mitochondria
Mitochondrion

In cell biology, a mitochondrion is a membrane-enclosed organelle found in most eukaryote cell . These organelles range from 0.5–10 micrometers in diameter....
 and chloroplast
Chloroplast

Chloroplasts are organelles found in plant cells and other eukaryote organisms that conduct photosynthesis. Chloroplasts capture light energy to conserve Thermodynamic free energy in the form of Adenosine triphosphate and reduce NADP to NADPH through a complex set of processes called photosynthesis....
s. In prokaryotes, the DNA is held within an irregularly shaped body in the cytoplasm called the nucleoid
Nucleoid

In prokaryotes, the nucleoid is an irregularly-shaped region within the cell of prokaryotes where the genetic material is localized. The genome of prokaryotic organisms generally is a circular, double-stranded piece of DNA, of which multiple copies may exist at any time....
. The genetic information in a genome is held within genes, and the complete set of this information in an organism is called its genotype
Genotype

The genotype is the trait we can't see. The genotype is the Genetics constitution of a cell, an organism, or an individual usually with reference to a specific character under consideration....
. A gene is a unit of heredity
Heredity

Heredity is the passing of traits to offspring . This is the process by which an offspring cell or organism acquires or becomes predisposed to the characteristics of its parent cell or organism....
 and is a region of DNA that influences a particular characteristic in an organism. Genes contain an open reading frame
Open reading frame

An open reading frame is a portion of an organism's genome which contains a sequence of base pair that could potentially translation a protein....
 that can be transcribed, as well as regulatory sequence
Regulatory sequence

A regulatory sequence is a segment of DNA where DNA binding protein such as transcription factors bind preferentially. These regulatory proteins bind to short stretches of DNA called regulatory regions, which are appropriately positioned in the genome, usually a short distance 'upstream' of the gene being regulated....
s such as promoter
Promoter

In biology, a promoter is a region of DNA that facilitates the Transcription of a particular gene. Promoters are typically located near the genes they regulate, on the same strand and Upstream and downstream ....
s and enhancers
Enhancer (genetics)

In genetics, an enhancer is a short region of DNA that can be bound with proteins to enhance transcription levels of genes in a gene cluster....
, which control the transcription of the open reading frame.

In many species
Species

In biology, a species is one of the basic units of biological classification and a taxonomic rank. A species is often defined as a group of organisms capable of interbreeding and producing fertile offspring....
, only a small fraction of the total sequence of the genome
Genome

In classical genetics, the genome of a diploid organism including eukarya refers to a full set of chromosomes or genes in a gamete; thereby, a regular somatic cell contains two full sets of genomes....
 encodes protein. For example, only about 1.5% of the human genome consists of protein-coding exon
Exon

An exon in a gene is a DNA or RNA sequence that is translated into RNA or protein. In contrast, an intron is a DNA sequence in the gene that is not translated....
s, with over 50% of human DNA consisting of non-coding repetitive sequences
Repeated sequence (DNA)

In the study of DNA sequences, one can distinguish two main types of repeated sequence:*Tandem repeats:**Satellite DNA,**Minisatellite,**Microsatellite;...
. The reasons for the presence of so much non-coding DNA
Noncoding DNA

In genetics, non-coding DNA describes DNA which does not contain genetic code for making proteins . In eukaryotes, a large percentage of many organisms' total genome sizes is comprised of noncoding DNA ....
 in eukaryotic genomes and the extraordinary differences in genome size
Genome size

Genome size refers to the total amount of DNA contained within one copy of a genome. It is typically measured in terms of mass in picograms or less frequently in Dalton or as the total number of nucleotide base pairs typically in megabases ....
, or C-value
C-value

The term C-value refers to the amount of DNA contained within a haploid Cell nucleus of a eukaryotic organism. In some cases , the terms C-value and genome size are used interchangeably, however in Polyploidy the C-value may represent two genomes contained within the same nucleus....
, among species represent a long-standing puzzle known as the "C-value enigma
C-value enigma

The C-value enigma or C-value paradox is a term used to describe the complex puzzle surrounding the extensive variation in nuclear genome size among eukaryotic species....
." However, DNA sequences that do not code protein may still encode functional non-coding RNA
Non-coding RNA

A non-coding RNA is a functional RNA molecule that is not Translation into a protein. Less-frequently used synonyms are non-protein-coding RNA , non-messenger RNA , small non-messenger RNA , functional RNA ....
 molecules, which are involved in the regulation of gene expression. Some non-coding DNA sequences play structural roles in chromosomes. Telomere
Telomere

A telomere is a region of repetitive DNA at the end of chromosomes, which protects the end of the chromosome from destruction. Its name is derived from the Greek nouns telos "end" and mer?s "part"....
s and centromere
Centromere

A centromere is a region of DNA typically found near the middle of a chromosome where two sister chromatids come in contact. It is involved in cell division as the point of mitotic spindle....
s typically contain few genes, but are important for the function and stability of chromosomes. An abundant form of non-coding DNA in humans are pseudogene
Pseudogene

Pseudogenes are defunct relatives of known genes that have lost their protein-coding ability or are otherwise no longer gene expression in the cell....
s, which are copies of genes that have been disabled by mutation. These sequences are usually just molecular fossil
Fossil

Fossils are the preserved remains or trace fossil of animals, plants, and other organisms from the remote past. The totality of fossils, both discovered and undiscovered, and their placement in fossiliferous Rock formations and sedimentary rock layers is known as the fossil record....
s, although they can occasionally serve as raw genetic material for the creation of new genes through the process of gene duplication
Gene duplication

Gene duplication is any duplication of a region of DNA that contains a gene; it may occur as an error in homologous recombination, a retrotransposon event, or duplication of an entire chromosome....
 and divergence
Divergent evolution

Divergent evolution is the accumulation of differences between groups which can lead to the formation of new species, usually a result of diffusion of the same species adapting to different environments, leading to natural selection defining the success of specific mutations....
.

Transcription and translation

A gene is a sequence of DNA that contains genetic information and can influence the phenotype
Phenotype

A phenotype is any observable characteristic or trait_ of an organism: such as its morphology , development, biochemical or physiological properties, or behavior....
 of an organism. Within a gene, the sequence of bases along a DNA strand defines a messenger RNA
Messenger RNA

Messenger ribonucleic acid is a molecule of RNA encoding a chemical "blueprint" for a protein product. mRNA is transcription from a DNA template, and carries coding information to the sites of protein synthesis: the ribosomes....
 sequence, which then defines one or more protein sequences. The relationship between the nucleotide sequences of genes and the amino-acid
Amino acid

In chemistry, an amino acid is a molecule containing both amine and carboxyl functional groups. These molecules are particularly important in biochemistry, where this term refers to alpha-amino acids with the general formula H2NCHRCOOH, where R is an organic substituent....
 sequences of proteins is determined by the rules of translation, known collectively as the genetic code
Genetic code

The genetic code is the set of rules by which information encoded in genetic material is Translation into proteins by living cell s. The code defines a mapping between tri-nucleotide sequences, called codons, and amino acids....
. The genetic code consists of three-letter 'words' called codons formed from a sequence of three nucleotides (e.g. ACT, CAG, TTT).

In transcription, the codons of a gene are copied into messenger RNA by RNA polymerase
RNA polymerase

RNA polymerase is an enzyme that produces RNA. In cell s, RNAP is needed for constructing RNA chains from DNA genes as templates, a process called Transcription ....
. This RNA copy is then decoded by a ribosome
Ribosome

Ribosomes are complexes of RNA and protein that are found in all cell s. Ribosomes from bacteria, archaea and eukaryotes, the three domains of life on Earth, have significantly different structure and RNA....
 that reads the RNA sequence by base-pairing the messenger RNA to transfer RNA
Transfer RNA

Transfer RNA is a small RNA that transfers a specific active amino acid to a growing polypeptide chain at the ribosomal site of protein synthesis during translation ....
, which carries amino acids. Since there are 4 bases in 3-letter combinations, there are 64 possible codons ( combinations). These encode the twenty standard amino acids, giving most amino acids more than one possible codon. There are also three 'stop' or 'nonsense' codons signifying the end of the coding region; these are the TAA, TGA and TAG codons.

Replication


Cell division
Cell division

Cell division is a process by which a cell , called the parent cell, divides into two or more cells, called daughter cells. Cell division is usually a small segment of a larger cell cycle....
 is essential for an organism to grow, but when a cell divides it must replicate the DNA in its genome so that the two daughter cells have the same genetic information as their parent. The double-stranded structure of DNA provides a simple mechanism for DNA replication
DNA replication

DNA replication, the basis for heredity, is a fundamental process occurring in all living organisms to copy their DNA. This process is "semiconservative replication" in that each strand of the original double-stranded DNA molecule serves as template for the reproduction of the complementary strand....
. Here, the two strands are separated and then each strand's complementary DNA sequence is recreated by an enzyme
Enzyme

Enzymes are biomolecules that catalysis chemical reactions. Almost all enzymes are proteins. In enzymatic reactions, the molecules at the beginning of the process are called Substrate , and the enzyme converts them into different molecules, the products....
 called DNA polymerase
DNA polymerase

A DNA polymerase is an enzyme that catalyze the polymerization of deoxyribonucleotides into a DNA strand. DNA polymerases are best-known for their role in DNA replication, in which the polymerase "reads" an intact DNA strand as a template and uses it to synthesize the new strand....
. This enzyme makes the complementary strand by finding the correct base through complementary base pairing, and bonding it onto the original strand. As DNA polymerases can only extend a DNA strand in a 5' to 3' direction, different mechanisms are used to copy the antiparallel strands of the double helix. In this way, the base on the old strand dictates which base appears on the new strand, and the cell ends up with a perfect copy of its DNA.

Interactions with proteins

All the functions of DNA depend on interactions with proteins. These protein interactions can be non-specific, or the protein can bind specifically to a single DNA sequence. Enzymes can also bind to DNA and of these, the polymerases that copy the DNA base sequence in transcription and DNA replication are particularly important.

DNA-binding proteins


Structural proteins that bind DNA are well-understood examples of non-specific DNA-protein interactions. Within chromosomes, DNA is held in complexes with structural proteins. These proteins organize the DNA into a compact structure called chromatin
Chromatin

Chromatin is the complex combination of DNA, RNA, and protein that makes up chromosomes. It is found inside the cell nucleus of Eukaryote cell , and within the nucleoid in prokaryotic cells....
. In eukaryotes this structure involves DNA binding to a complex of small basic proteins called histone
Histone

In biology, histones are the chief protein components of chromatin. They act as spools around which DNA winds, and they play a role in gene regulation....
s, while in prokaryotes multiple types of proteins are involved. The histones form a disk-shaped complex called a nucleosome
Nucleosome

Nucleosomes form the fundamental repeating units of eukaryote chromatin, which is used to pack the large eukaryotic genomes into the nucleus while still ensuring appropriate access to it ....
, which contains two complete turns of double-stranded DNA wrapped around its surface. These non-specific interactions are formed through basic residues in the histones making ionic bond
Ionic bond

An ionic bond is a type of chemical bond that involves a metal and a non-metal ions through electrostatic attraction. In short, it is a bond formed by the attraction between two oppositely charged ions....
s to the acidic sugar-phosphate backbone of the DNA, and are therefore largely independent of the base sequence. Chemical modifications of these basic amino acid residues include methylation
Methylation

Methylation in the chemical sciences denotes the attachment or substitution of a methyl on various Substrate . This term is commonly used in chemistry, biochemistry, soil science and the biological sciences....
, phosphorylation
Phosphorylation

Phosphorylation is the addition of a phosphate group to a protein or other organic molecule. Protein phosphorylation in particular plays a significant role in a wide range of cellular processes....
 and acetylation
Acetylation

Acetylation describes a reaction that introduces an acetyl functional group into an organic compound. Deacetylation is the removal of the acetyl group....
. These chemical changes alter the strength of the interaction between the DNA and the histones, making the DNA more or less accessible to transcription factor
Transcription factor

In the field of molecular biology, a transcription factor is a protein that binds to specific DNA sequence and thereby controls the transfer of genetic information from DNA to RNA....
s and changing the rate of transcription. Other non-specific DNA-binding proteins in chromatin include the high-mobility group proteins, which bind to bent or distorted DNA. These proteins are important in bending arrays of nucleosomes and arranging them into the larger structures that make up chromosomes.

A distinct group of DNA-binding proteins are the DNA-binding proteins that specifically bind single-stranded DNA. In humans, replication protein A is the best-understood member of this family and is used in processes where the double helix is separated, including DNA replication, recombination and DNA repair. These binding proteins seem to stabilize single-stranded DNA and protect it from forming stem-loop
Stem-loop

Stem-loop intramolecular base pairing is a pattern that can occur in single-stranded DNA or, more commonly, in RNA. The structure is also known as a hairpin or hairpin loop. It occurs when two regions of the same molecule, usually palindrome in nucleotide sequence, base-pair to form a double helix that ends in an unpaired loop....
s or being degraded by nuclease
Nuclease

A nuclease is an enzyme capable of cleaving the phosphodiester bonds between the nucleotide subunits of nucleic acids. Older papers may use terms such as "polynucleotidase" or "nucleodepolymerase"....
s.

In contrast, other proteins have evolved to bind to particular DNA sequences. The most intensively-studied of these are the various transcription factor
Transcription factor

In the field of molecular biology, a transcription factor is a protein that binds to specific DNA sequence and thereby controls the transfer of genetic information from DNA to RNA....
s, which are proteins that regulate transcription. Each transcription factor binds to one particular set of DNA sequences and activates or inhibits the transcription of genes that have these sequences close to their promoters. The transcription factors do this in two ways. Firstly, they can bind the RNA polymerase responsible for transcription, either directly or through other mediator proteins; this locates the polymerase at the promoter and allows it to begin transcription. Alternatively, transcription factors can bind enzyme
Enzyme

Enzymes are biomolecules that catalysis chemical reactions. Almost all enzymes are proteins. In enzymatic reactions, the molecules at the beginning of the process are called Substrate , and the enzyme converts them into different molecules, the products....
s that modify the histones at the promoter; this will change the accessibility of the DNA template to the polymerase.

As these DNA targets can occur throughout an organism's genome, changes in the activity of one type of transcription factor can affect thousands of genes. Consequently, these proteins are often the targets of the signal transduction
Signal transduction

In biology, 'signal transduction' refers to any process by which a cell converts one kind of signal or stimulus into another. Most processes of signal transduction involve ordered sequences of biochemistry chemical reaction inside the cell, which are carried out by enzymes, activated by Second messenger systems, resulting in a signal tran...
 processes that control responses to environmental changes or cellular differentiation and development. The specificity of these transcription factors' interactions with DNA come from the proteins making multiple contacts to the edges of the DNA bases, allowing them to "read" the DNA sequence. Most of these base-interactions are made in the major groove, where the bases are most accessible.

DNA-modifying enzymes


Nucleases and ligases
Nuclease
Nuclease

A nuclease is an enzyme capable of cleaving the phosphodiester bonds between the nucleotide subunits of nucleic acids. Older papers may use terms such as "polynucleotidase" or "nucleodepolymerase"....
s are enzyme
Enzyme

Enzymes are biomolecules that catalysis chemical reactions. Almost all enzymes are proteins. In enzymatic reactions, the molecules at the beginning of the process are called Substrate , and the enzyme converts them into different molecules, the products....
s that cut DNA strands by catalyzing the hydrolysis
Hydrolysis

Hydrolysis is a chemical reaction during which one or more water are split into hydrogen and hydroxide ions which may go on to participate in further reactions....
 of the phosphodiester bond
Phosphodiester bond

A phosphodiester bond is a group of strong covalent bond between the Phosphorus in a phosphate group and two other molecules over two ester bonds....
s. Nucleases that hydrolyse nucleotides from the ends of DNA strands are called exonuclease
Exonuclease

Exonucleases are enzymes that cleave nucleotides one at a time from an end of a polynucleotide chain. These enzymes hydrolyze phosphodiester bonds from either the 3' or 5' terminus of a polynucleotide molecule....
s, while endonuclease
Endonuclease

Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain, in contrast to exonucleases, which cleave phosphodiester bonds at the end of a polynucleotide chain....
s cut within strands. The most frequently-used nucleases in molecular biology
Molecular biology

Molecular biology is the study of biology at a molecule level. The field overlaps with other areas of biology and chemistry, particularly genetics and biochemistry....
 are the restriction endonucleases
Restriction enzyme

A restriction enzyme is an enzyme that cuts double-stranded or single stranded DNA at specific recognition nucleotide sequences known as restriction sites....
, which cut DNA at specific sequences. For instance, the EcoRV enzyme shown to the left recognizes the 6-base sequence 5'-GAT|ATC-3' and makes a cut at the vertical line. In nature, these enzymes protect bacteria
Bacteria

The Bacteria are a large group of unicellular microorganisms. Typically a few micrometres in length, bacteria have a wide range of shapes, ranging from spheres to rods and spirals....
 against phage
Bacteriophage

A bacteriophage is any one of a number of viruses that infection bacteria. The term is commonly used in its shortened form, phage.Typically, bacteriophages consist of an outer protein hull enclosing genetic material....
 infection by digesting the phage DNA when it enters the bacterial cell, acting as part of the restriction modification system
Restriction modification system

The restriction modification system is used by bacteria, and perhaps other prokaryote organisms to protect themselves from foreign DNA, such as bacteriophages....
. In technology, these sequence-specific nucleases are used in molecular cloning
Molecular cloning

Molecular cloning refers to the procedure of isolating a defined DNA sequence and obtaining multiple copies of it in vivo. Cloning is frequently employed to amplify DNA fragments containing genes, but it can be used to amplify any DNA sequence such as promoters, non-coding sequences, chemically synthesised oligonucleotides and randomly fr...
 and DNA fingerprinting
Genetic fingerprinting

DNA profiling is a technique employed by forensic scientists to assist in the identification of individuals on the basis of their respective DNA profiles....
.

Enzymes called DNA ligase
DNA ligase

In molecular biology, DNA ligase is a special type of ligase that can link together two DNA strands that have double-strand break . The alternative, a single-strand break, is fixed by a different type of DNA ligase using the Complementary DNA as a template but still requires DNA ligase to create the final phosphodiester bond to fully repair...
s can rejoin cut or broken DNA strands. Ligases are particularly important in lagging strand
Replication fork

The replication fork is a structure that forms within the nucleus during DNA replication. It is created by helicases, which break the hydrogen bonds holding the two DNA strands together....
 DNA replication, as they join together the short segments of DNA produced at the replication fork
Replication fork

The replication fork is a structure that forms within the nucleus during DNA replication. It is created by helicases, which break the hydrogen bonds holding the two DNA strands together....
 into a complete copy of the DNA template. They are also used in DNA repair
DNA repair

DNA repair refers to a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolism activities and environmental factors such as UV light and Radiation can cause DNA damage, resulting in as many as 1 million individual molecular lesions per cell pe...
 and genetic recombination
Genetic recombination

Genetic recombination is the process by which a strand of genetic material is broken and then joined to a different DNA molecule. In eukaryotes recombination commonly occurs during meiosis as chromosomal crossover between paired chromosomes....
.

Topoisomerases and helicases
Topoisomerase
Topoisomerase

Topoisomerases are isomerase enzymes that act on the DNA topology of DNA....
s are enzymes with both nuclease and ligase activity. These proteins change the amount of supercoiling
DNA supercoil

In a "relaxed" double-helical segment of DNA, the two strands twist around the helical axis once every 10.4-10.5 base pairs of DNA sequence. Adding or subtracting twists, as some enzymes can do, imposes strain....
 in DNA. Some of these enzyme work by cutting the DNA helix and allowing one section to rotate, thereby reducing its level of supercoiling; the enzyme then seals the DNA break. Other types of these enzymes are capable of cutting one DNA helix and then passing a second strand of DNA through this break, before rejoining the helix. Topoisomerases are required for many processes involving DNA, such as DNA replication and transcription.

Helicase
Helicase

Helicases are a class of proteins vital to all living organisms. They are motor proteins that move directionally along a nucleic acid Phosphodiester bond, separating two Annealing nucleic acid strands using energy derived from nucleotide hydrolysis....
s are proteins that are a type of molecular motor. They use the chemical energy in nucleoside triphosphate
Nucleoside triphosphate

Nucleoside triphosphate is a nucleoside with three phosphates. Natural nucleoside triphosphates include adenosine triphosphate , guanosine triphosphate , cytidine triphosphate , thymidine triphosphate and uridine triphosphate ....
s, predominantly ATP
Adenosine triphosphate

This article is about the chemical used by cells as an energy carrier. For other uses, see ATP .Adenosine-5'-triphosphate is a multifunctional nucleotide, and plays an important role in cell biology as a coenzyme that is the "molecule unit of currency" of intracellular energy transfer....
, to break hydrogen bonds between bases and unwind the DNA double helix into single strands. These enzymes are essential for most processes where enzymes need to access the DNA bases.

Polymerases
Polymerase
Polymerase

A polymerase is an enzyme whose central function is associated with polymers of nucleic acids such as RNA and DNA.The primary function of a polymerase is the polymerization of new DNA or RNA against an existing DNA or RNA template in the processes of DNA replication and Transcription ....
s are enzyme
Enzyme

Enzymes are biomolecules that catalysis chemical reactions. Almost all enzymes are proteins. In enzymatic reactions, the molecules at the beginning of the process are called Substrate , and the enzyme converts them into different molecules, the products....
s that synthesize polynucleotide chains from nucleoside triphosphate
Nucleoside triphosphate

Nucleoside triphosphate is a nucleoside with three phosphates. Natural nucleoside triphosphates include adenosine triphosphate , guanosine triphosphate , cytidine triphosphate , thymidine triphosphate and uridine triphosphate ....
s. The sequence of their products are copies of existing polynucleotide chains - which are called templates. These enzymes function by adding nucleotides onto the 3' hydroxyl group
Hydroxyl

Hydroxyl in chemistry stands for a molecule consisting of an oxygen atom and a hydrogen atom connected by a covalent bond. The neutral form is a hydroxyl Radical and the hydroxyl anion is called a hydroxide....
 of the previous nucleotide in a DNA strand. Consequently, all polymerases work in a 5' to 3' direction. In the active site
Active site

The active site of an enzyme contains the catalysis and binding sites. The structure and chemical properties of the active site allow the recognition and binding of the substrate ....
 of these enzymes, the incoming nucleoside triphosphate base-pairs to the template: this allows polymerases to accurately synthesize the complementary strand of their template. Polymerases are classified according to the type of template that they use.

In DNA replication, a DNA-dependent DNA polymerase
DNA polymerase

A DNA polymerase is an enzyme that catalyze the polymerization of deoxyribonucleotides into a DNA strand. DNA polymerases are best-known for their role in DNA replication, in which the polymerase "reads" an intact DNA strand as a template and uses it to synthesize the new strand....
 makes a copy of a DNA sequence. Accuracy is vital in this process, so many of these polymerases have a proofreading
Proofreading (biology)

The term proofreading is used in genetics to refer to the error-correcting processes involved in DNA replication. In bacteria, all three DNA polymerases have the ability to proofread, using 3'->5' exonuclease activity....
 activity. Here, the polymerase recognizes the occasional mistakes in the synthesis reaction by the lack of base pairing between the mismatched nucleotides. If a mismatch is detected, a 3' to 5' exonuclease
Exonuclease

Exonucleases are enzymes that cleave nucleotides one at a time from an end of a polynucleotide chain. These enzymes hydrolyze phosphodiester bonds from either the 3' or 5' terminus of a polynucleotide molecule....
 activity is activated and the incorrect base removed. In most organisms DNA polymerases function in a large complex called the replisome
Replisome

The replisome is a complex molecular machine that carries out replication of DNA. It is made up of a number of subcomponents that each provide a specific function during the process of replication....
 that contains multiple accessory subunits, such as the DNA clamp
DNA clamp

A DNA clamp, also known as a sliding clamp, is a protein tertiary structure that serves as a processivity-promoting factor in DNA replication....
 or helicase
Helicase

Helicases are a class of proteins vital to all living organisms. They are motor proteins that move directionally along a nucleic acid Phosphodiester bond, separating two Annealing nucleic acid strands using energy derived from nucleotide hydrolysis....
s.

RNA-dependent DNA polymerases are a specialized class of polymerases that copy the sequence of an RNA strand into DNA. They include reverse transcriptase
Reverse transcriptase

In biochemistry, a reverse transcriptase, also known as RNA-dependent DNA polymerase, is a DNA polymerase enzyme that transcription single-stranded RNA into double-stranded DNA....
, which is a viral
Virus

A virus is a Optical microscope#Limitations of light microscopes infectious agent that is unable to grow or reproduce outside a host cell . Viruses infect all cellular life....
 enzyme involved in the infection of cells by retrovirus
Retrovirus

A retrovirus is a virus with an RNA genome that replicates by using a viral reverse transcriptase enzyme to transcription its RNA into DNA in the host cell....
es, and telomerase
Telomerase

Telomerase is an enzyme that adds specific DNA sequence repeats to the 3' end of DNA strands in the telomere regions, which are found at the ends of eukaryote chromosomes....
, which is required for the replication of telomeres. Telomerase is an unusual polymerase because it contains its own RNA template as part of its structure.

Transcription is carried out by a DNA-dependent RNA polymerase
RNA polymerase

RNA polymerase is an enzyme that produces RNA. In cell s, RNAP is needed for constructing RNA chains from DNA genes as templates, a process called Transcription ....
 that copies the sequence of a DNA strand into RNA. To begin transcribing a gene, the RNA polymerase binds to a sequence of DNA called a promoter and separates the DNA strands. It then copies the gene sequence into a messenger RNA
Messenger RNA

Messenger ribonucleic acid is a molecule of RNA encoding a chemical "blueprint" for a protein product. mRNA is transcription from a DNA template, and carries coding information to the sites of protein synthesis: the ribosomes....
 transcript until it reaches a region of DNA called the terminator
Terminator (genetics)

In genetics, a terminator, or transcription terminator is a section of genetic sequence that marks the end of gene or operon on genomic DNA for Transcription ....
, where it halts and detaches from the DNA. As with human DNA-dependent DNA polymerases, RNA polymerase II, the enzyme that transcribes most of the genes in the human genome, operates as part of a large protein complex with multiple regulatory and accessory subunits.

Genetic recombination


Chromosomal Recombination
A DNA helix usually does not interact with other segments of DNA, and in human cells the different chromosomes even occupy separate areas in the nucleus called "chromosome territories". This physical separation of different chromosomes is important for the ability of DNA to function as a stable repository for information, as one of the few times chromosomes interact is during chromosomal crossover
Chromosomal crossover

Chromosomal crossover is the process by which two chromosomes pair up and exchange sections of their DNA. This often occurs during prophase 1 of meiosis in a process called synapsis....
 when they recombine
Genetic recombination

Genetic recombination is the process by which a strand of genetic material is broken and then joined to a different DNA molecule. In eukaryotes recombination commonly occurs during meiosis as chromosomal crossover between paired chromosomes....
. Chromosomal crossover is when two DNA helices break, swap a section and then rejoin.

Recombination allows chromosomes to exchange genetic information and produces new combinations of genes, which increases the efficiency of natural selection
Natural selection

Natural selection is the process by which favorable heritable trait become more common in successive generations of a population of Reproduction organisms, and unfavorable heritable traits become less common, due to differential reproduction of genotypes....
 and can be important in the rapid evolution of new proteins. Genetic recombination can also be involved in DNA repair, particularly in the cell's response to double-strand breaks.

The most common form of chromosomal crossover is homologous recombination
Homologous recombination

Homologous recombination, also known as general recombination, is a type of genetic recombination that involves a genetic exchange between two similar or identical strands of DNA....
, where the two chromosomes involved share very similar sequences. Non-homologous recombination can be damaging to cells, as it can produce chromosomal translocation
Chromosomal translocation

In genetics, a chromosome translocation is a chromosome abnormality caused by rearrangement of parts between nonhomologous chromosomes. A fusion gene may be created when the translocation joins two otherwise separated genes, an event which is common in cancer....
s and genetic abnormalities. The recombination reaction is catalyzed by enzymes known as recombinases, such as RAD51
RAD51

RAD51 is a human gene. The protein encoded by this gene is a member of the RAD51 protein family which assist in repair of DNA_repair#Double-strand_breaks....
. The first step in recombination is a double-stranded break either caused by an endonuclease
Endonuclease

Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain, in contrast to exonucleases, which cleave phosphodiester bonds at the end of a polynucleotide chain....
 or damage to the DNA. A series of steps catalyzed in part by the recombinase then leads to joining of the two helices by at least one Holliday junction
Holliday junction

A Holliday junction is a mobile junction between four strands of DNA. The structure is named after Robin Holliday, who proposed it in 1964 to account for a particular type of exchange of genetic information he observed in yeast known as homologous recombination....
, in which a segment of a single strand in each helix is annealed to the complementary strand in the other helix. The Holliday junction is a tetrahedral junction structure that can be moved along the pair of chromosomes, swapping one strand for another. The recombination reaction is then halted by cleavage of the junction and re-ligation of the released DNA.

Evolution

DNA contains the genetic information that allows all modern living things to function, grow and reproduce. However, it is unclear how long in the 4-billion-year history of life
Timeline of evolution

This timeline of the evolution of life outlines the major events in the development of life on the planet Earth . For a thorough explanatory context, see the history of Earth, and geologic time scale....
 DNA has performed this function, as it has been proposed that the earliest forms of life may have used RNA as their genetic material. RNA may have acted as the central part of early cell metabolism as it can both transmit genetic information and carry out catalysis
Catalysis

Catalysis is the process in which the reaction rate of a chemical reaction is either increased or decreased by means of a chemical substance known as a catalyst....
 as part of ribozyme
Ribozyme

A ribozyme is an RNA molecule that catalyzes a chemical reaction. Many natural ribozymes catalyze either the hydrolysis of one of their own phosphodiester bonds, or the hydrolysis of bonds in other RNAs, but they have also been found to catalyze the aminotransferase activity of the ribosome....
s. This ancient RNA world
RNA world hypothesis

The RNA world hypothesis proposes that a world filled with life based on ribonucleic acid predated current life based on deoxyribonucleic acid ....
 where nucleic acid would have been used for both catalysis and genetics may have influenced the evolution
Evolution

In biology, evolution is change in the heritability trait of a population of organisms from one generation to the next. These changes are caused by a combination of three main processes: variation, reproduction, and selection....
 of the current genetic code based on four nucleotide bases. This would occur since the number of unique bases in such an organism is a trade-off between a small number of bases increasing replication accuracy and a large number of bases increasing the catalytic efficiency of ribozymes.

Unfortunately, there is no direct evidence of ancient genetic systems, as recovery of DNA from most fossils is impossible. This is because DNA will survive in the environment for less than one million years and slowly degrades into short fragments in solution. Claims for older DNA have been made, most notably a report of the isolation of a viable bacterium from a salt crystal 250-million years old, but these claims are controversial.

Uses in technology


Genetic engineering


Methods have been developed to purify DNA from organisms, such as phenol-chloroform extraction
Phenol-chloroform extraction

Phenol-chloroform extraction is a liquid-liquid extraction technique in biochemistry. It is widely used in molecular biology for isolating DNA, RNA and protein....
 and manipulate it in the laboratory, such as restriction digest
Restriction digest

A restriction digest is a procedure used in molecular biology to prepare Deoxyribonucleic acid for analysis or other processing. It is also known as DNA fragmentation ....
s and the polymerase chain reaction
Polymerase chain reaction

The polymerase chain reaction is a technique widely used in molecular biology. It derives its name from one of its key components, a DNA polymerase used to amplify a piece of DNA by in vitro enzyme DNA replication....
. Modern biology
Biology

Biology is a branch of the natural sciences concerned with the study of living organisms and their interaction with each other and their environment ....
 and biochemistry
Biochemistry

Biochemistry is the study of the chemistry processes in living organisms. It deals with the structure and function of cellular components such as proteins, carbohydrates, lipids, nucleic acids and other biomolecules....
 make intensive use of these techniques in recombinant DNA technology. Recombinant DNA
Recombinant DNA

Recombinant DNA is a form of synthetic DNA thereby combining DNA sequences that would not normally occur together. In terms of genetic modification, recombinant DNA is produced through the addition of relevant DNA into an existing organismal genome, such as the plasmid of bacteria, to code for or alter different traits for a specific purpos...
 is a man-made DNA sequence that has been assembled from other DNA sequences. They can be transformed
Transformation (genetics)

In molecular biology, transformation is the Introduction to genetics alteration of a cell resulting from the uptake, genomic incorporation, and expression of foreign genetic material ....
 into organisms in the form of plasmid
Plasmid

File:plasmid .svgA plasmid is an extra-chromosomal DNA molecule separate from the chromosome which is capable of replicating independently of the chromosomal DNA....
s or in the appropriate format, by using a viral vector
Viral vector

Viral vectors are a tool commonly used by molecular biologists to deliver genetic material into cell s. This process can be performed inside a living organism or in cell culture ....
. The genetically modified
Genetic engineering

Engineering There are a number of ways through which genetic engineering is accomplished. Essentially, the process has five main steps# Isolation of the genes of interest...
 organisms produced can be used to produce products such as recombinant protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
s, used in medical research, or be grown in agriculture
Agriculture

Agriculture refers to the production of food and goods through farming and forestry. Agriculture was the key development that led to the rise of civilization, with the animal husbandry of domestication animals and plants creating food surpluses that enabled the development of more Population density and Social stratification societies....
.

Forensics


Forensic scientists can use DNA in blood
Blood

Blood is a specialized bodily fluid that delivers necessary substances to the body's Cell s ? such as nutrients and oxygen ? and transports waste products away from those same cells....
, semen
Semen

Semen is an organic fluid, also known as seminal fluid, that usually contains spermatozoon....
, skin
Skin

The skin is the outer covering of the body, also known as the epidermis. It is the largest organ of the integumentary system made up of multiple layers of epithelial biological tissue, and guards the underlying muscles, bones, ligaments and organ s....
, saliva
Saliva

Saliva is the watery and usually frothy substance produced in the mouths of humans and most other animals. Saliva is produced in and secreted from the salivary glands....
 or hair
Hair

Hair is a protein filament that epidermal growth from hair follicle deep within the dermis. The fine, soft hair found on many nonhuman mammals is typically called fur; wool is the characteristically curly hair found on sheep and goats....
 found at a crime scene to identify a matching DNA of an individual, such as a perpetrator. This process is called genetic fingerprinting
Genetic fingerprinting

DNA profiling is a technique employed by forensic scientists to assist in the identification of individuals on the basis of their respective DNA profiles....
, or more accurately, DNA profiling. In DNA profiling, the lengths of variable sections of repetitive DNA, such as short tandem repeat
Short tandem repeat

A short tandem repeat in DNA is a class of polymorphism that occurs when a pattern of two or more nucleotides are repeated and the repeated sequences are directly adjacent to each other....
s and minisatellite
Minisatellite

A minisatellite is a section of DNA that consists of a short series of bases 10-60bp. These occur at more than 1000 locations in the human genome....
s, are compared between people. This method is usually an extremely reliable technique for identifying a matching DNA. However, identification can be complicated if the scene is contaminated with DNA from several people. DNA profiling was developed in 1984 by British geneticist Sir Alec Jeffreys
Alec Jeffreys

Sir Alec John Jeffreys, Fellow of the Royal Society is a United Kingdom geneticist, who developed techniques for DNA fingerprinting and DNA profiling which are now used all over the world in forensic science to assist police detective work, and also to resolve paternity and immigration disputes....
, and first used in forensic science to convict Colin Pitchfork in the 1988 Enderby murders
Colin Pitchfork

Colin Pitchfork was the first criminal convicted for murder based on DNA fingerprinting evidence and the first to be caught as a result of mass screening....
 case.

People convicted of certain types of crimes may be required to provide a sample of DNA for a database. This has helped investigators solve old cases where only a DNA sample was obtained from the scene. DNA profiling can also be used to identify victims of mass casualty incidents. On the other hand, many convicted people have been released from prison on the basis of DNA techniques, which were not available when a crime had originally been committed.

Bioinformatics

Bioinformatics
Bioinformatics

Bioinformatics is the application of information technology to the field of molecular biology. The term bioinformatics was coined by Paulien Hogeweg in 1978 for the study of informatic processes in biotic systems....
 involves the manipulation, searching, and data mining
Data mining

Data mining is the process of extracting hidden patterns from data. As more data is gathered, with the amount of data doubling every three years, data mining is becoming an increasingly important tool to transform this data into information....
 of DNA sequence data. The development of techniques to store and search DNA sequences have led to widely-applied advances in computer science
Computer science

Computer science is the study of the theoretical foundations of information and computation, and of practical techniques for their implementation and application in computer systems....
, especially string searching algorithm
String searching algorithm

String searching algorithms, sometimes called string matching algorithms, are an important class of string algorithms that try to find a place where one or several string are found within a larger string or text....
s, machine learning
Machine learning

Machine learning is the subfield of artificial intelligence that is concerned with the design and development of algorithms that allow computers to improve their performance over time based on data, such as from sensor data or databases....
 and database theory
Database theory

Database theory encapsulates a broad range of topics related to the study and research of the theoretical realm of databases and database management systems....
. String searching or matching algorithms, which find an occurrence of a sequence of letters inside a larger sequence of letters, were developed to search for specific sequences of nucleotides. In other applications such as text editor
Text editor

A text editor is a type of software application used for editing plain text files.Text editors are often provided with operating systems or software development packages, and can be used to change configuration files and programming language source code....
s, even simple algorithms for this problem usually suffice, but DNA sequences cause these algorithms to exhibit near-worst-case behaviour due to their small number of distinct characters. The related problem of sequence alignment
Sequence alignment

In bioinformatics, a sequence alignment is a way of arranging the sequences of DNA, RNA, or protein to identify regions of similarity that may be a consequence of functional, structural biology, or evolutionary relationships between the sequences....
 aims to identify homologous
Homology (biology)

In evolutionary biology, homology refers to any similarity between characteristics that is due to their common descent. The word homologous derives from the ancient Greek ??????e??, 'to agree'....
 sequences and locate the specific mutation
Mutation

In biology, mutations are changes to the nucleotide sequence of the genetic material of an organism. Mutations can be caused by copying errors in the genetic material during cell division, by exposure to ultraviolet or ionizing radiation, chemical mutagens, or virus , or can be induced by the organism, itself, by cellular processes such as s...
s that make them distinct. These techniques, especially multiple sequence alignment
Multiple sequence alignment

A multiple sequence alignment is a sequence alignment of three or more biological sequences, generally protein, DNA, or RNA. In general, the input set of query sequences are assumed to have an evolutionary relationship by which they share a lineage and are descended from a common ancestor....
, are used in studying phylogenetic
Phylogenetics

In biology, phylogenetics is the study of evolutionary relatedness among various groups of organisms , which is discovered through molecular sequencing data and morphological data matrices....
 relationships and protein function. Data sets representing entire genomes' worth of DNA sequences, such as those produced by the Human Genome Project
Human Genome Project

The Human Genome Project was an international scientific research project with a primary goal to determine the sequence of chemical base pairs which make up DNA and to identify and map the approximately 20,000-25,000 genes of the human genome from both a physical and functional standpoint...
, are difficult to use without annotations, which label the locations of genes and regulatory elements on each chromosome. Regions of DNA sequence that have the characteristic patterns associated with protein- or RNA-coding genes can be identified by gene finding
Gene prediction

Gene finding typically refers to the area of computational biology that is concerned with algorithmically identifying stretches of sequence, usually Genome DNA, that are biologically functional....
 algorithms, which allow researchers to predict the presence of particular gene product
Gene product

A gene product is the biochemical material, either RNA or protein, resulting from Gene_expression of a gene. A measurement of the amount of gene product is sometimes used to infer how active a gene is....
s in an organism even before they have been isolated experimentally.

DNA nanotechnology


DNA nanotechnology uses the unique molecular recognition
Molecular recognition

The term molecular recognition refers to the specific interaction between two or more molecules through noncovalent bonding such as including hydrogen bonding, metal coordination, hydrophobic effect, van der Waals forces, pi-pi interactions, electrostatic and/or electromagnetic effects....
 properties of DNA and other nucleic acids to create self-assembling branched DNA complexes with useful properties. DNA is thus used as a structural material rather than as a carrier of biological information. This has led to the creation of two-dimensional periodic lattices (both tile-based as well as using the "DNA origami
DNA origami

Nanoscale folding of DNA, also known as DNA origami, is a process which allows researchers to create arbitrary two-dimensional shapes at the nanoscale using DNA....
" method) as well as three-dimensional structures in the shapes of polyhedra
Polyhedron

|}A polyhedron is often defined as a geometry object with flat faces and straight edges .This definition of a polyhedron is not very precise, and to a modern mathematician is quite unsatisfactory....
. Nanomechanical devices
DNA machine

A DNA machine is a molecular machine constructed from DNA. Research into DNA machines was pioneered in the late 1980s by Nadrian Seeman and co-workers from New York University....
 and algorithmic self-assembly
DNA computing

DNA computing is a form of computing which uses DNA, biochemistry and molecular biology, instead of the traditional silicon-based computer technology....
 have also been demonstrated, and these DNA structures have been used to template the arrangement of other molecules such as gold nanoparticles
Colloidal gold

Colloidal gold, also known as "nanogold", is a suspension of sub-micrometre-sized particles of gold in a fluid ? usually water. The liquid is usually either an intense red colour , or a dirty yellowish colour ....
 and streptavidin
Streptavidin

Streptavidin is a 53,000 Atomic_mass_unit tetrameric protein purified from the bacterium Streptomyces avidinii. It finds wide use in molecular biology through its extraordinarily strong affinity for the vitamin biotin; the dissociation constant of the biotin-streptavidin complex is on the order of ~10-15 mol/L, ranking among...
 proteins.

History and anthropology

Because DNA collects mutations over time, which are then inherited, it contains historical information and by comparing DNA sequences, geneticists can infer the evolutionary history of organisms, their phylogeny
Phylogenetics

In biology, phylogenetics is the study of evolutionary relatedness among various groups of organisms , which is discovered through molecular sequencing data and morphological data matrices....
. This field of phylogenetics is a powerful tool in evolutionary biology
Evolutionary biology

Evolutionary biology is a sub-field of biology concerned with the origin of species from a common descent and descent of species, as well as their evolution, multiplication and diversity over time....
. If DNA sequences within a species are compared, population geneticists
Population genetics

Population genetics is the study of the allele frequency distribution and change under the influence of the four evolutionary processes: natural selection, genetic drift, mutation and gene flow....
 can learn the history of particular populations. This can be used in studies ranging from ecological genetics
Ecological genetics

Ecological genetics is the study of genetics in the context of the interactions among organisms and between the organisms and their environment....
 to anthropology
Anthropology

Anthropology is the study of humans and humanity in its totality. Anthropology has origins in the natural sciences, and the humanities. In Great Britain it was originally divided into physical anthropology and cultural anthropology, which itself was divided into archaeology, technology, ethnology and sociology ....
; for example, DNA evidence is being used to try to identify the Ten Lost Tribes of Israel
Ten Lost Tribes

The phrase Ten Lost Tribes of Israel refers to the ancient Tribes of Israel that disappeared from the Hebrew Bible account after the Kingdom of Israel was destroyed, enslaved and exiled by ancient Assyria....
.

DNA has also been used to look at modern family relationships, such as establishing family relationships between the descendants of Sally Hemings
Sally Hemings

Sally Hemings was an African-American slavery owned by Thomas Jefferson. She is said to have been the half-sister of Jefferson's wife Martha Wayles Skelton Jefferson....
 and Thomas Jefferson
Thomas Jefferson

Thomas Jefferson was the List of Presidents of the United States President of the United States , the principal author of the United States Declaration of Independence , and one of the most influential Founding Fathers of the United States for his promotion of the ideals of republicanism in the United States....
. This usage is closely related to the use of DNA in criminal investigations detailed above. Indeed, some criminal investigations have been solved when DNA from crime scenes has matched relatives of the guilty individual.

History of DNA research

Rosalind Franklin
DNA was first isolated by the Swiss
Switzerland

Switzerland is a landlocked Swiss Alps country of roughly 7.7 million people in Western Europe with an area of 41,285 km?. Switzerland is a federal republic consisting of 26 states called Cantons of Switzerland....
 physician Friedrich Miescher
Friedrich Miescher

Johannes Friedrich Miescher was a Switzerland biologist. He isolated various phosphate-rich chemicals, which he called nuclein , from the nuclei of white blood cells in 1869 at Felix Hoppe-Seyler's laboratory at the University of T?bingen, Germany, paving the way for the identification of DNA as the carrier of inheritance....
 who, in 1869, discovered a microscopic substance in the pus
Pus

Pus is a whitish-yellow, yellow or yellow-brown substance produced during inflammatory pyogenic bacteriuml infection. An accumulation of pus in an enclosed tissue space is known as an abscess....
 of discarded surgical bandages. As it resided in the nuclei of cells, he called it "nuclein". In 1919, Phoebus Levene
Phoebus Levene

Phoebus Aaron Theodore Levene, M.D. was a Russian-American biochemist who studied the structure and function of nucleic acids. He characterized the different forms of nucleic acid, DNA from RNA, and found that DNA contained adenine, guanine, thymine, cytosine, deoxyribose, and a phosphate group....
 identified the base, sugar and phosphate nucleotide unit. Levene suggested that DNA consisted of a string of nucleotide units linked together through the phosphate groups. However, Levene thought the chain was short and the bases repeated in a fixed order. In 1937 William Astbury
William Astbury

William Thomas Astbury Fellow of the Royal Society was an English people physicist and molecular biology who made pioneering X-ray diffraction studies of biological molecules....
 produced the first X-ray diffraction
X-ray scattering techniques

X-ray scattering techniques are a family of non-destructive analytical scientific technique which reveal information about the crystallography structure, chemical composition, and physical properties of materials and thin films....
 patterns that showed that DNA had a regular structure.

In 1928, Frederick Griffith
Frederick Griffith

Frederick Griffith was a United Kingdom medical officer and genetics. In 1928, in what is today known as Griffith's experiment, he discovered what he called a transforming principle, which is today known to be DNA....
 discovered that traits
Trait (biology)

A trait is a distinct variant of a phenotype character of an organism that may be inherited, environmentally determined or somewhere in between....
 of the "smooth" form of the Pneumococcus could be transferred to the "rough" form of the same bacteria by mixing killed "smooth" bacteria with the live "rough" form. This system provided the first clear suggestion that DNA carried genetic information—the Avery-MacLeod-McCarty experiment
Avery-MacLeod-McCarty experiment

The Avery-MacLeod-McCarty experiment was an experimental demonstration, reported in 1944 by Oswald Avery, Colin MacLeod, and Maclyn McCarty, that DNA is the substance that causes bacterial transformation....
—when Oswald Avery
Oswald Avery

Oswald Theodore Avery was a Canadian-born United States physician and medicine researcher. The major part of his career was spent at the Rockefeller University Hospital in New York City....
, along with coworkers Colin MacLeod and Maclyn McCarty
Maclyn McCarty

Maclyn McCarty was an United States geneticist.Maclyn McCarty, who devoted his life as a physician-scientist to studying infectious disease organisms, was best known for his part in the monumental discovery that DNA, rather than protein, constituted the chemical nature of a gene....
, identified DNA as the transforming principle
Griffith's experiment

Griffith's experiment, conducted in 1928 by Frederick Griffith, was one of the first experiments suggesting that bacteria are capable of transferring genetic information through a process known as Transformation ....
 in 1943. DNA's role in heredity
Heredity

Heredity is the passing of traits to offspring . This is the process by which an offspring cell or organism acquires or becomes predisposed to the characteristics of its parent cell or organism....
 was confirmed in 1952, when Alfred Hershey
Alfred Hershey

Alfred Day Hershey was an American Nobel Prize-winning bacteriologist and geneticist.He was born in Owosso, Michigan and received his B.S. in chemistry at Michigan State University in 1930 and his Ph.D....
 and Martha Chase
Martha Chase

Martha Cowles Chase , also known as Martha C. Epstein, was an United States of America geneticist famously known for being a member of the 1952 team which experimentally showed that DNA rather than protein is the genetic material of life....
 in the Hershey-Chase experiment
Hershey-Chase experiment

The Hershey-Chase experiments were a series of experiments conducted in 1952 by Alfred Hershey and Martha Chase, confirming that DNA was the genetic material, which had first been in the 1944 Avery-MacLeod-McCarty experiment....
 showed that DNA is the genetic material of the T2 phage
Enterobacteria phage T2

Enterobacteria phage T2 is a virulent phage of the T4-like viruses genus, in the family Myoviridae. It infects E. coli and is the best known of the T-even phages....
.

In 1953, based on X-ray diffraction images
Photo 51

Photo 51 is the nickname given to an X-ray diffraction image of DNA taken by Rosalind Franklin in 1952 that was critical evidence in identifying the structure of DNA....
 taken by Rosalind Franklin
Rosalind Franklin

Rosalind Elsie Franklin was an English people biophysicist and X-ray crystallography who made important contributions to the understanding of the fine molecular structures of DNA, viruses, coal and graphite....
 and the information that the bases were paired, James D. Watson
James D. Watson

James Dewey Watson is an American molecular biology, best known as one of the co-discoverers of the structure of DNA. Watson, Francis Crick, and Maurice Wilkins were awarded the 1962 Nobel Prize in Physiology or Medicine "for their discoveries concerning the molecular structure of nucleic acids and its significance for information transfer...
 and Francis Crick
Francis Crick

Francis Harry Compton Crick Order of Merit Royal Society , Ph.D., was a British molecular biology, physics, and neuroscience, and most noted for being one of the co-discoverers of the structure of the DNA molecule in 1953....
 suggested what is now accepted as the first accurate model of DNA structure
Molecular structure of Nucleic Acids

The Molecular structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid was an article published by James D. Watson and Francis Crick in the scientific journal Nature in its 171st volume on page 737-738 ....
 in the journal Nature
Nature (journal)

Nature is a prominent scientific journal, first published on 4 November 1869. Although most scientific journals are now highly specialized, Nature is one of the few journals, along with other weekly journals such as Science and Proceedings of the National Academy of Sciences, that still publishes original research articles ac...
. Experimental evidence for Watson and Crick's model were published in a series of five articles in the same issue of Nature. Of these, Franklin
Rosalind Franklin

Rosalind Elsie Franklin was an English people biophysicist and X-ray crystallography who made important contributions to the understanding of the fine molecular structures of DNA, viruses, coal and graphite....
 and Raymond Gosling
Raymond Gosling

Raymond Gosling is a distinguished scientist who worked with both Maurice Wilkins and Rosalind Franklin at King's College London in deducing the structure of DNA, under the direction of Sir John Randall....
's paper was the first publication of X-ray diffraction data that supported the Watson and Crick model, this issue also contained an article on DNA structure by Maurice Wilkins
Maurice Wilkins

Maurice Hugh Frederick Wilkins Order of the British Empire Royal Society was a New Zealand-born UKmolecular biology, and Nobel Laureate who contributed research in the fields of phosphorescence, radar, isotope separation, and X-ray diffraction....
 and his colleagues. In 1962, after Franklin's death, Watson, Crick, and Wilkins jointly received the Nobel Prize
Nobel Prize

The Nobel Prize , established in the 1895 will of Swedish chemist Alfred Nobel; it was first awarded in Nobel Prize in Physics, Nobel Prize in Chemistry, Nobel Prize in Physiology or Medicine, Nobel Prize in Literature, and Nobel Peace Prize in 1901....
 in Physiology or Medicine
Nobel Prize in Physiology or Medicine

The Nobel Prize in Physiology or Medicine is awarded once a year by the Swedish Karolinska Institutet. It is one of the five Nobel Prizes established by the will of Alfred Nobel in 1895, awarded for outstanding contributions in Nobel Prize in Physics, Nobel Prize in Chemistry, Nobel Prize in Literature, Nobel Peace Prize, and Physiology or Medic...
. However, debate continues on who should receive credit for the discovery.

In an influential presentation in 1957, Crick laid out the "Central Dogma" of molecular biology
Central dogma of molecular biology

The central dogma of molecular biology was first enunciated by Francis Crick in 1958 and re-stated in a Nature paper published in 1970:In other words, 'once information gets into protein, it can't flow back to nucleic acid.'...
, which foretold the relationship between DNA, RNA, and proteins, and articulated the "adaptor hypothesis". Final confirmation of the replication mechanism that was implied by the double-helical structure followed in 1958 through the Meselson-Stahl experiment
Meselson-Stahl experiment

The Meselson-Stahl experiment was an experiment by Matthew Meselson and Franklin Stahl which demonstrated that DNA replication was semiconservative replication....
. Further work by Crick and coworkers showed that the genetic code was based on non-overlapping triplets of bases, called codons, allowing Har Gobind Khorana
Har Gobind Khorana

Har Gobind Khorana, or Hargobind Khorana is an Indian-American molecular biologist. He was awarded the Nobel Prize in Physiology or Medicine in 1968 for his work on the interpretation of the genetic code and its function in protein biosynthesis....
, Robert W. Holley
Robert W. Holley

Robert William Holley was an American biochemist, he was awarded the Nobel Prize in Physiology or Medicine in 1968 for describing the structure of alanine tRNA, linking DNA and protein synthesis....
 and Marshall Warren Nirenberg
Marshall Warren Nirenberg

Marshall Warren Nirenberg is a United States of America biochemist and genetics. He shared a Nobel Prize in Physiology or Medicine in 1968 with Har Gobind Khorana and Robert W....
 to decipher the genetic code. These findings represent the birth of molecular biology
Molecular biology

Molecular biology is the study of biology at a molecule level. The field overlaps with other areas of biology and chemistry, particularly genetics and biochemistry....
.

See also


Further reading

  • Calladine, Chris R.; Drew, Horace R.; Luisi, Ben F. and Travers, Andrew A. Understanding DNA, Elsevier Academic Press, 2003. ISBN 978-0-12155089-9
  • Clayton, Julie. (Ed.). 50 Years of DNA, Palgrave MacMillan Press, 2003. ISBN 978-1-40-391479-8
  • Judson, Horace Freeland. The Eighth Day of Creation: Makers of the Revolution in Biology, Cold Spring Harbor Laboratory Press, 1996. ISBN 978-0-87-969478-4
  • Olby, Robert
    Robert Olby

    Robert Cecil Olby is a research professor in the Department of History and Philosophy of Science at the University of Pittsburgh. Formerly at the University of Leeds, UK, Robert Olby is known as a historian of nineteenth and twentieth century biology, his special fields being genetics and molecular biology....
    . The Path to The Double Helix: Discovery of DNA, first published in October 1974 by MacMillan, with foreword by Francis Crick; ISBN 978-0-48-668117-7; the definitive DNA textbook, revised in 1994, with a 9 page postscript.
  • Ridley, Matt
    Matt Ridley

    Matthew White Ridley is an English journalist, science writer, businessman and aristocrat. Ridley was educated at Eton College and Magdalen College, Oxford where he received a doctorate in zoology before commencing a career in journalism....
    . Francis Crick: Discoverer of the Genetic Code (Eminent Lives) HarperCollins Publishers; 192 pp, ISBN 978-0-06-082333-7 2006
  • Rose, Steven. The Chemistry of Life, Penguin, ISBN 978-0-14-027273-4.
  • Watson, James D. and Francis H.C. Crick. (PDF). Nature
    Nature (journal)

    Nature is a prominent scientific journal, first published on 4 November 1869. Although most scientific journals are now highly specialized, Nature is one of the few journals, along with other weekly journals such as Science and Proceedings of the National Academy of Sciences, that still publishes original research articles ac...
     171, 737–738, 25 April 1953.
  • Watson, James D. DNA: The Secret of Life ISBN 978-0-375-41546-3.
  • Watson, James D. The Double Helix: A Personal Account of the Discovery of the Structure of DNA (Norton Critical Editions)
    The Double Helix

    The Double Helix: A Personal Account of the Discovery of the Structure of DNA is an autobiographical account of the discovery of the double helix structure of DNA written by James D....
    . ISBN 978-0-393-95075-5
  • Watson, James D. "Avoid boring people and other lessons from a life in science" (2007) New York: Random House. ISBN 978-0-375-41284-4


External links

  • Another DNA Learning Center site on DNA, genes, and heredity from Mendel to the human genome project.
  • From the official Nobel Prize web site
  • , Nature
    Nature (journal)

    Nature is a prominent scientific journal, first published on 4 November 1869. Although most scientific journals are now highly specialized, Nature is one of the few journals, along with other weekly journals such as Science and Proceedings of the National Academy of Sciences, that still publishes original research articles ac...
  • National Centre for Biotechnology Education
  • DNA from the Beginning Study Guide
  • Olby, R.
    Robert Olby

    Robert Cecil Olby is a research professor in the Department of History and Philosophy of Science at the University of Pittsburgh. Formerly at the University of Leeds, UK, Robert Olby is known as a historian of nineteenth and twentieth century biology, his special fields being genetics and molecular biology....
     (2003) Nature 421 (January 23): 402–405.
  • at Mandeville Special Collections Library, Geisel Library, University of California, San Diego
  • — watch videos and participate in real-time chat with top scientists