All Topics  
DNA sequencing

 
DNA Sequencing

   Email Print
   Bookmark   Link






 

DNA sequencing



 
 
The term DNA sequencing refers to methods for determining the order of the nucleotide
Nucleotide

Nucleotides are molecules that comprise the structural units of RNA and DNA. Additionally, nucleotides play central roles in metabolism. In that capacity, they serve as sources of chemical energy , participate in cell signaling , and are incorporated into important cofactors of enzymatic reactions ....
 bases, adenine
Adenine

Adenine is a nucleobase with a variety of roles in biochemistry including cellular respiration, in the form of both the energy-rich adenosine triphosphate and the cofactor s nicotinamide adenine dinucleotide and flavin adenine dinucleotide , and Protein biosynthesis, as a chemical component of DNA and RNA....
, guanine
Guanine

Guanine is one of the five main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, thymine, and uracil. In DNA, guanine is paired with cytosine....
, cytosine
Cytosine

Cytosine is one of the five main bases found in DNA and RNA. It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached ....
, and thymine
Thymine

Thymine is one of the four bases in the nucleic acid of DNA that make up the letters GCAT. The others are adenine, guanine, and cytosine. Thymine always pairs with adenine....
, in a molecule of DNA
DNA

Deoxyribonucleic acid is a nucleic acid that contains the genetics instructions used in the development and functioning of all known living organisms and some viruses....
. The first DNA sequences were obtained by academic researchers, using laborious methods based on 2-dimensional chromatography in the early 1970s. Following the development of dye
Cyanine

Cyanine is a International Union of Pure and Applied Chemistry nomenclature name of a synthetic dye family belonging to polymethine group. Cyanines have many uses as fluorescent dyes, particularly in biomedical imaging....
-based sequencing methods with automated analysis, DNA sequencing has become easier and orders of magnitude faster.






Discussion
Ask a question about 'DNA sequencing'
Start a new discussion about 'DNA sequencing'
Answer questions from other users
Full Discussion Forum



Encyclopedia


The term DNA sequencing refers to methods for determining the order of the nucleotide
Nucleotide

Nucleotides are molecules that comprise the structural units of RNA and DNA. Additionally, nucleotides play central roles in metabolism. In that capacity, they serve as sources of chemical energy , participate in cell signaling , and are incorporated into important cofactors of enzymatic reactions ....
 bases, adenine
Adenine

Adenine is a nucleobase with a variety of roles in biochemistry including cellular respiration, in the form of both the energy-rich adenosine triphosphate and the cofactor s nicotinamide adenine dinucleotide and flavin adenine dinucleotide , and Protein biosynthesis, as a chemical component of DNA and RNA....
, guanine
Guanine

Guanine is one of the five main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, thymine, and uracil. In DNA, guanine is paired with cytosine....
, cytosine
Cytosine

Cytosine is one of the five main bases found in DNA and RNA. It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached ....
, and thymine
Thymine

Thymine is one of the four bases in the nucleic acid of DNA that make up the letters GCAT. The others are adenine, guanine, and cytosine. Thymine always pairs with adenine....
, in a molecule of DNA
DNA

Deoxyribonucleic acid is a nucleic acid that contains the genetics instructions used in the development and functioning of all known living organisms and some viruses....
. The first DNA sequences were obtained by academic researchers, using laborious methods based on 2-dimensional chromatography in the early 1970s. Following the development of dye
Cyanine

Cyanine is a International Union of Pure and Applied Chemistry nomenclature name of a synthetic dye family belonging to polymethine group. Cyanines have many uses as fluorescent dyes, particularly in biomedical imaging....
-based sequencing methods with automated analysis, DNA sequencing has become easier and orders of magnitude faster. Knowledge of DNA sequences of gene
Gene

A gene is the basic unit of heredity in a living organism. All living things depend on genes. Genes hold the information to build and maintain their cell and pass genetic trait to offspring....
s and other parts of the genome
Genome

In classical genetics, the genome of a diploid organism including eukarya refers to a full set of chromosomes or genes in a gamete; thereby, a regular somatic cell contains two full sets of genomes....
 of organisms has become indispensable for basic research studying biological processes, as well as in applied fields such as diagnostic or forensic
Forensic biology

Forensic biology is the Applied science of biology to law enforcement.It includes the subdisciplines of Forensic anthropology, botany, Forensic entomology, Forensic odontology and various DNA or protein based techniques....
 research. The advent of DNA sequencing has significantly accelerated biological research and discovery. The rapid speed of sequencing attained with modern DNA sequencing technology has been instrumental in the sequencing of the human genome
Human genome

The human genome is the genome of Homo sapiens, which is stored on 23 chromosome pairs. Twenty-two of these are autosome, while the remaining pair is XY sex-determination system....
, in the Human Genome Project
Human Genome Project

The Human Genome Project was an international scientific research project with a primary goal to determine the sequence of chemical base pairs which make up DNA and to identify and map the approximately 20,000-25,000 genes of the human genome from both a physical and functional standpoint...
. Related projects, often by scientific collaboration across continents, have generated the complete DNA sequences of many animal, plant, and microbial genomes.

RNA sequencing, which is technically easier to perform than DNA sequencing, was one of the earliest forms of nucleotide sequencing. The major landmark of RNA sequencing is the sequence of the first complete gene and the complete genome of Bacteriophage MS2
Bacteriophage MS2

The 'bacteriophage MS2'. MS2 phage is an icosahedral bacteriophage with a diameter of 27-34nm and an isoelectric point of 3.9. MS2 phage can be propagated in Escherichia coli, commonly E....
, identified and published by Walter Fiers
Walter Fiers

Walter Fiers is a Belgium molecular biologist.He obtained a degree of Engineer for Chemistry and Agricultural Industries at the University of Ghent in 1954, and started his research career as an Enzyme in the laboratory of Laurent Vandendriessche in Ghent....
 and his coworkers at the University of Ghent (Ghent
Ghent

Ghent is a city and a municipality located in the Flemish region, Belgium. It is the capital and biggest city of the East Flanders province. The city started as a settlement at the confluence of the Rivers Scheldt and Lys River and became in the Middle Ages one of the largest and richest cities of northern Europe....
, Belgium
Belgium

* A small German-speaking Community of Belgium exists in eastern Wallonia. Belgium's linguistic diversity and related political and cultural conflicts are reflected in the history of Belgium and a complex Communities and regions of Belgium....
), published between 1972 and 1976.

Prior to the development of rapid DNA sequencing methods in the early 1970s by Frederick Sanger
Frederick Sanger

Frederick Sanger, Order of Merit , Order of the Companions of Honour, Order of the British Empire, Royal Society is an England biochemistry and twice a Nobel laureate in chemistry....
 in England and Walter Gilbert
Walter Gilbert

Walter Gilbert is an United States Physics, Biochemistry, molecular biology pioneer, and Nobel laureate....
 and Allan Maxam
Allan Maxam

Allan Maxam is one of the pioneers of molecular genetics. He was one of the contributors to develop a DNA_sequencing method at Harvard University, while working as a student in the laboratory of Walter Gilbert ....
 at Harvard
Harvard University

Harvard University is a private university in Cambridge, Massachusetts, Massachusetts, United States, and a member of the Ivy League. Founded in 1636 by the colonial Massachusetts legislature, Harvard is the Colonial Colleges institution of higher learning in the United States....
, a number of laborious methods were used. For instance, in 1973, Gilbert and Maxam reported the sequence of 24 basepairs using a method known as wandering-spot analysis.

The chain-termination method developed by Sanger and coworkers in 1975 soon became the method of choice, owing to its relative ease and reliability.

Maxam-Gilbert sequencing


In 1976-1977, Allan Maxam
Allan Maxam

Allan Maxam is one of the pioneers of molecular genetics. He was one of the contributors to develop a DNA_sequencing method at Harvard University, while working as a student in the laboratory of Walter Gilbert ....
 and Walter Gilbert
Walter Gilbert

Walter Gilbert is an United States Physics, Biochemistry, molecular biology pioneer, and Nobel laureate....
 developed a DNA sequencing method based on chemical modification of DNA and subsequent cleavage at specific bases . Although Maxam and Gilbert published their chemical sequencing method two years after the ground-breaking paper of Sanger and Coulson on plus-minus sequencing, Maxam-Gilbert sequencing rapidly became more popular, since purified DNA could be used directly, while the initial Sanger method required that each read start be cloned for production of single-stranded DNA. However, with the improvement of the chain-termination method (see below), Maxam-Gilbert sequencing has fallen out of favour due to its technical complexity prohibiting its use in standard molecular biology kits, extensive use of hazardous chemicals, and difficulties with scale-up.

The method requires radioactive labelling at one end and purification of the DNA fragment to be sequenced. Chemical treatment generates breaks at a small proportion of one or two of the four nucleotide bases in each of four reactions (G, A+G, C, C+T). Thus a series of labelled fragments is generated, from the radiolabelled end to the first 'cut' site in each molecule. The fragments in the four reactions are arranged side by side in gel electrophoresis
Gel electrophoresis

Gel electrophoresis is a technique used for the separation of DNA , RNA , or protein molecules using an electric current applied to a gel matrix....
 for size separation. To visualize the fragments, the gel is exposed to X-ray film
Radiography

Radiography is the use of X-rays to view unseen or hard-to-image objects. The main diagnostic purposes of X-rays are to see inside one's body, most commonly the bones which can be viewed at an optimum resolution ....
 for autoradiography, yielding a series of dark bands each corresponding to a radiolabelled DNA fragment, from which the sequence may be inferred.

Also sometimes known as 'chemical sequencing', this method originated in the study of DNA-protein interactions (footprinting), nucleic acid structure and epigenetic modifications to DNA, and within these it still has important applications.

Chain-termination methods


Sequencing
Because the chain-terminator method (or Sanger method after its developer Frederick Sanger) is more efficient and uses fewer toxic chemicals and lower amounts of radioactivity than the method of Maxam and Gilbert, it rapidly became the method of choice. The key principle of the Sanger method was the use of dideoxynucleotide
Dideoxynucleotides

Dideoxynucleotides, or ddNTPs, are nucleotides lacking a 3'-hydroxyl group on their deoxyribose sugar. Since deoxyribose already lacks a 2'-OH, dideoxyribose lacks hydroxyl groups at both its 2' and 3' carbons....
 triphosphates (ddNTPs) as DNA chain terminators.

The classical chain-termination method requires a single-stranded DNA template, a DNA primer
Primer (molecular biology)

A primer is a strand of nucleic acid that serves as a starting point for DNA replication. They are required because the enzymes that catalyze replication, DNA polymerases, can only add new nucleotides to an existing strand of DNA....
, a DNA polymerase
DNA polymerase

A DNA polymerase is an enzyme that catalyze the polymerization of deoxyribonucleotides into a DNA strand. DNA polymerases are best-known for their role in DNA replication, in which the polymerase "reads" an intact DNA strand as a template and uses it to synthesize the new strand....
, radioactively or fluorescent
Fluorescence

Fluorescence is a luminescence that is mostly found as an optical phenomenon in cold bodies, in which the molecular absorption of a photon triggers the emission of a photon with a longer wavelength....
ly labeled nucleotides, and modified nucleotides that terminate DNA strand elongation. The DNA sample is divided into four separate sequencing reactions, containing all four of the standard deoxynucleotides (dATP, dGTP, dCTP and dTTP) and the DNA polymerase
DNA polymerase

A DNA polymerase is an enzyme that catalyze the polymerization of deoxyribonucleotides into a DNA strand. DNA polymerases are best-known for their role in DNA replication, in which the polymerase "reads" an intact DNA strand as a template and uses it to synthesize the new strand....
. To each reaction is added only one of the four dideoxynucleotides
Dideoxynucleotides

Dideoxynucleotides, or ddNTPs, are nucleotides lacking a 3'-hydroxyl group on their deoxyribose sugar. Since deoxyribose already lacks a 2'-OH, dideoxyribose lacks hydroxyl groups at both its 2' and 3' carbons....
 (ddATP, ddGTP, ddCTP, or ddTTP) which are the chain-terminating nucleotides, lacking a 3'-OH
Hydroxyl

Hydroxyl in chemistry stands for a molecule consisting of an oxygen atom and a hydrogen atom connected by a covalent bond. The neutral form is a hydroxyl Radical and the hydroxyl anion is called a hydroxide....
 group required for the formation of a phosphodiester bond
Phosphodiester bond

A phosphodiester bond is a group of strong covalent bond between the Phosphorus in a phosphate group and two other molecules over two ester bonds....
 between two nucleotides, thus terminating DNA strand extension and resulting in various DNA fragments of varying length.

The newly synthesized and labeled DNA fragments are heat denatured
Melting temperature

Melting temperature may refer to:* Melting point, the temperature at which a substance changes from solid to liquid state.* DNA melting temperature, the temperature at which a DNA double helix dissociates into single strands....
, and separated by size (with a resolution of just one nucleotide) by gel electrophoresis
Gel electrophoresis

Gel electrophoresis is a technique used for the separation of DNA , RNA , or protein molecules using an electric current applied to a gel matrix....
 on a denaturing polyacrylamide
Polyacrylamide gel

A Polyacrylamide Gel is a separation matrix used in Gel electrophoresis of biomolecules, such as proteins or DNA fragments. Traditional DNA sequencing techniques such as Dideoxy termination#Maxam-Gilbert sequencing or Dideoxy termination#Chain termination method methods used polyacrylamide gels to separate DNA fragments differing by a single...
-urea gel with each of the four reactions run in one of four individual lanes (lanes A, T, G, C); the DNA bands are then visualized by autoradiography or UV light, and the DNA sequence can be directly read off the X-ray film
Radiography

Radiography is the use of X-rays to view unseen or hard-to-image objects. The main diagnostic purposes of X-rays are to see inside one's body, most commonly the bones which can be viewed at an optimum resolution ....
 or gel image. In the image on the right, X-ray film was exposed to the gel, and the dark bands correspond to DNA fragments of different lengths. A dark band in a lane indicates a DNA fragment that is the result of chain termination after incorporation of a dideoxynucleotide (ddATP, ddGTP, ddCTP, or ddTTP). The relative positions of the different bands among the four lanes are then used to read (from bottom to top) the DNA sequence.

Technical variations of chain-termination sequencing include tagging with nucleotides containing radioactive phosphorus for radiolabelling, or using a primer labeled at the 5’ end with a fluorescent
Fluorescence

Fluorescence is a luminescence that is mostly found as an optical phenomenon in cold bodies, in which the molecular absorption of a photon triggers the emission of a photon with a longer wavelength....
 dye. Dye-primer sequencing facilitates reading in an optical system for faster and more economical analysis and automation. The later development by L Hood and coworkers of fluorescently labeled ddNTPs and primers set the stage for automated, high-throughput DNA sequencing. Chain-termination methods have greatly simplified DNA sequencing. For example, chain-termination-based kits are commercially available that contain the reagents needed for sequencing, pre-aliquoted and ready to use. Limitations include non-specific binding of the primer to the DNA, affecting accurate read-out of the DNA sequence, and DNA secondary structures affecting the fidelity of the sequence.

Dye-terminator sequencing


Dye-terminator sequencing utilizes labelling of the chain terminator ddNTPs, which permits sequencing in a single reaction, rather than four reactions as in the labelled-primer method. In dye-terminator sequencing, each of the four dideoxynucleotide chain terminators is labelled with fluorescent dyes, each of which with different wavelength
Wavelength

In physics, wavelength is the distance between repeating units of a propagating wave of a given frequency. It is commonly designated by the Greek language letter lambda ....
s of fluorescence and emission. Owing to its greater expediency and speed, dye-terminator sequencing is now the mainstay in automated sequencing. Its limitations include dye effects due to differences in the incorporation of the dye-labelled chain terminators into the DNA fragment, resulting in unequal peak heights and shapes in the electronic DNA sequence trace chromatogram after capillary electrophoresis
Capillary electrophoresis

Capillary electrophoresis , also known as capillary zone electrophoresis , can be used to separate ionic species by their charge and frictional forces....
 (see figure to the right). This problem has been addressed with the use of modified DNA polymerase enzyme systems and dyes that minimize incorporation variability, as well as methods for eliminating "dye blobs". The dye-terminator sequencing method, along with automated high-throughput DNA sequence analyzers, is now being used for the vast majority of sequencing projects.

Challenges


Common challenges of DNA sequencing include poor quality in the first 15-40 bases of the sequence and deteriorating quality of sequencing traces after 700-900 bases. Base calling software typically gives an estimate of quality to aid in quality trimming.

In cases where DNA fragments are cloned before sequencing, the resulting sequence may contain parts of the cloning vector. In contrast, PCR-based cloning and emerging sequencing technologies based on pyrosequencing
Pyrosequencing

Pyrosequencing is a method of DNA sequencing based on the "sequencing by synthesis" principle, which relies on detection of pyrophosphate release on nucleotide incorporation rather than chain termination with dideoxynucleotides....
 often avoid using cloning vectors.

Automation and sample preparation


Sanger Sequencing Read Display
Automated DNA-sequencing instruments (DNA sequencers) can sequence up to 384 DNA samples in a single batch (run) in up to 24 runs a day. DNA sequencers carry out capillary electrophoresis
Capillary electrophoresis

Capillary electrophoresis , also known as capillary zone electrophoresis , can be used to separate ionic species by their charge and frictional forces....
 for size separation, detection and recording of dye fluorescence, and data output as fluorescent peak trace chromatograms. Sequencing reactions by thermocycling, cleanup and re-suspension in a buffer solution
Buffer solution

A buffer solution is an aqueous solution consisting of a mixture of a weak acid and its conjugate base or a weak base and its conjugate acid. It has the property that the pH of the solution changes very little when a small amount of acid or base is added to it....
 before loading onto the sequencer are performed separately. A number of commercial and non-commercial software packages can trim low-quality DNA traces automatically. These programs score the quality of each peak and remove low-quality base peaks (generally located at the ends of the sequence). The accuracy of such algorithms is below visual examination by a human operator, but sufficient for automated processing of large sequence data sets.

Large-scale sequencing strategies


Current methods can directly sequence only relatively short (300-1000 nucleotides long) DNA fragments in a single reaction. . The main obstacle to sequencing DNA fragments above this size limit is insufficient power of separation for resolving large DNA fragments that differ in length by only one nucleotide.

Large-scale sequencing aims at sequencing very long DNA pieces, such as whole chromosome
Chromosome

A chromosome is an organized structure of DNA and protein that is found in Cell . A chromosome is a single piece of DNA that contains many genes, regulatory sequence and other genetic sequence....
s. Common approaches consist of cutting (with restriction enzyme
Restriction enzyme

A restriction enzyme is an enzyme that cuts double-stranded or single stranded DNA at specific recognition nucleotide sequences known as restriction sites....
s) or shearing (with mechanical forces) large DNA fragments into shorter DNA fragments. The fragmented DNA is cloned into a DNA vector, and amplified in Escherichia coli
Escherichia coli

'Escherichia coli' , is a Gram negative bacterium that is commonly found in the lower gastrointestinal tract of warm-blooded animals. Most E....
. Short DNA fragments purified from individual bacterial colonies are individually sequenced and assembled electronically
Sequence assembly

In bioinformatics, sequence assembly refers to sequence alignment and merging fragments of a much longer DNA sequence in order to reconstruct the original sequence....
 into one long, contiguous sequence. This method does not require any pre-existing information about the sequence of the DNA and is referred to as de novo sequencing. Gaps in the assembled sequence may be filled by primer walking
Primer walking

Primer walking is a sequencing method of choice for sequencing DNA fragments between 1.3 and 7 kilobases. Such fragments are too long to be sequenced in a single sequence read using the chain termination method....
. The different strategies have different tradeoffs in speed and accuracy; shotgun methods are often used for sequencing large genomes, but its assembly is complex and difficult, particularly with sequence repeats often causing gaps in genome assembly.

New sequencing methods


High-throughput sequencing

The high demand for low-cost sequencing has driven the development of high-throughput sequencing technologies that parallelize
Multiplex (assay)

A multiplex assay is a type of laboratory procedure that performs multiple assays concurrently. It is distinguished from procedures that perform one or a few assays at a time....
 the sequencing process, producing thousands or millions of sequences at once. High-throughput sequencing technologies are intended to lower the cost of DNA sequencing beyond what is possible with standard dye-terminator methods.

In vitro clonal amplification Molecular detection methods are not sensitive enough for single molecule sequencing, so most approaches use an in vitro cloning step to amplify individual DNA molecules. Emulsion PCR isolates individual DNA molecules along with primer-coated beads in aqueous bubbles within an oil phase. Polymerase chain reaction
Polymerase chain reaction

The polymerase chain reaction is a technique widely used in molecular biology. It derives its name from one of its key components, a DNA polymerase used to amplify a piece of DNA by in vitro enzyme DNA replication....
 (PCR) then coats each bead with clonal copies of the DNA molecule followed by immobilization for later sequencing. Emulsion PCR is used in the methods by Marguilis et al. (commercialized by 454 Life Sciences
454 Life Sciences

454 Life Sciences, a Roche company, is a biotechnology company based in Branford, Connecticut specializing in high-throughput DNA sequencing using a novel massively parallel sequencing-by-synthesis approach....
), Shendure and Porreca et al. (also known as "polony sequencing") and SOLiD sequencing, (developed by Agencourt, now Applied Biosystems
Applied Biosystems

Applied Biosystems, Inc. is the original name of a pioneer biotechnology company founded in 1981 in Foster City, California, in the San Francisco Bay Area....
). Another method for in vitro clonal amplification is bridge PCR, where fragments are amplified upon primers attached to a solid surface. The single-molecule method developed by Stephen Quake's laboratory (later commercialized by Helicos) skips this amplification step, directly fixing DNA molecules to a surface.

Parallelized sequencing DNA molecules are physically bound to a surface, and sequenced in parallel.Sequencing by synthesis, like dye-termination electrophoretic sequencing, uses a DNA polymerase
DNA polymerase

A DNA polymerase is an enzyme that catalyze the polymerization of deoxyribonucleotides into a DNA strand. DNA polymerases are best-known for their role in DNA replication, in which the polymerase "reads" an intact DNA strand as a template and uses it to synthesize the new strand....
 to determine the base sequence. Reversible terminator methods (used by Illumina and Helicos) use reversible versions of dye-terminators, adding one nucleotide at a time, detect fluorescence at each position in real time, by repeated removal of the blocking group to allow polymerization of another nucleotide. Pyrosequencing
Pyrosequencing

Pyrosequencing is a method of DNA sequencing based on the "sequencing by synthesis" principle, which relies on detection of pyrophosphate release on nucleotide incorporation rather than chain termination with dideoxynucleotides....
 (used by 454) also uses DNA polymerization, adding one nucleotide species at a time and detecting and quantifying the number of nucleotides added to a given location through the light emitted by the release of attached pyrophosphates.

Sequencing by ligation
Sequencing by ligation

Sequencing by ligation is a DNA sequencing method that uses the enzyme DNA ligase to identify the nucleotide present at a given position in a DNA sequence....
This enzymatic sequencing method uses a DNA ligase
DNA ligase

In molecular biology, DNA ligase is a special type of ligase that can link together two DNA strands that have double-strand break . The alternative, a single-strand break, is fixed by a different type of DNA ligase using the Complementary DNA as a template but still requires DNA ligase to create the final phosphodiester bond to fully repair...
 to determine the target sequence. Used in the polony method and in the SOLiD technology, it uses a pool of all possible oligonucleotides of a fixed length, labeled according to the sequenced position. Oligonucleotides are annealed and ligated; the preferential ligation by DNA ligase for matching sequences results in a signal informative of the nucleotide at that position.

Microfluidic Sanger Sequencing In microfluidic Sanger sequencing
Microfluidic Sanger Sequencing

The completion of the Human genome project has been a cornerstone in the advancement of biological studies. The outcomes of obtaining a complete reference map of the human genome have ushered in the post-genome era of studies....
 the entire thermocycling amplification of DNA fragments as well as their separation by electrophoresis is done on a single chip (approximately 100 cm in diameter) thus reducing the reagent usage as well as cost. In some instances researchers have shown that they can increase the through-put of conventional sequencing through the use of microchips. Research will still need to be done in order to make this use of technology effective.

Other sequencing technologies

Sequencing by hybridization
Sequencing by Hybridization

Sequencing by Hybridization is a class of methods for determining the order in which nucleotides occur on a strand of DNA. Typically used for looking for small changes relative to a known DNA sequence....
 is a non-enzymatic method that uses a DNA microarray
DNA microarray

A DNA microarray is a multiplex technology used in molecular biology and in medicine. It consists of an arrayed series of thousands of microscopic spots of DNA oligonucleotides, called features, each containing picoMole s of a specific DNA sequence....
. A single pool of DNA whose sequence is to be determined is fluorescently labeled and hybridized to an array containing known sequences. Strong hybridization signals from a given spot on the array identifies its sequence in the DNA being sequenced. Mass spectrometry
Mass spectrometry

Mass spectrometry is an analytical technique for the determination of the elemental composition of a sample or molecule. It is also used for elucidating the chemical structures of molecules, such as peptides and other chemical compounds....
 may be used to determine mass differences between DNA fragments produced in chain-termination reactions.

DNA sequencing methods currently under development include labeling the DNA polymerase, reading the sequence as a DNA strand transits through nanopores
Nanopore sequencing

Nanopore sequencing is a method under development since 1995 for determining the order in which nucleotides occur on a strand of DNA.A nanopore is simply a small hole, of the order of 1 nanometer in internal diameter....
, and microscopy-based techniques, such as AFM
Atomic force microscope

The atomic force microscope or scanning force microscope is a very high-resolution type of Scanning probe microscopy, with demonstrated resolution of fractions of a nanometer, more than 1000 times better than the diffraction limited....
 or electron microscopy
Electron microscope

An electron microscope is a type of microscope that uses a particle beam of electrons to illuminate a specimen and create a highly-magnified image....
 that are used to identify the positions of individual nucleotides within long DNA fragments (>5,000 bp) by nucleotide labeling with heavier elements (e.g., halogens) for visual detection and recording.

In October 2006, the X Prize Foundation
X Prize Foundation

The X PRIZE Foundation is a non-profit organisation prize institute that designs and manages public competitions for the benefit of humanity....
 established an initiative to promote the development of full genome sequencing
Full genome sequencing

Full genome sequencing , also known as whole genome sequencing, complete genome sequencing, or entire genome sequencing, is a laboratory process that determines the complete DNA sequence of an organism's genome at a single time....
 technologies, called the Archon X Prize
Archon X Prize

The Archon X PRIZE for Genomics, the second Ansari X PRIZE to be offered by the X PRIZE Foundation, based in Santa Monica, California, was announced on October 4, 2006....
, intending to award $10 million to "the first Team that can build a device and use it to sequence 100 human genomes within 10 days or less, with an accuracy of no more than one error in every 100,000 bases sequenced, with sequences accurately covering at least 98% of the genome, and at a recurring cost of no more than $10,000 (US) per genome."

Major landmarks in DNA sequencing


  • 1953
    1953 in science

    The year 1953 in science and technology involved some significant events, listed below....
     Discovery of the structure of the DNA double helix.


  • 1972
    1972 in science

    The year 1972 in science and technology involved some significant events, listed below....
     Development of recombinant DNA
    Recombinant DNA

    Recombinant DNA is a form of synthetic DNA thereby combining DNA sequences that would not normally occur together. In terms of genetic modification, recombinant DNA is produced through the addition of relevant DNA into an existing organismal genome, such as the plasmid of bacteria, to code for or alter different traits for a specific purpos...
     technology, which permits isolation of defined fragments of DNA; prior to this, the only accessible samples for sequencing were from bacteriophage or virus DNA.


  • 1975
    1975 in science

    The year 1975 in science and technology involved some significant events, listed below....
     The first complete DNA genome to be sequenced is that of bacteriophage fX174


  • 1977
    1977 in science

    The year 1977 in science and technology involved some significant events, listed below....
     Allan Maxam
    Allan Maxam

    Allan Maxam is one of the pioneers of molecular genetics. He was one of the contributors to develop a DNA_sequencing method at Harvard University, while working as a student in the laboratory of Walter Gilbert ....
     and Walter Gilbert
    Walter Gilbert

    Walter Gilbert is an United States Physics, Biochemistry, molecular biology pioneer, and Nobel laureate....
     publish "DNA sequencing by chemical degradation". Frederick Sanger
    Frederick Sanger

    Frederick Sanger, Order of Merit , Order of the Companions of Honour, Order of the British Empire, Royal Society is an England biochemistry and twice a Nobel laureate in chemistry....
    , independently, publishes "DNA sequencing by enzymatic synthesis".


  • 1980
    1980 in science

    The year 1980 in science and technology involved some significant events, listed below....
     Frederick Sanger and Walter Gilbert receive the Nobel Prize in Chemistry
    Nobel Prize in Chemistry

    The Nobel Prize in Chemistry is awarded annually by the Royal Swedish Academy of Sciences to scientists in the various fields of chemistry. It is one of the five Nobel Prizes established by the will of Alfred Nobel in 1895, awarded for outstanding contributions in chemistry, Nobel Prize in Physics, Nobel Prize in Literature, Nobel Peace Pri...


  • 1984
    1984 in science

    The year 1984 in science and technology involved some significant events....
     Medical Research Council
    Medical Research Council (UK)

    The Medical Research Council is a United Kingdom organisation dedicated to "improve human health through world-class medical research"....
     scientists decipher the complete DNA sequence of the Epstein-Barr virus
    Epstein-Barr virus

    The Epstein-Barr Virus , also called Human herpesvirus 4 , is a virus of the herpesviridae , and is one of the most common viruses in humans....
    , 170 kb.


  • 1986
    1986 in science

    The year 1986 in science and technology involved many significant events, some listed below....
     Leroy E. Hood's laboratory at the California Institute of Technology
    California Institute of Technology

    The California Institute of Technology is a private university research university located in Pasadena, California, United States. Caltech maintains a strong emphasis on the natural sciences and engineering....
     and Smith announce the first semi-automated DNA sequencing machine.


  • 1987
    1987 in science

    The year 1987 in science and technology involved many significant events, some listed below....
     Applied Biosystems markets first automated sequencing machine, the model ABI 370.


  • 1990
    1990 in science

    The year 1990 in science and technology involved some significant events....
     The U.S. National Institutes of Health
    National Institutes of Health

    The National Institutes of Health is an agency of the United States Department of Health and Human Services and is the primary agency of the United States government responsible for biomedical and health-related research....
     (NIH) begins large-scale sequencing trials on Mycoplasma capricolum, Escherichia coli
    Escherichia coli

    'Escherichia coli' , is a Gram negative bacterium that is commonly found in the lower gastrointestinal tract of warm-blooded animals. Most E....
    , Caenorhabditis elegans
    Caenorhabditis elegans

    'Caenorhabditis elegans' is a free-living, transparent nematode , about 1 mm in length, which lives in temperate soil environments. Research into the molecular biology and developmental biology of C....
    , and Saccharomyces cerevisiae
    Saccharomyces cerevisiae

    Saccharomyces cerevisiae is a species of budding yeast. It is perhaps the most useful yeast owing to its use since ancient times in baking and brewing....
     (at 75 cents (US)/base).


  • Richard Mathies et al. publish dye-based sequencing .


  • 1998
    1998 in science

    The year 1998 in science and technology involved many events, some of which are included below....
     Phil Green and Brent Ewing of Washington University publish phred
    Phred quality score

    DNA sequencing is a molecular biology technique that involves labeling the DNA with fluorescent dyes, separating the DNA by electrophoresis, and measuring the intensity of the fluorescence to determine the base order of individual nucleotides in a strand of DNA....
    for sequencer data analysis .


See also

  • Sequencing
    Sequencing

    In genetics and biochemistry, sequencing means to determine the primary structure of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succinctly summarizes much of the atomic-level structure of the sequenced molecule....
  • Full Genome Sequencing
    Full genome sequencing

    Full genome sequencing , also known as whole genome sequencing, complete genome sequencing, or entire genome sequencing, is a laboratory process that determines the complete DNA sequence of an organism's genome at a single time....
  • Genome project
    Genome project

    Genome projects are scientific endeavours that ultimately aim to determine the complete genome sequence of an organism . The genome sequence for any organism requires the DNA sequences for each of the chromosomes in an organism to be determined....
  • Single Molecule Real Time Sequencing
    Single Molecule Real Time Sequencing

    Single Molecule Real Time Sequencing is a parallelized single molecule DNA sequencing by synthesis technology developed by . Pacific Biosciences is a company based in Menlo Park, California, which was founded in 2004....
  • Applied Biosystems
    Applied Biosystems

    Applied Biosystems, Inc. is the original name of a pioneer biotechnology company founded in 1981 in Foster City, California, in the San Francisco Bay Area....
  • 454 Life Sciences
    454 Life Sciences

    454 Life Sciences, a Roche company, is a biotechnology company based in Branford, Connecticut specializing in high-throughput DNA sequencing using a novel massively parallel sequencing-by-synthesis approach....
  • Illumina (company)
    Illumina (company)

    Illumina, Inc. , incorporated in April 1998, develops, manufactures and markets integrated systems for the analysis of genetic variation and biological function....
  • Pacific Biosciences
  • Complete Genomics
  • Joint Genome Institute
    Joint Genome Institute

    The DOE Joint Genome Institute was created in 1997 to unite the expertise and resources in genome mapping, DNA sequencing, technology development, and information sciences pioneered at the DOE genome centers at Lawrence Berkeley National Laboratory , Lawrence Livermore National Laboratory , and Los Alamos National Laboratory ....
  • DNA field-effect transistor
  • DNA sequencing theory
    DNA sequencing theory

    DNA sequencing theory is the broad body of work that attempts to lay analytical foundations for DNA sequencing. The practical aspects revolve around designing and optimizing sequencing projects, predicting project performance, troubleshooting experimental results, characterizing factors such as sequence bias and the effects of software proces...


External links


  • - Single Molecule Real Time (SMRT) sequencing