All Topics  
Molecular phylogeny

 

   Email Print
   Bookmark   Link






 

Molecular phylogeny



 
 
Molecular phylogenetics, also known as molecular systematics, is the use of the structure of molecule
Molecule

In chemistry, a molecule is defined as a sufficiently stable, electric charge neutral group of at least two atoms in a definite arrangement held together by very strong chemical bonds....
s to gain information on an organism's evolutionary relationships. The result of a molecular phylogenetic
Phylogenetics

In biology, phylogenetics is the study of evolutionary relatedness among various groups of organisms , which is discovered through molecular sequencing data and morphological data matrices....
 analysis is expressed in a phylogenetic tree
Phylogenetic tree

A phylogenetic tree or evolutionary tree is a tree showing the evolutionary relationships among various biological species or other entities that are believed to have a common descent....
.

y living organism
Organism

In biology, an organism is any life thing . In at least some form, all organisms are capable of response to stimulus , reproduction, growth and developmental biology, and maintenance of homeostasis as a stable whole....
 contains DNA
DNA

Deoxyribonucleic acid is a nucleic acid that contains the genetics instructions used in the development and functioning of all known living organisms and some viruses....
, RNA
RNA

Ribonucleic acid is a type of molecule that consists of a long chain of nucleotide units. Each nucleotide consists of a nucleobase, a ribose sugar, and a phosphate....
, and protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
s. Closely related organisms generally have a high degree of agreement in the molecular structure of these substances, while the molecules of organisms distantly related usually show a pattern of dissimilarity.






Discussion
Ask a question about 'Molecular phylogeny'
Start a new discussion about 'Molecular phylogeny'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Molecular phylogenetics, also known as molecular systematics, is the use of the structure of molecule
Molecule

In chemistry, a molecule is defined as a sufficiently stable, electric charge neutral group of at least two atoms in a definite arrangement held together by very strong chemical bonds....
s to gain information on an organism's evolutionary relationships. The result of a molecular phylogenetic
Phylogenetics

In biology, phylogenetics is the study of evolutionary relatedness among various groups of organisms , which is discovered through molecular sequencing data and morphological data matrices....
 analysis is expressed in a phylogenetic tree
Phylogenetic tree

A phylogenetic tree or evolutionary tree is a tree showing the evolutionary relationships among various biological species or other entities that are believed to have a common descent....
.

Techniques and applications

Every living organism
Organism

In biology, an organism is any life thing . In at least some form, all organisms are capable of response to stimulus , reproduction, growth and developmental biology, and maintenance of homeostasis as a stable whole....
 contains DNA
DNA

Deoxyribonucleic acid is a nucleic acid that contains the genetics instructions used in the development and functioning of all known living organisms and some viruses....
, RNA
RNA

Ribonucleic acid is a type of molecule that consists of a long chain of nucleotide units. Each nucleotide consists of a nucleobase, a ribose sugar, and a phosphate....
, and protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
s. Closely related organisms generally have a high degree of agreement in the molecular structure of these substances, while the molecules of organisms distantly related usually show a pattern of dissimilarity. Molecular phylogeny uses such data to build a "relationship tree" that shows the probable evolution
Evolution

In biology, evolution is change in the heritability trait of a population of organisms from one generation to the next. These changes are caused by a combination of three main processes: variation, reproduction, and selection....
 of various organisms. Not until recent decades, however, has it been possible to isolate and identify these molecular structures.

The most common approach is the comparison of sequences for genes using sequence alignment
Sequence alignment

In bioinformatics, a sequence alignment is a way of arranging the sequences of DNA, RNA, or protein to identify regions of similarity that may be a consequence of functional, structural biology, or evolutionary relationships between the sequences....
 techniques to identify similarity. Another application of molecular phylogeny is in DNA barcoding
DNA barcoding

DNA barcoding is a Taxonomy method that uses a short genetic marker in an organism's mtDNA to identify it as belonging to a particular species. It is based on a relatively simple concept: most eukaryote cells contain mitochondria and mitochondrial DNA has a relatively fast mutation rate, which results in significant variance in mtDNA sequenc...
, where the species of an individual organism is identified using small sections of mitochondrial DNA
Mitochondrial DNA

Mitochondrial DNA is the DNA located in organelles called mitochondrion. Most other DNA present in eukaryotic organisms is found in the cell nucleus....
. Another application of the techniques that make this possible can be seen in the very limited field of human genetics, such as the ever more popular use of genetic testing
Genetic testing

Genetic testing allows the Genetics diagnosis of vulnerabilities to inherit diseases, and can also be used to determine a person's ancestry. Normally, every person carries two copies of every gene, one inherited from their mother, one inherited from their father....
 to determine a child's paternity, as well as the emergence of a new branch of criminal forensics
Forensics

Forensic science is the application of a broad spectrum of sciences to answer questions of interest to the legal system. This may be in relation to a crime or to a civil action....
 focused on evidence known as genetic fingerprinting
Genetic fingerprinting

DNA profiling is a technique employed by forensic scientists to assist in the identification of individuals on the basis of their respective DNA profiles....
.

The effect on traditional biological classification schemes in the biological sciences has been dramatic as well. Work that was once immensely labor- and materials-intensive can now be done quickly and easily, leading to yet another source of information becoming available for systematic and taxonomic appraisal. This particular kind of data has become so popular that taxonomical schemes based solely on molecular data may be encountered.

Theoretical background

Early attempts at molecular systematics were also termed as chemotaxonomy
Chemotaxonomy

Chemotaxonomy , also called chemosystematics, is the attempt to classify and identify organisms , according to demonstrable differences and similarities in their biochemistry compositions....
 and made use of proteins, enzyme
Enzyme

Enzymes are biomolecules that catalysis chemical reactions. Almost all enzymes are proteins. In enzymatic reactions, the molecules at the beginning of the process are called Substrate , and the enzyme converts them into different molecules, the products....
s, carbohydrate
Carbohydrate

Carbohydrates or saccharides are the most abundant of the four major classes of biomolecules. They fill numerous roles in living things, such as the storage and transport of energy and structural components ....
s and other molecules which were separated and characterized using techniques such as chromatography
Chromatography

Chromatography is the collective term for a family of laboratory techniques for the separation of mixtures. It involves passing a mixture dissolved in a "mobile phase" through a stationary phase, which separates the analyte to be measured from other molecules in the mixture and allows it to be isolated....
. These have been largely replaced in recent times by DNA sequencing
DNA sequencing

The term DNA sequencing refers to methods for determining the order of the nucleotide bases, adenine, guanine, cytosine, and thymine, in a molecule of DNA....
 which produces the exact sequences of nucleotides or bases in either DNA or RNA segments extracted using different techniques. These are generally considered superior for evolutionary studies since the actions of evolution are ultimately reflected in the genetic sequences. At present it is still a long and expensive process to sequence the entire DNA of an organism (its genome
Genome

In classical genetics, the genome of a diploid organism including eukarya refers to a full set of chromosomes or genes in a gamete; thereby, a regular somatic cell contains two full sets of genomes....
), and this has been done for only a few species. However it is quite feasible to determine the sequence of a defined area of a particular chromosome
Chromosome

A chromosome is an organized structure of DNA and protein that is found in Cell . A chromosome is a single piece of DNA that contains many genes, regulatory sequence and other genetic sequence....
. Typical molecular systematic analyses require the sequencing of around 1000 base pair
Base pair

In molecular biology, two nucleotides on opposite complementarity DNA or RNA strands that are connected via hydrogen bonds are called a base pair ....
s. At any location within such a sequence, the bases found in a given position may vary between organisms. The particular sequence found in a given organism is referred to as its haplotype
Haplotype

The term haplotype is a contraction of the term "Ploidy genotype." In genetics, a haplotype is a combination of alleles at multiple locus that are transmitted together on the same chromosome....
. In principle, since there are four base types, with 1000 base pairs, we could have 41000 distinct haplotypes. However, for organisms within a particular species or in a group of related species, it has been found empirically that only a minority of sites show any variation at all and most of the variations that are found are correlated, so that the number of distinct haplotypes that are found is relatively small.

In a molecular systematic analysis, the haplotypes are determined for a defined area of genetic material; ideally a substantial sample of individuals of the target species
Species

In biology, a species is one of the basic units of biological classification and a taxonomic rank. A species is often defined as a group of organisms capable of interbreeding and producing fertile offspring....
 or other taxon
Taxon

A taxon or taxonomic unit is a name designating an organism or a group of organisms. In biological nomenclature according to Carl Linnaeus, a taxon is assigned a taxonomic rank and can be placed at a particular level in a systematic hierarchy reflecting evolutionary relationships....
 are used however many current studies are based on single individuals. Haplotypes of individuals of closely related, but supposedly different, taxa are also determined. Finally, haplotypes from a smaller number of individuals from a definitely different taxon are determined: these are referred to as an out group. The base sequences for the haplotypes are then compared. In the simplest case, the difference between two haplotypes is assessed by counting the number of locations where they have different bases: this is referred to as the number of substitutions (other kinds of differences between haplotypes can also occur, for example the insertion of a section of nucleic acid
Nucleic acid

A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
 in one haplotype that is not present in another). Usually the difference between organisms is re-expressed as a percentage divergence, by dividing the number of substitutions by the number of base pairs analysed: the hope is that this measure will be independent of the location and length of the section of DNA that is sequenced.

An older and superseded approach was to determine the divergences between the genotype
Genotype

The genotype is the trait we can't see. The genotype is the Genetics constitution of a cell, an organism, or an individual usually with reference to a specific character under consideration....
s of individuals by DNA-DNA hybridisation
DNA-DNA hybridisation

DNA-DNA hybridization generally refers to a molecular biology technique that measures the degree of genetic similarity between pools of DNA sequences....
. The advantage claimed for using hybridisation rather than gene sequencing was that it was based on the entire genotype, rather than on particular sections of DNA. Modern sequence comparison techniques overcome this objection by the use of multiple sequences.

Once the divergences between all pairs of samples have been determined, the resulting triangular matrix
Triangular matrix

In the mathematics discipline of linear algebra, a triangular matrix is a special kind of square matrix where the entries either below or above the main diagonal are zero....
 of differences is submitted to some form of statistical cluster analysis, and the resulting dendrogram
Dendrogram

A dendrogram is a Tree diagram frequently used to illustrate the arrangement of the clusters produced by a Cluster analysis. Dendrograms are often used in computational biology to illustrate the clustering of genes....
 is examined in order to see whether the samples cluster in the way that would be expected from current ideas about the taxonomy of the group, or not. Any group of haplotypes that are all more similar to one another than any of them is to any other haplotype may be said to constitute a clade
Clade

A clade is a term used in modern alpha taxonomy, the scientific classification of living and fossil organisms, to describe a monophyletic group, defined as a group consisting of a single common ancestor and all its descendants.The term "monophyletic group" is used in this article in the conventional sense of "an a...
. Statistical techniques such as bootstrapping
Bootstrapping (statistics)

In statistics, bootstrapping is a modern, computer-intensive, general purpose approach to statistical inference, falling within a broader class of Resampling methods....
 and jackknifing
Resampling (statistics)

In statistics, resampling is any of a variety of methods for doing one of the following:# Estimating the precision of sample statistics by using subsets of available data or drawing randomly with replacement from a set of data points ...
 help in providing reliability estimates for the positions of haplotypes within the evolutionary trees.

Characteristics and assumptions of molecular systematics

This example illustrates several characteristics of molecular systematics and its underlying assumptions.
  1. Molecular systematics is an essentially cladistic
    Cladistics

    Cladistics is the hierarchical classification of species based on evolutionary ancestry. Cladistics is distinguished from other taxonomic systems because it focuses on evolution rather than similarities between species, and because it places heavy emphasis on objective, quantitative analysis....
     approach: it assumes that classification must correspond to phylogenetic descent, and that all valid taxa must be monophyletic.
  2. Molecular systematics often uses the molecular clock
    Molecular clock

    The molecular clock is a technique in molecular evolution to relate the time that two species speciation to the number of molecular differences measured between the species' DNA sequences or proteins....
     assumption that quantitative similarity of genotype is a sufficient measure of the recency of genetic divergence. Particularly in relation to speciation
    Speciation

    Speciation is the evolutionary process by which new biological species arise. The biologist Orator F. Cook seems to have been the first to coin the term 'speciation' for the splitting of lineages or 'cladogenesis,' as opposed to 'anagenesis' or 'phyletic evolution' occurring within lineages....
    , this assumption could be wrong if either
    1. some relatively small genotypic modification acted to prevent interbreeding between two groups of organisms, or
    2. in different subgroups of the organisms being considered, genetic modification proceeded at different rates.
  3. In animals, it is often convenient to use mitochondrial DNA for molecular systematic analysis. However, because in mammal
    Mammal

    Mammals are a class of vertebrate animals whose name is derived from their distinctive feature, mammary glands, with which they feed their young....
    s mitochondria
    Mitochondrion

    In cell biology, a mitochondrion is a membrane-enclosed organelle found in most eukaryote cell . These organelles range from 0.5–10 micrometers in diameter....
     are inherited only from the mother, this is not fully satisfactory, because inheritance in the paternal line might not be detected: in the example above, Vilà et al. cite more limited studies with chromosomal DNA that support their conclusions.


These characteristics and assumptions are not wholly uncontroversial among biological systematists. As a cladistic method, molecular systematics is open to the same criticisms as cladistics in general. It can also be argued that it is a mistake to replace a classification based on visible and ecologically relevant characteristics by one based on genetic details that may not even be expressed in the phenotype. However the molecular approach to systematics, and its underlying assumptions, are gaining increasing acceptance. As gene sequencing becomes easier and cheaper, molecular systematics is being applied to more and more groups, and in some cases is leading to radical revisions of accepted taxonomies.

History of molecular phylogenetics


Molecular systematics was pioneered by Charles G. Sibley
Charles Sibley

Charles Gald Sibley was an United States ornithologist and molecular biologist. He had an immense influence on the scientific classification of birds, and the work that Sibley initiated has substantially altered our understanding of the evolutionary history of modern birds....
 (bird
Bird

Birds are wing, Bipedalismal, endothermic , vertebrate animals that lay egg . There are around 10,000 living species, making them the most numerous tetrapod vertebrates....
s), Herbert C. Dessauer (herpetology
Herpetology

Herpetology is the branch of zoology concerned with the study of amphibians and of reptiles .Herpetology is concerned with poikilothermic, or ectothermic, tetrapods....
), and Morris Goodman
Morris Goodman

Morris Goodman is an American scientist known for his work in molecular evolution and molecular systematics. Goodman is a Distinguished Professor at the Center for Molecular Medicine and Genetics at Wayne State University School of Medicine....
 (primate
Primate

A primate is a member of the biological order Primates , the group that contains lemurs, the Aye-aye, Lorisidaes, galagos, tarsiers, monkeys, and apes, with the last category including humans....
s), followed by Allan C. Wilson
Allan Wilson

Allan Charles Wilson was a pioneer in the use of Molecular evolution approaches to understand evolutionary change and reconstruct phylogeny. One of the great innovators of science, he revolutionised the study of human evolution....
, Robert K. Selander, and John C. Avise (who studied various groups). Work with protein electrophoresis
Protein electrophoresis

In medicine, protein electrophoresis is a method of analysing a mixture of proteins by means of gel electrophoresis, mainly in blood blood plasma ....
 began around 1956. Although the results were not quantitative and did not initially improve on morphological classification, they provided tantalizing hints that long-held notions of the classifications of bird
Bird

Birds are wing, Bipedalismal, endothermic , vertebrate animals that lay egg . There are around 10,000 living species, making them the most numerous tetrapod vertebrates....
s, for example, needed substantial revision. In the period of 1974–1986, DNA-DNA hybridization was the dominant technique.

Further reading

  • Felsenstein, J.
    Joe Felsenstein

    Joseph "Joe" Felsenstein is Professor in the Departments of Genome Sciences and Biology and Adjunct Professor in the Departments of Computer Science and Statistics at the University of Washington in Seattle....
     2004. Inferring phylogenies. Sinauer Associates Incorporated. ISBN 0-87893-177-5.
  • Hillis, D. M. & Moritz, C. 1996. Molecular systematics. 2nd ed. Sinauer Associates Incorporated. ISBN 0-87893-282-8.
  • Page, R. D. M. & Holmes, E. C. 1998. Molecular evolution: a phylogenetic approach. Blackwell Science, Oxford. ISBN 0-86542-889-1.


See also

  • molecular evolution
    Molecular evolution

    Molecular evolution is the process of evolution at the scale of DNA, RNA, and proteins. Molecular evolution emerged as a scientific field in the 1960s as researchers from molecular biology, evolutionary biology and population genetics sought to understand recent discoveries on the structure and function of nucleic acids and protein....
  • computational phylogenetics
    Computational phylogenetics

    Computational phylogenetics is the application of computational algorithms, methods and programs to Phylogenetics analyses. The goal is to assemble a phylogenetic tree representing a hypothesis about the evolutionary ancestry of a set of genes, species, or other taxa....
  • PhyloCode
    PhyloCode

    The International Code of Phylogenetic Nomenclature, known for short as the PhyloCode, is a developing draft for a formal set of rules governing phylogenetic nomenclature....


External links