All Topics  
Junk DNA

 

   Email Print
   Bookmark   Link






 

Junk DNA



 
 
In evolutionary biology
Evolutionary biology

Evolutionary biology is a sub-field of biology concerned with the origin of species from a common descent and descent of species, as well as their evolution, multiplication and diversity over time....
 and molecular biology
Molecular biology

Molecular biology is the study of biology at a molecule level. The field overlaps with other areas of biology and chemistry, particularly genetics and biochemistry....
, junk DNA is a provisional label for the portions of the DNA
DNA

Deoxyribonucleic acid is a nucleic acid that contains the genetics instructions used in the development and functioning of all known living organisms and some viruses....
 sequence of a chromosome
Chromosome

A chromosome is an organized structure of DNA and protein that is found in Cell . A chromosome is a single piece of DNA that contains many genes, regulatory sequence and other genetic sequence....
 or a genome
Genome

In classical genetics, the genome of a diploid organism including eukarya refers to a full set of chromosomes or genes in a gamete; thereby, a regular somatic cell contains two full sets of genomes....
 for which no function
Function (biology)

A function is part of an answer to a question about why some object or process occurred in a system that evolved through a process of selection....
 has been identified. The term was introduced in 1972 by Susumu Ohno
Susumu Ohno

Susumu Ohno , was an Asian American geneticist and evolutionary biologist, and seminal researcher in the field of molecular evolution....
, but is as of 2008 somewhat outdated, being used mainly in popular science
Popular science

Popular science, sometimes called literature of science, is interpretation of science intended for a general audience. While science journalism focuses on recent scientific developments, popular science is broad-ranging, often written by scientists as well as journalists, and is presented in many formats, which can include books, televi...
 and in a colloquial
Colloquialism

A colloquialism is an expression not used in formal Speech communication, writing or paralinguistics. Colloquialisms are also sometimes referred to collectively as "colloquial language"....
 way in scientific publications. For some sequences once classified as junk DNA, functions have been found, and others are subject to ongoing research.






Discussion
Ask a question about 'Junk DNA'
Start a new discussion about 'Junk DNA'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In evolutionary biology
Evolutionary biology

Evolutionary biology is a sub-field of biology concerned with the origin of species from a common descent and descent of species, as well as their evolution, multiplication and diversity over time....
 and molecular biology
Molecular biology

Molecular biology is the study of biology at a molecule level. The field overlaps with other areas of biology and chemistry, particularly genetics and biochemistry....
, junk DNA is a provisional label for the portions of the DNA
DNA

Deoxyribonucleic acid is a nucleic acid that contains the genetics instructions used in the development and functioning of all known living organisms and some viruses....
 sequence of a chromosome
Chromosome

A chromosome is an organized structure of DNA and protein that is found in Cell . A chromosome is a single piece of DNA that contains many genes, regulatory sequence and other genetic sequence....
 or a genome
Genome

In classical genetics, the genome of a diploid organism including eukarya refers to a full set of chromosomes or genes in a gamete; thereby, a regular somatic cell contains two full sets of genomes....
 for which no function
Function (biology)

A function is part of an answer to a question about why some object or process occurred in a system that evolved through a process of selection....
 has been identified. The term was introduced in 1972 by Susumu Ohno
Susumu Ohno

Susumu Ohno , was an Asian American geneticist and evolutionary biologist, and seminal researcher in the field of molecular evolution....
, but is as of 2008 somewhat outdated, being used mainly in popular science
Popular science

Popular science, sometimes called literature of science, is interpretation of science intended for a general audience. While science journalism focuses on recent scientific developments, popular science is broad-ranging, often written by scientists as well as journalists, and is presented in many formats, which can include books, televi...
 and in a colloquial
Colloquialism

A colloquialism is an expression not used in formal Speech communication, writing or paralinguistics. Colloquialisms are also sometimes referred to collectively as "colloquial language"....
 way in scientific publications. For some sequences once classified as junk DNA, functions have been found, and others are subject to ongoing research. About 95% of the human genome
Human genome

The human genome is the genome of Homo sapiens, which is stored on 23 chromosome pairs. Twenty-two of these are autosome, while the remaining pair is XY sex-determination system....
 has once been designated as "junk", including most sequences within intron
Intron

Introns, derived from the term "intragenic regions" and also called intervening sequence , are DNA regions in a gene that are not translated into proteins....
s and most intergenic DNA
Intergenic region

An Intergenic region is a stretch of DNA sequences located between Gene_cluster of genes that contain few or no genes. Occasionally some intergenic DNA acts to control genes close by, but most of it has no currently known function....
. While much of this sequence may be an evolution
Evolution

In biology, evolution is change in the heritability trait of a population of organisms from one generation to the next. These changes are caused by a combination of three main processes: variation, reproduction, and selection....
ary artifact that serves no present-day purpose, some junk DNA may function in ways that are not currently understood. Moreover, the conservation
Conservation (genetics)

Conservation may refer to:* Conservation genetics - "an interdisciplinary science that aims to apply genetic methods to the conservation and restoration of biodiversity."...
 of some junk DNA over many millions of years of evolution
Evolution

In biology, evolution is change in the heritability trait of a population of organisms from one generation to the next. These changes are caused by a combination of three main processes: variation, reproduction, and selection....
 may imply an essential function. Some consider the "junk" label as something of a misnomer
Misnomer

A misnomer is a term which suggests an interpretation that is known to be untrue. Such incorrect terms sometimes derived their names because of the form, action, or origin of the subject?becoming named popularly or widely referenced?long before their true natures were known....
, but others consider it appropriate as junk is stored away for possible new uses, rather than thrown out; others prefer the term "noncoding DNA
Noncoding DNA

In genetics, non-coding DNA describes DNA which does not contain genetic code for making proteins . In eukaryotes, a large percentage of many organisms' total genome sizes is comprised of noncoding DNA ....
" (although junk DNA often includes transposons that encode proteins with no clear value to their host genome). About 80% of the bases in the human genome may be transcribed, but transcription does not necessarily imply function.

Broadly, the science of functional genomics
Functional genomics

Functional genomics is a field of molecular biology that attempts to make use of the vast wealth of data produced by genomic projects to describe gene functions and interactions....
 has developed widely accepted techniques to characterize protein-coding gene
Gene

A gene is the basic unit of heredity in a living organism. All living things depend on genes. Genes hold the information to build and maintain their cell and pass genetic trait to offspring....
s, RNA genes, and regulatory regions. In the genomes of most plant
Plant

Plants are Life organisms belonging to the Kingdom Plantae. They include familiar organisms such as trees, herbs, bushes, grasses, vines, ferns, mosses, and green algae....
s and animal
Animal

Animals are a major group of multicellular, eukaryotic organisms of the Kingdom Animalia or Metazoa. Their body plan eventually becomes fixed as they develop, although some undergo a process of metamorphosis later on in their life....
s, however, these together constitute only a small percentage of genomic DNA (less than 2% in the case of humans). The function, if any, of the remainder remains under investigation. Most of it can be identified as repetitive elements
Repeated sequence (DNA)

In the study of DNA sequences, one can distinguish two main types of repeated sequence:*Tandem repeats:**Satellite DNA,**Minisatellite,**Microsatellite;...
 that have no known biological function for their host (although they are useful to geneticists for analyzing lineage
Genealogical DNA test

A genealogical DNA test examines the nucleotides at specific locations on a person's DNA for genetic genealogy purposes. The test results are not meant to have any informative medical value and do not determine specific genetic diseases or disorders ; they are intended only to give genealogical information....
 and phylogeny). Still, a large amount of sequence in these genomes falls under no existing classification other than "junk". For example, recent experiments removed 1% of the mouse genome and were unable to detect any effect on the phenotype
Phenotype

A phenotype is any observable characteristic or trait_ of an organism: such as its morphology , development, biochemical or physiological properties, or behavior....
. This result suggests that the DNA is nonfunctional. However, it remains a possibility that there is some function that the experiments performed on the mice were merely insufficient to detect.

While overall genome size
Genome size

Genome size refers to the total amount of DNA contained within one copy of a genome. It is typically measured in terms of mass in picograms or less frequently in Dalton or as the total number of nucleotide base pairs typically in megabases ....
, and by extension the amount of junk DNA, are correlated to organism complexity, there are many exceptions. For example, the genome of the unicellular Amoeba dubia
Amoeba dubia

Amoeba dubia is the largest species of the phylum Amoebozoa, more commonly referred to as amoebas. These are species of protozoa that move by means of temporary projections called pseudopods, and are well-known as a representative unicellular organism....
 has been reported to contain more than 200 times the amount of DNA in humans" .

The pufferfish
Pufferfish

Tetraodontidae is a family of primarily marine and estuarine fish. The family includes many familiar species which are variously called puffers, balloonfish, blowfish, bubblefish, globefish, swellfish, toadfish, and toadies....
 Takifugu
Takifugu

This article is about the genus of pufferfish Takifugu; for the Japanese dish, see fugu.Takifugu is a genus of pufferfish, often better known by the Japanese name fugu ....
 rubripes
genome is only about one tenth the size of the human genome, yet seems to have a comparable number of genes. Most of the difference appears to lie in what is now known only as junk DNA. This puzzle is known as the C-value enigma
C-value enigma

The C-value enigma or C-value paradox is a term used to describe the complex puzzle surrounding the extensive variation in nuclear genome size among eukaryotic species....
 or, more conventionally, the C-value paradox.

Types of junk DNA

  • Pseudogene
    Pseudogene

    Pseudogenes are defunct relatives of known genes that have lost their protein-coding ability or are otherwise no longer gene expression in the cell....
    s - Some chromosomal regions are composed of the now-defunct remains of ancient genes, known as pseudogenes, which were once functional copies of genes but have since lost their protein-coding ability (and, presumably, their biological function). After non-functionalization, pseudogenes are free to acquire genetic noise in the form of random mutations.
  • Retrotransposon
    Retrotransposon

    Retrotransposons are Genetics elements that can amplify themselves in a genome and are ubiquitous components of the DNA of many Eukaryote organisms....
    s - 8% of the human genome has been shown to be formed by retrotransposons of Human Endogenous Retroviruses
    Endogenous retrovirus

    Endogenous retroviruses are retroviruses derived from ancient infections of germ cells in humans, mammals and other vertebrates; as such their proviruses are inheritance to the next generation and now remain in the genome....
     (HERVs), although as much as 25% is recognisably formed of retrotransposons. This is a lower limit on how much of the genome is retrotransposons because older remains might not be recognizable having accumulated too many mutations. New research suggests that genome size variation in at least two kinds of plants is mostly because of retrotransposons.

Hypotheses of origin and function


There are some hypotheses, none conclusively established, for how junk DNA arose and why it persists in the genome:

  • Junk DNA might provide a reservoir of sequences from which potentially advantageous new genes can emerge. In this way, it may be an important genetic basis for evolution.
  • Some junk DNA could be spacer material that allows enzyme
    Enzyme

    Enzymes are biomolecules that catalysis chemical reactions. Almost all enzymes are proteins. In enzymatic reactions, the molecules at the beginning of the process are called Substrate , and the enzyme converts them into different molecules, the products....
     complexes to form around functional elements more easily. In this way, the junk DNA could serve an important function even though the actual sequence of information it contains is irrelevant.
  • Some portions of junk DNA could serve presently unknown regulatory functions, controlling the expression of certain genes during the development of an organism from embryo
    Embryo

    An embryo is a multicellular organism ploidy eukaryote in its earliest stage of development, from the time of first cell division until birth, Egg , or germination....
     to adult, and/or development of certain organs/organelles.
  • Regulatory layers in some "junk DNA", such as through non-coding RNA
    Non-coding RNA

    A non-coding RNA is a functional RNA molecule that is not Translation into a protein. Less-frequently used synonyms are non-protein-coding RNA , non-messenger RNA , small non-messenger RNA , functional RNA ....
    s, may contain important genetic programming.
  • According to a comparative study of over 300 prokaryotic and over 30 eukaryotic genomes , eukaryotes appear to require a minimum amount of non-coding DNA. This minimum amount can be predicted using a growth model for regulatory genetic networks, implying that it is required for regulatory purposes. In humans the predicted minimum is about 5% of the total genome.


Evolutionary conservation of junk DNA


Comparative genomics
Comparative genomics

Comparative genomics is the study of the relationship of genome structure and function across different biological species or Strain . Comparative genomics is an attempt to take advantage of the information provided by the signatures of selection to understand the function and evolutionary processes that act on genomes....
 is a promising direction in studying the function of junk DNA. Biologically functional sequences tend to undergo mutation at a slower rate than nonfunctional sequence, since mutations in these sequences are likely to be selected against
Natural selection

Natural selection is the process by which favorable heritable trait become more common in successive generations of a population of Reproduction organisms, and unfavorable heritable traits become less common, due to differential reproduction of genotypes....
. For example, the coding sequence of a human protein-coding gene is typically about 80% identical to its mouse ortholog, while their genomes as a whole are much more widely diverged. Analyzing the patterns of conservation between the genomes of different species can suggest which sequences are functional, or at least which functional sequences are shared by those species. Functional elements stand out in such analyses as having diverged less than the surrounding sequence.

Comparative studies of several mammalian genomes suggest that approximately 5% of the human genome has evolved under purifying selection
Stabilizing selection

Stabilizing selection, also referred to as purifying selection or ambidirectional selection, is a type of natural selection in which genetic diversity decreases as the population stabilizes on a particular Trait value....
 since their divergence. Since known functional sequence comprises less than 2% of the human genome, there may be more junk DNA in the human genome than there is functional sequence.

A surprising recent finding was the discovery of nearly 500 ultraconserved elements, which are shared at extraordinarily high fidelity among the available vertebrate genomes, in what had previously been designated as junk DNA. The function of these sequences is currently under intense scrutiny, and there are preliminary indications that some may play a regulatory role in vertebrate development from embryo to adult.

Present results concerning evolutionarily conserved human junk DNA are expressed in preliminary, probabilistic terms, since only a handful of related genomes are available. As more vertebrate, and especially mammalian, genomes are sequenced, scientists will develop a clearer picture of this important class of sequence. However, it is always possible, though highly unlikely, that there are significant quantities of functional human DNA that are not shared among these species, and which would thus not be revealed by these studies. Conversely, there are some questions about the hypothesis that conserved sequences all must function .

Replication of junk DNA each time a cell divides may waste energy. Organisms with less junk DNA may therefore have a selective advantage, and natural selection
Natural selection

Natural selection is the process by which favorable heritable trait become more common in successive generations of a population of Reproduction organisms, and unfavorable heritable traits become less common, due to differential reproduction of genotypes....
 would tend to eliminate it. There are several possible explanations for why it has not been eliminated: (1) The energy required to replicate even large amounts of junk DNA may be relatively insignificant on the cellular or organismal scale, so no selective pressure results (selection coefficient
Selection coefficient

In population genetics, the selection coefficient is a measure of the relative Fitness of a phenotype. Usually denoted by the letter s, it compares the fitness of a phenotype to another favoured phenotype, and is the proportional amount that the considered phenotype is less fit as measured by fertile progeny....
s less than one over the population size are effectively neutral); (2) Junk DNA may provide a reservoir of potentially useful sequences or a protective buffer against harmful genetic damage or mutations; and (3) Junk DNA may accumulate faster than natural selection can eliminate it. In animals, the energy required for DNA synthesis is trivial compared to the metabolic energy invested in the movement of muscles.

Functions for Some Subsets of Junk DNA


Over the years evidence is accumulating that more and more of the so-called junk DNA might have a function, even if we do not know yet what that function is.

Different studies remark on the importance of junk DNA for social behavior in rodents (and, possibly humans) , regulation of gene expression and promotion of genetic diversity , evolution of sequences (for example, an antifreeze-protein gene in a species of fish ), as a source of microRNAs , and hosting DNA segments called LINE-1 capable of repairing broken strands of DNA.

However it is noted that the fact that some non-coding DNA has a purpose does not establish that all non-coding DNA has a purpose. In addition, sections of DNA can be randomized, cut, or added to with no apparent effect on the organism in question.

See also

  • Atavism
    Atavism

    The term atavism denotes the tendency to revert to ancestral type. An atavism is an evolutionary throwback, such as traits reappearing which had disappeared generations ago....
  • Alu repeat
  • Eukaryotic chromosome fine structure
    Eukaryotic chromosome fine structure

    Eukaryotic chromosome fine structure refers to the structure of sequences for eukaryotic chromosomes. Some fine sequences are included in more than one class, so the classification listed is not intended to be completely separate....
  • Function (biology)
    Function (biology)

    A function is part of an answer to a question about why some object or process occurred in a system that evolved through a process of selection....
  • Genealogical DNA test
    Genealogical DNA test

    A genealogical DNA test examines the nucleotides at specific locations on a person's DNA for genetic genealogy purposes. The test results are not meant to have any informative medical value and do not determine specific genetic diseases or disorders ; they are intended only to give genealogical information....
  • Intron
    Intron

    Introns, derived from the term "intragenic regions" and also called intervening sequence , are DNA regions in a gene that are not translated into proteins....
  • Repeated sequence (DNA)
    Repeated sequence (DNA)

    In the study of DNA sequences, one can distinguish two main types of repeated sequence:*Tandem repeats:**Satellite DNA,**Minisatellite,**Microsatellite;...
  • Retrotransposon
    Retrotransposon

    Retrotransposons are Genetics elements that can amplify themselves in a genome and are ubiquitous components of the DNA of many Eukaryote organisms....
  • Satellite DNA
    Satellite DNA

    Satellite DNA consists of highly repetitive DNA, and is so called because repetitions of a short DNA sequence tend to produce a different frequency of the nucleotides adenine, cytosine, guanine and thymine, and thus have a different density from bulk DNA - such that they form a second or 'satellite' band when genomic DNA is separated on a Den...
  • Selfish DNA
    Selfish DNA

    Selfish DNA refers to those sequences of DNA which, in their purest form, have two distinct properties: the DNA sequence spreads by forming additional copies of itself within the genome; and it makes no specific contribution to the reproductive success of its host organism....


Further reading

  • Gibbs W.W. (2003) "The unseen genome: gems among the junk", Scientific American
    Scientific American

    Scientific American is a popular science science magazine, published since August 28, 1845, making it one of the oldest continuously published magazines in the United States....
    , 289(5): 46-53. (A review, written for non-specialists, of recent discoveries of function within junk DNA.)