Genome engineering
Encyclopedia
Genome engineering refers to the strategies and techniques developed in recent years for the targeted, specific modification of the genetic information – or genome
Genome
In modern molecular biology and genetics, the genome is the entirety of an organism's hereditary information. It is encoded either in DNA or, for many types of virus, in RNA. The genome includes both the genes and the non-coding sequences of the DNA/RNA....

 – of living organisms.

It represents a very active field of research because of the wide range of possible applications, particularly in the areas of human health - the correction of a gene
Gene
A gene is a molecular unit of heredity of a living organism. It is a name given to some stretches of DNA and RNA that code for a type of protein or for an RNA chain that has a function in the organism. Living beings depend on genes, as they specify all proteins and functional RNA chains...

 carrying a harmful mutation, the production of therapeutic proteins, the elimination of persistent viral sequences - agricultural biotechnology - the development of new generations of genetically modified plant
Genetically modified plant
Genetically modified plants are plants whose DNA is modified using genetic engineering techniques. In most cases the aim is to introduce a new trait to the plant which does not occur naturally in this species...

s - and for the development of research tools - for example, to explore the function of a gene.

Various modifications can be performed:
  • Insertion involves introducing a gene into a chromosome
    Chromosome
    A chromosome is an organized structure of DNA and protein found in cells. It is a single piece of coiled DNA containing many genes, regulatory elements and other nucleotide sequences. Chromosomes also contain DNA-bound proteins, which serve to package the DNA and control its functions.Chromosomes...

     to obtain a new function (for example to obtain a better drought-resistant plant) or to compensate for a defective gene, particularly by making it possible to manufacture a functional protein if the protein produced by the patient is defective (such as factor VIII in hemophilia A).
  • Inactivation, or “knock-out”, is today mainly used in fundamental research to shed light on the function of a gene by observing the anomalies that occur as a result of its inactivation. It can also have other applications, for example to remove a persistent viral sequence from infected cells, or in agriculture to eliminate the irritant or allergenic properties of a plant.
  • Correction aims to remove and replace a defective gene sequence with a functional sequence. This correction can be performed on a very short sequence, sometimes just a few nucleotides, such as in the case of drepanocytosis (sickle cell anemia). In plants, this manipulation can also help improve the properties of a species without the addition of foreign DNA
    DNA
    Deoxyribonucleic acid is a nucleic acid that contains the genetic instructions used in the development and functioning of all known living organisms . The DNA segments that carry this genetic information are called genes, but other DNA sequences have structural purposes, or are involved in...

    , as in the work carried out by the company Cibus.


How does it differ from previous genome modification technologies?

Early technologies developed to insert a gene into a living cell, such as transgenesis
Transgenesis
thumb|300px|right|A diagram comparing the genetic changes achieved through conventional plant breeding, transgenesis and cisgenesisTransgenesis is the process of introducing an exogenous gene – called a transgene – into a living organism so that the organism will exhibit a new property and transmit...

, are limited by the random nature of the insertion of the new sequence into the genome. The new gene is positioned blindly, and may inactivate or disturb the functioning of other genes or even cause severe unwanted effects; it may trigger a process of cancerization, for example. Furthermore, these technologies offer no degree of reproducibility, as there is no guarantee that the new sequence will be inserted at the same place in two different cells.

The major advantage of genome engineering, which uses more recent knowledge and technology, is that it enables a specific area of the DNA to be modified, thereby increasing the precision of the correction or insertion, preventing any cell toxicity and offering perfect reproducibility.

Genome engineering and synthetic genomics
Synthetic biology
Synthetic biology is a new area of biological research that combines science and engineering. It encompasses a variety of different approaches, methodologies, and disciplines with a variety of definitions...

 (designing artificial genomes) are currently among the most promising technologies in terms of applied biological research and industrial innovation.

General principles

Early approaches to genome engineering involved modifying genetic sequences using only homologous recombination
Homologous recombination
Homologous recombination is a type of genetic recombination in which nucleotide sequences are exchanged between two similar or identical molecules of DNA. It is most widely used by cells to accurately repair harmful breaks that occur on both strands of DNA, known as double-strand breaks...

. Using a homologous sequence located on another strand as a model can lead this natural DNA maintenance mechanism to repair a DNA strand.
It is possible to induce homologous recombinations between a cellular DNA strand and an exogenous DNA strand inserted in the cell by researchers, using a vector such as the modified genome of a retrovirus. The recombination phenomenon is flexible enough for a certain level of change (addition, suppression or modification of a DNA portion) to be introduced to the targeted homologous area.

In the 1980s, Mario R. Capecchi and Oliver Smithies
Oliver Smithies
Oliver Smithies is a British-born American geneticist and Nobel laureate, credited with the invention of gel electrophoresis in 1955, and the simultaneous discovery, with Mario Capecchi and Martin Evans, of the technique of homologous recombination of transgenic DNA with genomic DNA, a much more...

 worked on the homologous recombination of DNA as a “gene targeting
Gene targeting
Gene targeting is a genetic technique that uses homologous recombination to change an endogenous gene. The method can be used to delete a gene, remove exons, add a gene, and introduce point mutations. Gene targeting can be permanent or conditional...

” tool; in other words, as an instrument for the inactivation or modification of specific genes. Working with Martin J. Evans, they developed a process for the modification of the mouse genome by modifying the DNA of mouse embryonic stem cells in culture and injecting these modified stem cells into mouse embryos. Genetically modified mice generated using this method make useful laboratory models to study human diseases. This tool is now commonly used in medical research. The three researchers were awarded the 2007 Nobel Prize
Nobel Prize
The Nobel Prizes are annual international awards bestowed by Scandinavian committees in recognition of cultural and scientific advances. The will of the Swedish chemist Alfred Nobel, the inventor of dynamite, established the prizes in 1895...

 in Medicine for their work.

Modifying genomes using only homologous recombination remained a long and random process, until additional developments were made that could increase the rate of homologous recombination in somatic cell types. These developments fall under to two mechanistically distinct methods of triggering the cells inherent DNA repair mechanisms which are required to insert a foreign gene sequence into a live cell. The first method is by site directed endonucleases (restriction enzymes), which include specific technologies such as zinc finger nucleases (ZFNs) and meganucleases
Meganucleases
Meganucleases are endodeoxyribonucleases characterized by a large recognition site ; as a result this site generally occurs only once in any given genome. For example, the 18-base pair sequence recognized by the I-SceI meganuclease would on average require a genome twenty times the size of the...

. Site directed endonucleases achieve gene modification through causing double stranded DNA (dsDNA) breaks which triggers the cells natural DNA repair mechanism, predominantly non homologous end joining (NHEJ) as well as a low frequency of homologous recombination (HR). The second method is recombinant adeno-associated virus (rAAV) mediated genome engineering which induces high frequencies of homologous recombination alone, thus forgoing the need to perform dsDNA breaks.

Transfection by causing dsDNA breaks

Researchers wishing to efficiently eliminate a gene to study the resulting loss of its function are increasingly opting for a “molecular scissors” approach. These are enzymes with specific properties which enable them to cut the long double DNA strand along the sequence to be modified, thereby triggering the NHEJ and HR process at the required location.
Restriction enzymes commonly used in molecular biology to cut DNA interact with sequences of 1 to 10 nucleotides. These sequences, which are very short and generally palindromic, often occur at several sites in the genome (the human genome comprises 6.4 billion bases). Restriction enzymes are therefore likely to cut the DNA molecule several times. In their efforts to find a genome surgery approach offering a higher degree of accuracy and security, scientists therefore turned to more precise tools.
More targeted genome engineering can be performed by using enzymes which are able to recognize and interact with DNA sequences that are sufficiently long so as to occur only once, with high probability, in any given genome. The DNA modification therefore takes place precisely at the site of the target sequence. With recognition sites of over 12 base pairs, meganucleases
Meganucleases
Meganucleases are endodeoxyribonucleases characterized by a large recognition site ; as a result this site generally occurs only once in any given genome. For example, the 18-base pair sequence recognized by the I-SceI meganuclease would on average require a genome twenty times the size of the...

 and zinc finger nucleases offer this degree of precision.
Once the DNA has been cut, natural DNA repair mechanisms and homologous recombination enable the incorporation of a modified sequence or a new gene.
The success of these different stages (recognition, cleavage and recombination) depends on various factors, including the efficacy of the vector
Vector (molecular biology)
In molecular biology, a vector is a DNA molecule used as a vehicle to transfer foreign genetic material into another cell. The four major types of vectors are plasmids, viruses, cosmids, and artificial chromosomes...

 that introduces the enzyme into the cell, the enzyme cleavage activity, the cell’s capacity for homologous recombination and probably the state of the chromatin
Chromatin
Chromatin is the combination of DNA and proteins that make up the contents of the nucleus of a cell. The primary functions of chromatin are; to package DNA into a smaller volume to fit in the cell, to strengthen the DNA to allow mitosis and meiosis and prevent DNA damage, and to control gene...

 at the given locus
Locus (genetics)
In the fields of genetics and genetic computation, a locus is the specific location of a gene or DNA sequence on a chromosome. A variant of the DNA sequence at a given locus is called an allele. The ordered list of loci known for a particular genome is called a genetic map...

.

Meganuclease-based Engineering

Meganucleases
Meganucleases
Meganucleases are endodeoxyribonucleases characterized by a large recognition site ; as a result this site generally occurs only once in any given genome. For example, the 18-base pair sequence recognized by the I-SceI meganuclease would on average require a genome twenty times the size of the...

, discovered in the late 1980s, are enzymes in the endonuclease
Endonuclease
Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain, in contrast to exonucleases, which cleave phosphodiester bonds at the end of a polynucleotide chain. Typically, a restriction site will be a palindromic sequence four to six nucleotides long. Most...

 family which are characterized by their capacity to recognize and cut large DNA sequences (from 12 to 40 base pairs). The most widespread and best known meganucleases are the proteins in the LAGLIDADG family, which owe their name to a conserved amino acid sequence.

These enzymes were identified in the 1990s as promising tools for genome engineering. However, even though they occur in nature and each one exhibits slight variations in its DNA recognition site, there is virtually no chance of finding the exact meganuclease required to act on a specific DNA sequence. Each new genome engineering target therefore requires an initial protein engineering stage to produce a custom meganuclease.

There are two possible methods for creating custom meganucleases:
  • Mutagenesis
    Mutagenesis
    Mutagenesis is a process by which the genetic information of an organism is changed in a stable manner, resulting in a mutation. It may occur spontaneously in nature, or as a result of exposure to mutagens. It can also be achieved experimentally using laboratory procedures...

     involves generating collections of variants using a meganuclease with properties similar to the desired enzyme, then selecting these variants using high-throughput screening. This procedure can be optimized by adopting what are known as “semi-rational” methods, in which the structural data is electronically processed in order to focus the mutagenesis to the part of the enzyme that interacts with DNA and triggers the cleavage.
  • Combinatorial assembly is a method whereby protein subunits from different enzymes can be associated or fused.


These two approaches can be combined. Scientists from the French biotechnology company Cellectis have identified in the structure of several meganucleases the areas responsible for DNA cleavage and the areas that interact with specific DNA sites. By acting on these recognition sites, they have been able to generate variants that interact with different DNA sequences from those of the initial meganucleases, while retaining their ability to cut the DNA and their high degree of specificity.

A large bank containing several tens of thousands of protein units has been created. These units can be combined to obtain chimeric meganucleases that recognize the target site, thereby providing research and development tools that meet a wide range of needs (fundamental research, health, agriculture, industry, energy, etc.).

This technique has enabled the development of several meganucleases specific for sequences in the genomes of viruses, plants, etc., and the industrial-scale production of two meganucleases able to cleave the human XPC gene; mutations in this gene result in Xeroderma pigmentosum, a severe monogenic disorder that predisposes the patients to skin cancer and burns whenever their skin is exposed to UV rays.

Another approach involves using computer models to try to predict as accurately as possible the activity of the modified meganucleases and the specificity of the recognized nucleic sequence. The Northwest Genome Engineering Consortium, a US consortium funded by the National Institutes of Health
National Institutes of Health
The National Institutes of Health are an agency of the United States Department of Health and Human Services and are the primary agency of the United States government responsible for biomedical and health-related research. Its science and engineering counterpart is the National Science Foundation...

, has adopted this approach with the aim of treating leukemia by modifying hematopoietic stem cells. The model’s prediction has been verified and guided by means of directed mutagenesis and in vitro biochemical analysis.

A third approach has been taken by the American biotechnology company Precision Biosciences, Inc. The company, funded by the National Institutes of Health and the National Institute of Standards and Technology, has developed a fully rational design process called the Directed Nuclease Editor (DNE) which is capable of creating highly specific engineered meganucleases that successfully target and modify a user-defined location in a genome.

Zinc finger nuclease-based Engineering

Zinc finger
Zinc finger
Zinc fingers are small protein structural motifs that can coordinate one or more zinc ions to help stabilize their folds. They can be classified into several different structural families and typically function as interaction modules that bind DNA, RNA, proteins, or small molecules...

 motifs
Structural motif
In a chain-like biological molecule, such as a protein or nucleic acid, a structural motif is a supersecondary structure, which appears also in a variety of other molecules...

 occur in several transcription factors. The zinc ion, found in 8% of all human proteins, plays an important role in the organization of their three-dimensional structure. In transcription factors, it is most often located at the protein-DNA interaction sites, where it stabilizes the motif
Structural motif
In a chain-like biological molecule, such as a protein or nucleic acid, a structural motif is a supersecondary structure, which appears also in a variety of other molecules...

. The C-terminal part of each finger is responsible for the specific recognition of the DNA sequence.

The recognized sequences are short, made up of around 3 base pairs, but by combining 6 to 8 zinc fingers whose recognition sites have been characterized, it is possible to obtain specific proteins for sequences of around 20 base pairs. It is therefore possible to control the expression of a specific gene. It has been demonstrated that this strategy can be used to promote a process of angiogenesis in animals. It is also possible to fuse a protein constructed in this way with the catalytic domain of an endonuclease in order to induce a targeted DNA break, and therefore to use these proteins as genome engineering tools.

The method generally adopted for this involves associating two proteins – each containing 3 to 6 specifically chosen zinc fingers – with the catalytic domain of the FokI
FokI
The enzyme FokI, naturally found in Flavobacterium okeanokoites, is a bacterial type IIS restriction endonuclease consisting of an N-terminal DNA-binding domain and a non-specific DNA cleavage domain at the C-terminal...

 endonuclease. The two proteins recognize two DNA sequences that are a few nucleotides apart. Linking the two zinc finger proteins to their respective sequences brings the two endonucleases associated with them closer together. This means that they can be dimerized and then cut the DNA molecule.

Several approaches are used to design specific zinc finger nucleases for the chosen sequences. The most widespread involves combining zinc-finger units with known specificities (modular assembly). Various selection techniques, using bacteria, yeast or mammal cells have been developed to identify the combinations that offer the best specificity and the best cell tolerance. Although the direct genome-wide characterization of zinc finger nuclease activity has not been reported, an assay that measures the total number of double-strand DNA breaks in cells found that only one to two such breaks occur above background in cells treated with zinc finger nucleases with a 24 bp composite recognition site and obligate heterodimer FokI
FokI
The enzyme FokI, naturally found in Flavobacterium okeanokoites, is a bacterial type IIS restriction endonuclease consisting of an N-terminal DNA-binding domain and a non-specific DNA cleavage domain at the C-terminal...

 nuclease domains.

Zinc finger nucleases are research and development tools that have already been used to modify a range of genomes, in particular by the laboratories in the Zinc Finger Consortium. The US company Sangamo Biosciences uses zinc finger nucleases to carry out research into the genetic engineering of stem cells and the modification of immune cells for therapeutic purposes. Modified T lymphocytes are currently undergoing phase I clinical trials to treat a type of brain tumor (glioblastoma) and in the fight against AIDS.

Transduction by stimulating homologous recombination

rAAV-based engineering

rAAV mediated genome engineering builds on Capecchi and Smithies’ Nobel Prize winning discovery that homologous recombination
Homologous recombination
Homologous recombination is a type of genetic recombination in which nucleotide sequences are exchanged between two similar or identical molecules of DNA. It is most widely used by cells to accurately repair harmful breaks that occur on both strands of DNA, known as double-strand breaks...

, a natural DNA repair mechanism, can be harnessed to perform precise genome alterations in mice. rAAV improves the efficiency of this approach to permit gene editing in any pre-established and differentiated human cell line, which in contrast to mouse ES cells, have low rates of homologous recombination.
Genome editing in human and other mammalian somatic cell types using homologous recombination is now achievable using recombinant adeno-associated virus (rAAV) vectors. These single-stranded DNA viral vectors have high transduction
Transduction
Transduction is a mechanism whereby genetic material may be transferred from the genes of a bacterium to another bacterium. This may include the actual covalent-bonding of new genetic markers...

 rates and have a unique property of stimulating endogenous HR without causing double strand DNA breaks in the genome, which are typical of other site-directed endonuclease mediated genome editing methods.

Users can design a rAAV vector to any target genomic locus and perform either gross or subtle endogenous gene alterations in mammalian somatic cell types. These include gene knock-outs for functional genomics, or the ‘knock-in’ of protein tag insertions to track translocation events at physiological levels in live cells. Most importantly, rAAV targets a single allele at a time and does not result in any off-target genomic alterations. Because of this, it is able to routinely and accurately model genetic diseases caused by subtle SNPs or point mutations that are increasingly the targets of novel drug discovery programs.

The use of rAAV based engineering has been documented in over 160 peer-reviewed publications. Researchers have employed rAAV based genome editing to engineer human cell lines for use as disease models. These isogenic human disease models
Isogenic human disease models
Isogenic human disease models are a family of cells that are selected or engineered to accurately model the genetics of a specific patient population, in vitro . They are provided with a genetically matched ‘normal cell’ to provide an isogenic system to research disease biology and novel...

 are precisely matched pairs of cell lines, where one harbours a cancer-associated mutation in an endogenous gene, just as it occurs in real patients, while the other is a genetically identical cell line carrying a normal version of that gene. These isogenic disease models provide a definitive means to understand disease biology.

Derived from this disease model panel, the company is also launching a range of genetically-defined genomic DNA diagnostic control materials. Further work is underway by Horizon to apply rAAV gene-editing in other translational areas, such as enhancing the yield of therapeutic biologic agents from engineered ‘bioproducer’ mammalian cell lines.

Another emerging application of rAAV based engineering is for gene therapy
Gene therapy
Gene therapy is the insertion, alteration, or removal of genes within an individual's cells and biological tissues to treat disease. It is a technique for correcting defective genes that are responsible for disease development...

 in patients, due to the accuracy and lack of off-target recombination events afforded by this approach. There are a number of clinical trials currently in progress.

See also

  • Biological engineering
    Biological engineering
    Biological engineering, biotechnological engineering or bioengineering is the application of concepts and methods of biology to solve problems in life sciences, using engineering's own analytical and synthetic methodologies and also its traditional...

  • Homing endonuclease
    Homing endonuclease
    The homing endonucleases are a type of restriction enzymes typically encoded by introns or inteins. They act on the cellular DNA of the cells that synthesize them, in the opposite alleles of the genes that encode them.- Origin and mechanism :...

  • homologous recombination
    Homologous recombination
    Homologous recombination is a type of genetic recombination in which nucleotide sequences are exchanged between two similar or identical molecules of DNA. It is most widely used by cells to accurately repair harmful breaks that occur on both strands of DNA, known as double-strand breaks...

  • Protein engineering
    Protein engineering
    Protein engineering is the process of developing useful or valuable proteins. It is a young discipline, with much research taking place into the understanding of protein folding and recognition for protein design principles....

  • Protein design
    Protein design
    Protein design is the design of new protein molecules, either from scratch or by making calculated variations on a known structure. The use of rational design techniques for proteins is a major aspect of protein engineering....

  • Meganuclease
  • Zinc finger nuclease
    Zinc finger nuclease
    Zinc-finger nucleases are artificial restriction enzymes generated by fusing a zinc finger DNA-binding domain to a DNA-cleavage domain. Zinc finger domains can be engineered to target desired DNA sequences and this enables zinc-finger nucleases to target unique sequences within complex genomes...

  • Recombinant AAV mediated genome engineering
    Recombinant AAV mediated genome engineering
    Recombinant adeno-associated virus based genome engineering is a genome editing platform centered around the use of rAAV vectors that enables insertion, deletion or substitiution of DNA sequences into the genomes of live mammalian cells...

  • Isogenic human disease models
    Isogenic human disease models
    Isogenic human disease models are a family of cells that are selected or engineered to accurately model the genetics of a specific patient population, in vitro . They are provided with a genetically matched ‘normal cell’ to provide an isogenic system to research disease biology and novel...

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK