All Topics  
Protein folding

 

   Email Print
   Bookmark   Link






 

Protein folding



 
 
Protein folding is the physical process by which a polypeptide folds into its characteristic and functional three-dimensional structure
Protein structure

Proteins are an important class of biological macromolecules present in all biological organisms, made up of such chemical element as carbon,hydrogen, nitrogen, oxygen, and sulphur....
. Each protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
 begins as a polypeptide, translated from a sequence of mRNA as a linear chain of amino acid
Amino acid

In chemistry, an amino acid is a molecule containing both amine and carboxyl functional groups. These molecules are particularly important in biochemistry, where this term refers to alpha-amino acids with the general formula H2NCHRCOOH, where R is an organic substituent....
s. This polypeptide lacks any developed three-dimensional structure (the left hand side of the neighboring figure). However each amino acid in the chain can be thought of having certain 'gross' chemical features.






Discussion
Ask a question about 'Protein folding'
Start a new discussion about 'Protein folding'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Protein folding is the physical process by which a polypeptide folds into its characteristic and functional three-dimensional structure
Protein structure

Proteins are an important class of biological macromolecules present in all biological organisms, made up of such chemical element as carbon,hydrogen, nitrogen, oxygen, and sulphur....
. Each protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
 begins as a polypeptide, translated from a sequence of mRNA as a linear chain of amino acid
Amino acid

In chemistry, an amino acid is a molecule containing both amine and carboxyl functional groups. These molecules are particularly important in biochemistry, where this term refers to alpha-amino acids with the general formula H2NCHRCOOH, where R is an organic substituent....
s. This polypeptide lacks any developed three-dimensional structure (the left hand side of the neighboring figure). However each amino acid in the chain can be thought of having certain 'gross' chemical features. These may be hydrophobic, hydrophilic, or electrically charged, for example. These interact with each other and their surroundings in the cell to produce a well-defined, three dimensional shape, the folded protein (the right hand side of the figure), known as the native state
Native state

In biochemistry, the native state of a protein is its operative or functional form. All protein molecules are simple unbranched chains of amino acids, but it is by assuming a specific three-dimensional shape that they are able to perform their biological function....
. The resulting three-dimensional structure is determined by the sequence of the amino acids. The mechanism of protein folding is not completely understood.

Experimentally determining the three dimensional structure of a protein is often very difficult and expensive. However the sequence of that protein is often known. Therefore scientists have tried to use different biophysical
Biophysics

Biophysics is an interdisciplinary science that employs and develops theories and methods of the physical sciences for the investigation of biology systems....
 techniques to computationally fold a protein, that is, to predict the structure of the complete protein from the sequence of the protein.

For many proteins the correct three dimensional structure is essential to function. Failure to fold into the intended shape usually produces inactive proteins with different properties (details found under prion
Prion

A prion is an infectious disease that is comprised entirely of a reproduction, mis-folded protein. The mis-folded form of the prion protein has been implicated in a number of diseases in a variety of mammals, including bovine spongiform encephalopathy in cattle and Creutzfeldt-Jakob disease in humans....
). Several neurodegenerative and other disease
Disease

A disease or medical condition is an abnormal condition of an organism that impairs bodily functions, associated with specific symptoms and Medical signs....
s are believed to result from the accumulation of misfolded (incorrectly folded) proteins.

Known facts about the process


The relationship between folding and amino acid sequence


The amino-acid sequence (or primary structure
Primary structure

In biochemistry, the primary structure of a biological molecule is the exact specification of its atomic composition and the chemical bonds connecting those atoms ....
) of a protein predisposes it towards its native conformation or conformations. It will fold spontaneously during or after synthesis. While these macromolecule
Macromolecule

The term macromolecule by definition implies "large molecule". In the context of biochemistry, the term may be applied to the four conventional biopolymers , as well as non-polymeric molecules with large molecular mass such as macrocycles....
s may be regarded as "folding themselves", the mechanism depends equally on the characteristics of the cytosol
Cytosol

The cytosol or intracellular fluid is the liquid found inside cell . In eukaryotes this liquid is separated by cell membranes from the contents of the organelles suspended in the cytosol, such as the mitochondrial matrix inside the mitochondrion....
, including the nature of the primary solvent
Solvent

A solvent is a liquid or gas that dissolves a solid, liquid, or gaseous solute, resulting in a solution.The most common solvent in everyday life is water....
 (water
Water

Water is a common chemical substance that is essential for the survival of all known forms of life. In typical usage, water refers only to its liquid form or States of matter, but the substance also has a solid state, ice, and a gaseous state, water vapor or steam....
 or lipid
Lipid

Lipids are broadly defined as any fat-soluble , naturally-occurring molecule, such as fats, oils, waxes, cholesterol, sterols, fat-soluble vitamins , monoglycerides, diglycerides, phospholipids, and others....
), macromolecular crowding
Macromolecular crowding

The phenomenon of macromolecular crowding alters the properties of molecules in a solution when high concentrations of macromolecules such as proteins are present....
, the concentration of salt
Salt

A salt, in chemistry, is defined as the product formed from the neutralisation reaction of acids and base . Salts are ionic compounds composed of cations and anions so that the product is electrically electric charge ....
s, the temperature
Temperature

In physics, temperature is a physical property of a Physical system that underlies the common notions of hot and cold; something that feels hotter generally has the greater temperature....
, and molecular chaperones.

Most folded proteins have a hydrophobic core in which side chain packing stabilizes the folded state, and charged or polar
Chemical polarity

In chemistry, polarity refers to the dipole-dipole intermolecular forces between the slightly electric charge end of one molecule to the negative end of another or the same molecule....
 side chains on the solvent-exposed surface where they interact with surrounding water molecules. It is generally accepted that minimizing the number of hydrophobic side-chains exposed to water is the principal driving force behind the folding process, although a recent theory has been proposed which reassesses the contributions made by hydrogen bonding.The strengths of hydrogen bonds in a protein vary, i.e. they are dependent on their microenvironment, thus H-bonds enveloped in a hydrophobic core contribute more than H-bonds exposed to the aqueous environment to the stability of the native state.

The process of folding in vivo
In vivo

In vivo means that which takes place inside an organism. In science, in vivo refers to experimentation done in or on the living tissue of a whole, living organism as opposed to a partial or dead one or a in vitro....
 often begins co-translationally
Translation (genetics)

Translation is the first stage of protein biosynthesis . Translation is the production of proteins by decoding mRNA produced in Transcription ....
, so that the N-terminus of the protein begins to fold while the C-terminal portion of the protein is still being synthesized
Protein biosynthesis

Protein synthesis is the process in which cell build proteins. The term is sometimes used to refer only to protein translation but more often it refers to a multi-step process, beginning with amino acid synthesis and transcription which are then used for translation ....
 by the ribosome
Ribosome

Ribosomes are complexes of RNA and protein that are found in all cell s. Ribosomes from bacteria, archaea and eukaryotes, the three domains of life on Earth, have significantly different structure and RNA....
. Specialized proteins called chaperones assist in the folding of other proteins. A well studied example is the bacteria
Bacteria

The Bacteria are a large group of unicellular microorganisms. Typically a few micrometres in length, bacteria have a wide range of shapes, ranging from spheres to rods and spirals....
l GroEL
GroEL

GroEL belongs to the chaperonin family of molecular chaperones, and is found in a large number of bacteria. It is required for the proper folding of many proteins....
 system, which assists in the folding of globular protein
Globular protein

Globular proteins, or spheroproteins are one of the two main protein classes, comprising sphere-like proteins that are more or less soluble in aqueous solution ....
s. In eukaryotic organism
Organism

In biology, an organism is any life thing . In at least some form, all organisms are capable of response to stimulus , reproduction, growth and developmental biology, and maintenance of homeostasis as a stable whole....
s chaperones are known as heat shock protein
Heat shock protein

Heat shock proteins are a class of functionally related proteins whose expression is increased when cell are exposed to elevated temperatures or other stress....
s. Although most globular proteins are able to assume their native state unassisted, chaperone-assisted folding is often necessary in the crowded intracellular environment to prevent aggregation; chaperones are also used to prevent misfolding and aggregation which may occur as a consequence of exposure to heat or other changes in the cellular environment.

For the most part, scientists have been able to study many identical molecules folding together en masse. At the coarsest level, it appears that in transitioning to the native state, a given amino acid sequence takes on roughly the same route and proceeds through roughly the same intermediates and transition states. Often folding involves first the establishment of regular secondary and supersecondary structures, particularly alpha helices
Alpha helix

A common motif in the secondary structure of proteins, the alpha helix is a right- or left-handed coiled conformation, resembling a spring , in which every backbone amino group donates a hydrogen bond to the backbone carbonyl group of the amino acid four residues earlier ....
 and beta sheet
Beta sheet

The ? sheet is the second form of regular secondary structure in proteins consisting of beta strands connected laterally by three or more hydrogen bonds, forming a generally twisted, pleated sheet ....
s, and afterwards tertiary structure
Tertiary structure

In biochemistry and chemistry, the tertiary structure of a protein or any other macromolecule is its three-dimensional structure, as defined by the atomic coordinates....
. Formation of quaternary structure
Quaternary structure

In biochemistry, quaternary structure is the arrangement of multiple protein folding protein molecules in a multi-subunit complex....
 usually involves the "assembly" or "coassembly" of subunits that have already folded. The regular alpha helix
Alpha helix

A common motif in the secondary structure of proteins, the alpha helix is a right- or left-handed coiled conformation, resembling a spring , in which every backbone amino group donates a hydrogen bond to the backbone carbonyl group of the amino acid four residues earlier ....
 and beta sheet
Beta sheet

The ? sheet is the second form of regular secondary structure in proteins consisting of beta strands connected laterally by three or more hydrogen bonds, forming a generally twisted, pleated sheet ....
 structures fold rapidly because they are stabilized by intramolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s, as was first characterized by Linus Pauling
Linus Pauling

Linus Carl Pauling was an United States scientist, peace activist, author and list of educators. He was one of the most influential chemists in history and ranks among the most important scientists in any field of the 20th century....
. Protein folding may involve covalent bond
Covalent bond

A covalent bond is a form of chemical bonding that is characterized by the sharing of pairs of electrons between atoms, or between atoms and other covalent bonds....
ing in the form of disulfide bridges
Disulfide bond

In chemistry, a disulfide bond is a single covalent bond derived from the coupling of thiol groups. The linkage is also called an SS-bond or disulfide bridge....
 formed between two cysteine
Cysteine

Cysteine is an a-amino acid with the chemical formula HO2CCHCH2SH. It is a non-essential amino acid, which means that humans can synthesize it....
 residues or the formation of metal clusters. Shortly before settling into their more energetically favourable native conformation, molecules may pass through an intermediate "molten globule
Molten globule

A molten globule is a stable, partially folded protein state found in mildly denaturation conditions such as low pH , mild denaturant, or high temperature....
" state.

The essential fact of folding, however, remains that the amino acid sequence of each protein contains the information that specifies both the native structure and the pathway to attain that state. This is not to say that nearly identical amino acid sequences always fold similarly. Conformations differ based on environmental factors as well; similar proteins fold differently based on where they are found. Folding is a spontaneous process
Spontaneous process

A spontaneous process is the time-evolution of a system in which it releases Gibbs free energy and moves to a lower, more thermodynamically stable, energy state....
 independent of energy inputs from nucleoside triphosphate
Nucleoside triphosphate

Nucleoside triphosphate is a nucleoside with three phosphates. Natural nucleoside triphosphates include adenosine triphosphate , guanosine triphosphate , cytidine triphosphate , thymidine triphosphate and uridine triphosphate ....
s. The passage of the folded state is mainly guided by hydrophobic interactions, formation of intramolecular hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s, and van der Waals forces, and it is opposed by conformational entropy
Conformational entropy

Conformational entropy is the entropy associated with the physical arrangement of a polymer chain that assumes a compact or globular protein state in solution....
.

Disruption of the native state


In certain solutions and under some conditions proteins will not fold into their biochemically functional forms. Temperatures above (and sometimes those below) the range that cells tend to live in will cause thermally unstable
Thermostability

Thermostability is the quality of a substance to resist irreversible change in its Chemical structure or physical structure at a high relative temperature....
 proteins to unfold or "denature
Denaturation (biochemistry)

Denaturation is a process in which proteins or nucleic acids lose their structure by application of some external stress or compound for example, treatment of proteins with strong acids or bases, high concentrations of inorganic salts, organic compound solvents , or heat....
" (this is why boiling makes an egg white
Egg white

File:Chicken egg01 monovular.jpgEgg white is the common name for the clear liquid contained within an Egg . It is the cytoplasm of the egg, which until fertilization is a single Cell ....
 turn opaque). High concentrations of solutes, extremes of pH
PH

pH is a measure of the Acid or Base of a solution. It is defined as the cologarithm of the Activity of dissolved hydrogen ions . Hydrogen ion activity coefficients cannot be measured experimentally, so they are based on theoretical calculations....
, mechanical forces, and the presence of chemical denaturants can do the same. A fully denatured protein lacks both tertiary and secondary structure, and exists as a so-called random coil
Random coil

A random coil is a polymer conformation where the monomer subunits are oriented randomness while still being chemical bond to graph units. It is not one specific shape, but a statistics distribution of shapes for all the chains in a statistical population of macromolecules....
. Under certain conditions some proteins can refold; however, in many cases denaturation is irreversible. Cells sometimes protect their proteins against the denaturing influence of heat with enzyme
Enzyme

Enzymes are biomolecules that catalysis chemical reactions. Almost all enzymes are proteins. In enzymatic reactions, the molecules at the beginning of the process are called Substrate , and the enzyme converts them into different molecules, the products....
s known as chaperones or heat shock protein
Heat shock protein

Heat shock proteins are a class of functionally related proteins whose expression is increased when cell are exposed to elevated temperatures or other stress....
s, which assist other proteins both in folding and in remaining folded. Some proteins never fold in cells at all except with the assistance of chaperone molecules, which either isolate individual proteins so that their folding is not interrupted by interactions with other proteins or help to unfold misfolded proteins, giving them a second chance to refold properly. This function is crucial to prevent the risk of precipitation
Precipitation (chemistry)

Precipitation is the formation of a solid in a solution during a chemical reaction. When the reaction occurs, the solid formed is called the precipitate, and the liquid remaining above the solid is called the supernate....
 into insoluble amorphous aggregates.

Incorrect protein folding and neurodegenerative disease


Aggregated proteins are associated with prion
Prion

A prion is an infectious disease that is comprised entirely of a reproduction, mis-folded protein. The mis-folded form of the prion protein has been implicated in a number of diseases in a variety of mammals, including bovine spongiform encephalopathy in cattle and Creutzfeldt-Jakob disease in humans....
-related illnesses such as Creutzfeldt-Jakob disease
Creutzfeldt-Jakob disease

Creutzfeldt–Jakob disease is a very rare and incurable degeneration neurology that is fatal. Among the types of transmissible spongiform encephalopathy found in humans, it is the most common....
, bovine spongiform encephalopathy
Bovine spongiform encephalopathy

Bovine Spongiform Encephalopathy , commonly known as Mad-Cow Disease , is a fatal, neurodegenerative disease in cattle, that causes a spongy degeneration in the brain and spinal cord....
 (mad cow disease), amyloid
Amyloid

Amyloids are insoluble fibrous protein aggregates sharing specific structural traits. Abnormal accumulation of amyloid in organs may lead to amyloidosis, and may play a role in various other neurodegenerative diseases....
-related illnesses such as Alzheimer's Disease
Alzheimer's disease

Alzheimer's disease , also called Alzheimer disease, Senile Dementia of the Alzheimer Type or simply Alzheimer's, is the most common form of dementia....
 and familial amyloid cardiomyopathy or polyneuropathy, as well as intracytoplasmic aggregation diseases such as Huntington's and Parkinson's disease. These age onset degenerative diseases are associated with the multimerization of misfolded proteins into insoluble, extracellular aggregates and/or intracellular inclusions including cross-beta sheet amyloid
Amyloid

Amyloids are insoluble fibrous protein aggregates sharing specific structural traits. Abnormal accumulation of amyloid in organs may lead to amyloidosis, and may play a role in various other neurodegenerative diseases....
 fibrils; it is not clear whether the aggregates are the cause or merely a reflection of the loss of protein homeostasis, the balance between synthesis, folding, aggregation and protein turnover. Misfolding and excessive degradation instead of folding and function leads to a number of proteopathy
Proteopathy

Proteopathy is the abnormal accumulation and toxicity of proteins in certain disease states. The proteopathies comprise more than 30 diseases that affect a variety of organs and tissues, including Alzheimer's disease, Parkinson's disease, type 2 diabetes, amyloidosis, selective hyperproteolytic diseases , and a wide range of other disorde...
 diseases such as antitrypsin-associated Emphysema
Emphysema

Emphysema is a chronic obstructive pulmonary disease . It is often caused by exposure to toxin Chemical substance, including long-term exposure to tobacco smoking....
, cystic fibrosis
Cystic fibrosis

Cystic Fibrosis is a Genetic disorder affecting the exocrine glands of the lungs, liver, pancreas, and intestines, causing progressive disability due to multisystem failure....
 and the lysosomal storage diseases, where loss of function is the origin of the disorder. While protein replacement therapy has historically been used to correct the latter disorders, an emerging approach is to use pharmaceutical chaperones to fold mutated proteins to render them functional. Christopher M. Dobson, Jeffery W. Kelly
Jeffery W. Kelly

Jeffery W. Kelly is the Dean of Graduate Studies and the Lita Annenberg Professor of Chemistry within the Skaggs Institute of Chemical Biology at The Scripps Research Institute in La Jolla, California....
, Dennis Selkoe, Stanley Prusiner, Peter T. Lansbury, William E. Balch, Richard I. Morimoto, Susan L. Lindquist and Byron C. Caughey have all contributed to this emerging understanding of diseases that are some of the most menacing of our era.

Kinetics and the Levinthal Paradox


The entire duration of the folding process varies dramatically depending on the protein of interest. The slowest folding proteins require many minutes or hours to fold, primarily due to proline isomerizations or wrong disulfide bond formations, and must pass through a number of intermediate states, like checkpoints, before the process is complete. On the other hand, very small single-domain
Protein domain

A protein domain is a part of protein sequence and tertiary structure that can biological evolution, function, and exist independently of the rest of the protein chain....
 proteins with lengths of up to a hundred amino acids typically fold in a single step. Time scales of milliseconds are the norm and the very fastest known protein folding reactions are complete within a few microseconds.

The Levinthal paradox
Levinthal paradox

The Levinthal paradox is a thought experiment in the theory of protein folding dynamics. In 1969 Cyrus Levinthal noted that, because of the very large number of degrees of freedom in an unfolded polypeptide chain, the molecule has an astronomical number of possible conformations....
 observes that if a protein were to fold by sequentially sampling all possible conformations, it would take an astronomical amount of time to do so, even if the conformations were sampled at a rapid rate (on the nanosecond or picosecond scale). Based upon the observation that proteins fold much faster than this, Levinthal then proposed that a random conformational search does not occur in folding, and the protein must, therefore, fold by a directed process.

Techniques for studying protein folding


Circular Dichroism

Circular dichroism
Circular dichroism

Circular dichroism is the differential absorption of left- and right-handed circular polarization light.A CD Spectrometer is an instrument that records this phenomenon as a function of wavelength....
 is one of the most general and basic tools to study protein folding. Circular dichroism
Circular dichroism

Circular dichroism is the differential absorption of left- and right-handed circular polarization light.A CD Spectrometer is an instrument that records this phenomenon as a function of wavelength....
 spectroscopy measures the absorption of circularly polarized light. In proteins, structures such as alpha helicies
Alpha helix

A common motif in the secondary structure of proteins, the alpha helix is a right- or left-handed coiled conformation, resembling a spring , in which every backbone amino group donates a hydrogen bond to the backbone carbonyl group of the amino acid four residues earlier ....
 and beta sheets are chiral, and thus absorb such light. The absorption of this light acts as a marker of the degree of foldedness of the protein ensemble. This technique can be used to measure equilibrium unfolding
Equilibrium unfolding

In biochemistry, equilibrium unfolding is the process of protein folding by gradually changing its solution conditions, i.e., its environment. Since equilibrium is maintained at all steps, the process is reversible ....
 of the protein by measuring the change in this absorption as a function of denaturant concentration or temperature
Temperature

In physics, temperature is a physical property of a Physical system that underlies the common notions of hot and cold; something that feels hotter generally has the greater temperature....
. A denaturant melt measures the free energy of unfolding as well as the protein's m value, or denaturant dependence. A temperature
Temperature

In physics, temperature is a physical property of a Physical system that underlies the common notions of hot and cold; something that feels hotter generally has the greater temperature....
 melt measures the melting temperature
Melting temperature

Melting temperature may refer to:* Melting point, the temperature at which a substance changes from solid to liquid state.* DNA melting temperature, the temperature at which a DNA double helix dissociates into single strands....
 (Tm) of the protein. This type of spectroscopy can also be combined with fast-mixing devices, such as stopped flow
Stopped flow

A stopped flow instrument is a rapid mixing device used to study the chemical kinetics of a reaction in solution. After two or more solutions containing the reagents are mixed, they are studied by whatever experimental methods are deemed suitable....
, to measure protein folding kinetics
Kinetics

Kinetics, derived from the Greek language word ????s?? meaning movement or the act of moving, may refer to:...
 and to generate chevron plot
Chevron plot

A chevron plot is a way of representing protein folding kinetics data in the presence of varying concentrations of Denaturation that disrupts the protein's native tertiary structure....
s.

Dual Polarization Interferometry

Dual polarisation interferometry
Dual Polarisation Interferometry

Dual polarisation interferometry is an analytical technique in chemistry that can probe layers adsorbed to the surface of a Waveguide by using the evanescent wave of a laser beam confined to the waveguide....
 (DPI) has emerged over the past decade as an analytical technique that can probe protein layers adsorbed to the surface of a waveguide by using the evanescent wave of a laser beam confined to the waveguide.

DPI focuses laser light into two waveguides, one, the "sensing" waveguide, with an exposed surface and one to create a reference beam. A two-dimensional interference pattern is formed in the far field by combining the light passing through the two waveguides. The DPI technique uses two polarisation of the laser to excite two polarisation modes of the waveguides. Measurement of the interferogram for both polarisations allows both the refractive index (protein density or fold) and the size (conformation
Conformation

Conformation generally means structure arrangement.In science, it may refer to:*Conformational isomerism, in chemistry, is the chemical structure of a molecule....
) of the adsorbed layer to be calculated. Real time measurements of biochemistry take place in a flow-through system. These measurements can be used to infer structural information about the molecular interactions at sub atomic resolution and is typically used to characterise biochemical interactions.

The latest versions of Dual Polarisation Interferometers also have the capability to quantify the order and disruption in birefringent thin films. This has been used, for example, to study the formation of lipid bilayers and their interaction with membrane proteins.

Modern studies of folding with high time resolution


The study of protein folding has been greatly advanced in recent years by the development of fast, time-resolved techniques. These are experimental methods for rapidly triggering the folding of a sample of unfolded protein, and then observing the resulting dynamics. Fast techniques in widespread use include ultrafast mixing of solutions, photochemical methods, and laser temperature jump spectroscopy. Among the many scientists who have contributed to the development of these techniques are Heinrich Roder, Harry Gray, Martin Gruebele, Brian Dyer, William Eaton, Sheena Radford, Chris Dobson, Sir Alan R. Fersht and Bengt Nölting
Bengt Nölting

Bengt N?lting is a German physicist and biophysicist who pioneered various methods in biophysics and engineering. Achievements include studying biological macromolecules, the development of self-evolving computer programs, and the development new energy technologies....
.

Energy landscape theory of protein folding


The protein folding phenomenon was largely an experimental endeavor until the formulation of energy landscape
Energy landscape

In physics, an energy landscape is a pair consisting of a topological space X representing the physical states or parameters of a system together with a continuous function f: X ? Rn representing the energies associated to these states or parameters such that the image of f represents a hypersurface in...
 theory by Joseph Bryngelson and Peter Wolynes in the late 1980s and early 1990s. This approach introduced the principle of minimal frustration, which asserts that evolution has selected the amino acid sequences of natural proteins so that interactions between side chains largely favor the molecule's acquisition of the folded state. Interactions that do not favor folding are selected against, although some residual frustration is expected to exist. A consequence of these evolutionarily selected sequences is that proteins are generally thought to have globally "funneled energy landscapes" (coined by José Onuchic) that are largely directed towards the native state. This "folding funnel
Folding funnel

The folding funnel hypothesis is a specific version of the energy landscape theory of protein folding, which assumes that a protein's native state corresponds to its free energy minimum under the solution conditions usually encountered in cell ....
" landscape allows the protein to fold to the native state through any of a large number of pathways and intermediates, rather than being restricted to a single mechanism. The theory is supported by both computational simulations of model proteins
Lattice protein

Lattice proteins are highly simplified computer models of proteins which are used to investigate protein folding.Because proteins are such large molecules, containing hundreds or thousands of atoms, it is not possible with current technology to simulate more than a few microseconds of their behaviour in all-atom detail....
 and numerous experimental studies, and it has been used to improve methods for protein structure prediction
Protein structure prediction

Protein structure prediction is one of the most important goals pursued by bioinformatics and theoretical chemistry. Its aim is the prediction of the three-dimensional structure of proteins from their amino acid sequences, sometimes including additional relevant information such as the structures of related proteins....
 and design
Protein design

Protein design is the design of new protein molecules from scratch, or the deliberate design of a new molecule by making calculated variations on a known structure....
.

Computational prediction of protein tertiary structure


De novo or ab initio techniques for computational protein structure prediction
Protein structure prediction

Protein structure prediction is one of the most important goals pursued by bioinformatics and theoretical chemistry. Its aim is the prediction of the three-dimensional structure of proteins from their amino acid sequences, sometimes including additional relevant information such as the structures of related proteins....
 is related to, but strictly distinct from, studies involving protein folding. Molecular Dynamics
Molecular dynamics

Molecular dynamics is a form of computer simulation in which atoms and molecules are allowed to interact for a period of time by approximations of known physics,...
 (MD) is an important tool for studying protein folding and dynamics in silico
In silico

In silico is an expression used to mean "performed on computer or via computer simulation." The phrase is coined in analogy to the Latin language phrases in vivo and in vitro which are commonly used in biology and refer to experiments done in living organisms and outside of living organisms, respectively....
. Because of computational cost, ab initio MD folding simulations with explicit water are limited to peptides and very small proteins. MD simulations of larger proteins remain restricted to dynamics of the experimental structure or its high-temperature unfolding. In order to simulate long time folding processes (beyond about 1 microsecond), like folding of small-size proteins (about 50 residues) or larger, some approximations or simplifications in protein models need to be introduced. An approach using reduced protein representation (pseudo-atoms representing groups of atoms are defined) and statistical potential
Statistical potential

In protein structure prediction, a statistical potential is an energy function derived from an analysis of known structures in the Protein Data Bank....
 is not only useful in protein structure prediction
Protein structure prediction

Protein structure prediction is one of the most important goals pursued by bioinformatics and theoretical chemistry. Its aim is the prediction of the three-dimensional structure of proteins from their amino acid sequences, sometimes including additional relevant information such as the structures of related proteins....
, but is also capable of reproducing the folding pathways.

Because of the many possible ways of folding, there can be many possible structures. A peptide consisting of just five amino acids can fold into over 100 billion possible structures.

Techniques for determination of protein structure


The determination of the folded structure of a protein is a lengthy and complicated process, involving methods like X-ray crystallography
X-ray crystallography

X-ray crystallography is a method of determining the arrangement of atoms within a crystal, in which a beam of X-rays strikes a crystal and scatters into many different directions....
 and NMR. One of the major areas of interest is the prediction of native structure
Protein structure prediction

Protein structure prediction is one of the most important goals pursued by bioinformatics and theoretical chemistry. Its aim is the prediction of the three-dimensional structure of proteins from their amino acid sequences, sometimes including additional relevant information such as the structures of related proteins....
 from amino-acid sequences alone using bioinformatics
Bioinformatics

Bioinformatics is the application of information technology to the field of molecular biology. The term bioinformatics was coined by Paulien Hogeweg in 1978 for the study of informatic processes in biotic systems....
 and computational simulation methods.

There are distributed computing projects which use idle CPU
Idle (CPU)

A computer processor is described as idle when it is not being used by any computer program.Programs which make use of CPU Idle Time mean that they run at a low priority so as not to impact programs that run at normal priority....
 time of personal computers to solve problems such as protein folding or prediction of protein structure. People can run these programs on their computer or PlayStation 3 to support them. See links below (for example Folding@Home
Folding@home

Folding@home is a distributed computing project designed to perform computationally intensive simulations of protein folding and other molecular dynamics ....
) to get information about how to participate in these projects.

See also

  • Protein structure prediction
    Protein structure prediction

    Protein structure prediction is one of the most important goals pursued by bioinformatics and theoretical chemistry. Its aim is the prediction of the three-dimensional structure of proteins from their amino acid sequences, sometimes including additional relevant information such as the structures of related proteins....
  • Folding (chemistry)
    Folding (chemistry)

    In chemistry folding is the process by which a molecule assumes its shape or Conformational isomerism. The process can also be described as molecular self-assembly where the molecule is directed to form a specific shape through noncovalent interactions, such as hydrogen bond, metal coordination, hydrophobic effect, van der Waals force...
  • Anfinsen's dogma
    Anfinsen's dogma

    Anfinsen's dogma is a postulate in molecular biology championed by the Nobel prize laureate Christian B. Anfinsen. The dogma states that, at least for small globular proteins, the protein structure is determined only by the protein's amino acid primary structure....
  • Levinthal paradox
    Levinthal paradox

    The Levinthal paradox is a thought experiment in the theory of protein folding dynamics. In 1969 Cyrus Levinthal noted that, because of the very large number of degrees of freedom in an unfolded polypeptide chain, the molecule has an astronomical number of possible conformations....
  • Denaturation (biochemistry)
    Denaturation (biochemistry)

    Denaturation is a process in which proteins or nucleic acids lose their structure by application of some external stress or compound for example, treatment of proteins with strong acids or bases, high concentrations of inorganic salts, organic compound solvents , or heat....
  • Protein design
    Protein design

    Protein design is the design of new protein molecules from scratch, or the deliberate design of a new molecule by making calculated variations on a known structure....
  • Chevron plot
    Chevron plot

    A chevron plot is a way of representing protein folding kinetics data in the presence of varying concentrations of Denaturation that disrupts the protein's native tertiary structure....
  • Denaturation midpoint
    Denaturation midpoint

    Assuming two-state protein folding, denaturation midpoint is defined as that temperature or denaturant concentration at which both the folded and Denaturation are equally populated at equilibrium....
  • Equilibrium unfolding
    Equilibrium unfolding

    In biochemistry, equilibrium unfolding is the process of protein folding by gradually changing its solution conditions, i.e., its environment. Since equilibrium is maintained at all steps, the process is reversible ....
  • Folding@Home
    Folding@home

    Folding@home is a distributed computing project designed to perform computationally intensive simulations of protein folding and other molecular dynamics ....
  • Rosetta@Home
    Rosetta@home

    Rosetta@home is a distributed computing project for protein structure prediction on the Berkeley Open Infrastructure for Network Computing platform, run by the David Baker at the University of Washington....
  • Downhill folding
    Downhill folding

    Downhill folding, a key prediction of the energy landscape theory, is a process in which a protein folds without encountering any significant macroscopic free energy barrier....
  • Foldit
    Foldit

    Foldit is an experimental video game about protein folding, developed as a collaboration between the University of Washington's departments of Computer Science and Engineering and Biochemistry ....
     computer game
  • Software for molecular mechanics modeling
    Software for molecular mechanics modeling

    This is a list of of computer programs that are predominantly used for molecular mechanics calculations.Min - Optimization,MD - Molecular Dynamics,...


External links