All Topics  
Secondary structure

 

   Email Print
   Bookmark   Link






 

Secondary structure



 
 
In biochemistry
Biochemistry

Biochemistry is the study of the chemistry processes in living organisms. It deals with the structure and function of cellular components such as proteins, carbohydrates, lipids, nucleic acids and other biomolecules....
 and structural biology
Structural biology

Structural biology is a branch of molecular biology, biochemistry, and biophysics concerned with the molecular structure of biological macromolecules, especially proteins and nucleic acids, how they acquire the structures they have, and how alterations in their structures affect their function....
, secondary structure is the general three-dimensional form of local segments of biopolymer
Biopolymer

Biopolymers are a class of polymers produced by living organisms.Starch, proteins and peptides,and DNA and RNA are all examples of biopolymers, in which the monomeric units, respectively, are sugars, amino acids, and nucleotides....
s such as protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
s and nucleic acid
Nucleic acid

A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
s (DNA/RNA). It does not, however, describe specific atomic positions in three-dimensional space, which are considered to be tertiary structure
Tertiary structure

In biochemistry and chemistry, the tertiary structure of a protein or any other macromolecule is its three-dimensional structure, as defined by the atomic coordinates....
.

Secondary structure is formally defined by the hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s of the biopolymer, as observed in an atomic-resolution structure.






Discussion
Ask a question about 'Secondary structure'
Start a new discussion about 'Secondary structure'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Myoglobin
In biochemistry
Biochemistry

Biochemistry is the study of the chemistry processes in living organisms. It deals with the structure and function of cellular components such as proteins, carbohydrates, lipids, nucleic acids and other biomolecules....
 and structural biology
Structural biology

Structural biology is a branch of molecular biology, biochemistry, and biophysics concerned with the molecular structure of biological macromolecules, especially proteins and nucleic acids, how they acquire the structures they have, and how alterations in their structures affect their function....
, secondary structure is the general three-dimensional form of local segments of biopolymer
Biopolymer

Biopolymers are a class of polymers produced by living organisms.Starch, proteins and peptides,and DNA and RNA are all examples of biopolymers, in which the monomeric units, respectively, are sugars, amino acids, and nucleotides....
s such as protein
Protein

Proteins are organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid Residue ....
s and nucleic acid
Nucleic acid

A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
s (DNA/RNA). It does not, however, describe specific atomic positions in three-dimensional space, which are considered to be tertiary structure
Tertiary structure

In biochemistry and chemistry, the tertiary structure of a protein or any other macromolecule is its three-dimensional structure, as defined by the atomic coordinates....
.

Secondary structure is formally defined by the hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
s of the biopolymer, as observed in an atomic-resolution structure. In proteins, the secondary structure is defined by patterns of hydrogen bonds between backbone amide and carboxyl groups (sidechain-mainchain and sidechain-sidechain hydrogen bonds are irrelevant), where the DSSP definition of a hydrogen bond
DSSP (protein)

In protein structure, the DSSP algorithm is the standard method for assigning secondary structure to the amino acids of a protein, given the atomic-resolution coordinates of the protein....
 is used. In nucleic acids, the secondary structure is defined by the hydrogen bonding between the nitrogenous bases.

The hydrogen bonding is correlated with other structural features, however, which has given rise to less formal definitions of secondary structure. For example, residues in protein helices generally adopt backbone dihedral angle
Dihedral angle

In geometry, the angle between two Plane s is called their dihedral or torsion angle.The dihedral angle of two planes can be seen by looking at the planes "edge on", i.e., along their line of intersection....
s in a particular region of the Ramachandran plot
Ramachandran plot

A Ramachandran plot , developed by Gopalasamudram Narayana Ramachandran, is a way to visualize dihedral angles φ against ψ of amino acid residues in protein structure....
; thus, a segment of residues with such dihedral angles is often called a "helix", regardless of whether it has the correct hydrogen bonds. Many other less formal definitions have been proposed, often applying concepts from the differential geometry of curves, such as curvature
Curvature

In mathematics, curvature refers to any of a number of loosely related concepts in different areas of geometry. Intuitively, curvature is the amount by which a geometric object deviates from being flat, or straight in the case of a line , but this is defined in different ways depending on the context....
 and torsion
Torsion

The term torsion may refer the following:*In geometry:** Torsion of curves** Torsion tensor in differential geometry** The closely related concepts of Reidemeister torsion and analytic torsion ...
. Least formally, structural biologists solving a new atomic-resolution structure will sometimes assign its secondary structure "by eye" and record their assignments in the corresponding PDB
Protein Data Bank (file format)

The Protein Data Bank file format is a textual file format describing the three dimensional structures of molecules held in the Protein Data Bank....
 file.

The rough secondary-structure content of a biopolymer (e.g., "this protein is 40% a-helix
Alpha helix

A common motif in the secondary structure of proteins, the alpha helix is a right- or left-handed coiled conformation, resembling a spring , in which every backbone amino group donates a hydrogen bond to the backbone carbonyl group of the amino acid four residues earlier ....
 and 20% ß-sheet
Beta sheet

The ? sheet is the second form of regular secondary structure in proteins consisting of beta strands connected laterally by three or more hydrogen bonds, forming a generally twisted, pleated sheet ....
.") can often be estimated spectroscopically
Spectroscopy

Spectroscopy was originally the study of the interaction between radiation and matter as a function of wavelength . In fact, historically, spectroscopy referred to the use of visible light dispersed according to its wavelength, e.g....
. For proteins, a common method is far-ultraviolet (far-UV, 170-250 nm) circular dichroism
Circular dichroism

Circular dichroism is the differential absorption of left- and right-handed circular polarization light.A CD Spectrometer is an instrument that records this phenomenon as a function of wavelength....
. A pronounced double minimum at 208 and 222 nm indicate a-helical structure, whereas a single minimum at 204 nm or 217 nm reflects random-coil or ß-sheet structure, respectively. A less common method is infrared
Infrared

Infrared radiation is electromagnetic radiation whose wavelength is longer than that of visible light , but shorter than that of terahertz radiation and microwaves ....
 spectroscopy, which detects differences in the bond oscillations of amide groups due to hydrogen-bonding. Finally, secondary-structure contents may be estimated accurately using the chemical shift
Chemical shift

In nuclear magnetic resonance , the chemical shift describes the dependence of nuclear magnetic energy levels on the electronic environment in a molecule....
s of an unassigned NMR
Nuclear magnetic resonance

Nuclear magnetic resonance is the name given to a physical resonance phenomenon involving the observation of specific quantum mechanics magnetism properties of an atomic atomic nucleus in the presence of an applied, external magnetic field....
 spectrum.

Secondary structure was introduced by Kaj Ulrik Linderstrøm-Lang in the 1952 Lane medical lectures at Stanford.

Proteins


Secondary structure in proteins consists of local inter-residue interactions mediated by hydrogen bonds, or not. The most common secondary structures are alpha helices
Alpha helix

A common motif in the secondary structure of proteins, the alpha helix is a right- or left-handed coiled conformation, resembling a spring , in which every backbone amino group donates a hydrogen bond to the backbone carbonyl group of the amino acid four residues earlier ....
 and beta sheet
Beta sheet

The ? sheet is the second form of regular secondary structure in proteins consisting of beta strands connected laterally by three or more hydrogen bonds, forming a generally twisted, pleated sheet ....
s. Other helices, such as the 310 helix and p helix
Pi helix

A pi helix is a type of secondary structure found in proteins. These structure are particularly common in membrane proteins...
, are calculated to have energetically favorable hydrogen-bonding patterns but are rarely if ever observed in natural proteins except at the ends of a helices due to unfavorable backbone packing in the center of the helix. Other extended structures such as the polyproline helix
Polyproline helix

In proteins, a left-handed polyproline II helix is formed when sequential residues all adopt backbone dihedral angles of roughly and have trans isomers of their peptide bonds....
 and alpha sheet
Alpha sheet

The alpha sheet is a hypothetical secondary structure in proteins, first proposed by Linus Pauling and Robert Corey in 1951. The hydrogen bonding pattern in an alpha sheet is similar to that of a beta sheet, but the orientation of the carbonyl and amino groups in the peptide bond units is distinctive; in a single alpha strand, all of the...
 are rare in native state
Native state

In biochemistry, the native state of a protein is its operative or functional form. All protein molecules are simple unbranched chains of amino acids, but it is by assuming a specific three-dimensional shape that they are able to perform their biological function....
 proteins but are often hypothesized as important protein folding
Protein folding

Protein folding is the physical process by which a polypeptide folds into its characteristic and functional protein structure.Each protein begins as a polypeptide, translated from a sequence of mRNA as a linear chain of amino acids....
 intermediates. Tight turns
Turn (biochemistry)

A turn is an element of secondary structure in proteins.According to the most common definition, a turn is defined by the close approach of two atoms , when the corresponding residues are not involved in a regular secondary structure element such as an alpha helix or beta sheet....
 and loose, flexible loops link the more "regular" secondary structure elements. The random coil
Random coil

A random coil is a polymer conformation where the monomer subunits are oriented randomness while still being chemical bond to graph units. It is not one specific shape, but a statistics distribution of shapes for all the chains in a statistical population of macromolecules....
 is not a true secondary structure, but is the class of conformations that indicate an absence of regular secondary structure.

Amino acid
Amino acid

In chemistry, an amino acid is a molecule containing both amine and carboxyl functional groups. These molecules are particularly important in biochemistry, where this term refers to alpha-amino acids with the general formula H2NCHRCOOH, where R is an organic substituent....
s vary in their ability to form the various secondary structure elements. Proline
Proline

Proline is an a-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that humans can synthesize it....
 and glycine
Glycine

Glycine is the organic compound with the chemical formula NH2CH2COOH. It is the smallest of the 20 amino acids commonly found in proteins, coded by codons GGU, GGC, GGA and GGG....
 are sometimes known as "helix breakers" because they disrupt the regularity of the a helical backbone conformation; however, both have unusual conformational abilities and are commonly found in turns
Turn (biochemistry)

A turn is an element of secondary structure in proteins.According to the most common definition, a turn is defined by the close approach of two atoms , when the corresponding residues are not involved in a regular secondary structure element such as an alpha helix or beta sheet....
. Amino acids that prefer to adopt helical
Alpha helix

A common motif in the secondary structure of proteins, the alpha helix is a right- or left-handed coiled conformation, resembling a spring , in which every backbone amino group donates a hydrogen bond to the backbone carbonyl group of the amino acid four residues earlier ....
 conformations in proteins include methionine
Methionine

Methionine is an a-amino acid with the chemical formula HO2CCHCH2CH2SCH3. This Essential amino acid is classified as nonpolar....
, alanine
Alanine

Alanine is an a-amino acid with the chemical formula CH3CHCOOH. The L-isomer is one of the 20 proteinogenic amino acids, i.e. the building blocks of proteins....
, leucine
Leucine

Leucine is an a-amino acid with the chemical formula HO2CCHCH2CH2. It is an essential amino acid, which means that humans cannot synthesise it....
, glutamate and lysine
Lysine

Lysine is an a-amino acid with the chemical formula HO2CCH4NH2. This amino acid is an essential amino acid, which means that humans cannot synthesize it....
 ("MALEK" in amino-acid
Amino acid

In chemistry, an amino acid is a molecule containing both amine and carboxyl functional groups. These molecules are particularly important in biochemistry, where this term refers to alpha-amino acids with the general formula H2NCHRCOOH, where R is an organic substituent....
 1-letter codes); by contrast, the large aromatic residues (tryptophan
Tryptophan

Tryptophan is one of the 20 List of standard amino acids, as well as an essential amino acid in the human diet. It is encoded in the standard genetic code as the codon UGG....
, tyrosine
Tyrosine

Tyrosine or 4-hydroxyphenylalanine, is one of the 20 amino acids that are used by cell to protein biosynthesis proteins. This is a non-essential amino acid and it is found in casein....
 and phenylalanine
Phenylalanine

Phenylalanine is an a-amino acid with the chemical formula HO2CCHCH2C6H5, which is found naturally in the breast milk of mammals and manufactured for food and drink products and are also sold as nutritional supplements for their reputed analgesic and antidepressant effects....
) and -branched amino acids (isoleucine
Isoleucine

Isoleucine is an a-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be part of our diet....
, valine
Valine

Valine is an a-amino acid with the chemical formula HO2CCHCH2. L-Valine is one of 20 proteogenic amino acids....
, and threonine
Threonine

Threonine is an a-amino acid with the chemical formula HO2CCHCHCH3. Its codons are ACU, ACA, ACC, and ACG. This essential amino acid is classified as Chemical polarity....
) prefer to adopt ß-strand
Beta sheet

The ? sheet is the second form of regular secondary structure in proteins consisting of beta strands connected laterally by three or more hydrogen bonds, forming a generally twisted, pleated sheet ....
 conformations. However, these preferences are not strong enough to produce a reliable method of predicting secondary structure from sequence alone.

The DSSP code


There are several methods for defining protein secondary structure (e.g. STRIDE, but DEFINE) the Dictionary of Protein Secondary Structure, DSSP, method is commonly used to describe the protein secondary structure with single letter codes. The secondary structure is assigned based on hydrogen bonding patterns as those initially proposed by Pauling et al. in 1951 (before any protein structure
Protein structure

Proteins are an important class of biological macromolecules present in all biological organisms, made up of such chemical element as carbon,hydrogen, nitrogen, oxygen, and sulphur....
 had ever been experimentally determined). There are eight types of secondary structure that DSSP defines:

  • G = 3-turn helix (310 helix
    3 10 helix

    A 310 helix is a type of secondary structure found in proteins....
    ). Min length 3 residues.
  • H = 4-turn helix (a helix). Min length 4 residues.
  • I = 5-turn helix (p helix). Min length 5 residues.
  • T = hydrogen bonded turn (3, 4 or 5 turn)
  • E = extended strand in parallel and/or anti-parallel ß-sheet conformation. Min length 2 residues.
  • B = residue in isolated ß-bridge (single pair ß-sheet hydrogen bond formation)
  • S = bend (the only non-hydrogen-bond based assignment)


Amino acid residues which are not in any of the above conformations are assigned as the eighth type 'Coil': often codified as ' ' (space), C (coil) or '-' (dash). The helices (G,H and I) and sheet conformations are all required to have a reasonable length. This means that 2 adjacent residues in the primary structure must form the same hydrogen bonding pattern. If the helix or sheet hydrogen bonding pattern is too short they are designated as T or B, respectively. Other protein secondary structure assignment categories exist (sharp turns, Omega loops etc.), but they are less frequently used.

DSSP H-bond definition


Secondary structure is defined by hydrogen bond
Hydrogen bond

A hydrogen bond is the attractive force between one electronegative atom and a hydrogen covalently bonded to another electronegative atom. It results from a dipole-dipole force with a hydrogen atom bonded to nitrogen, oxygen or fluorine ....
ing, so the exact definition of a hydrogen bond is critical. The standard H-bond definition for secondary structure is that of DSSP
DSSP

DSSP may refer to:*Dessert spoon, a spoon with a capacity of about 2 teaspoons*DSSP , a programming language*DSSP , a method of scanning objects into 3D digital representations...
, which is a purely electrostatic model. It assigns charges of to the carbonyl carbon and oxygen, respectively, and charges of to the amide nitrogen and hydrogen, respectively. The electrostatic energy is

According to DSSP
DSSP

DSSP may refer to:*Dessert spoon, a spoon with a capacity of about 2 teaspoons*DSSP , a programming language*DSSP , a method of scanning objects into 3D digital representations...
, an H-bond exists if and only if is less than -0.5 kcal/mol. Although the DSSP formula is a relatively crude approximation of the physical H-bond energy, it is generally accepted as a tool for defining secondary structure.

Protein secondary-structure prediction


Predicting protein tertiary structure from only its amino acid sequence is a very challenging problem (see Protein structure prediction
Protein structure prediction

Protein structure prediction is one of the most important goals pursued by bioinformatics and theoretical chemistry. Its aim is the prediction of the three-dimensional structure of proteins from their amino acid sequences, sometimes including additional relevant information such as the structures of related proteins....
), but using the simpler secondary structure definitions is more tractable and has been the focus for research for a long time.

Although, the 8-state DSSP code is already a simplification from the 20 amino acid residues present in a protein the majority of secondary prediction methods simplify further to the three dominant states: Helix, Sheet and Coil. How the conversion is made from 8- to 3-state varies between methods. Early methods of secondary-structure prediction were based on the helix- or sheet-forming propensities of individual amino acids, sometimes coupled with rules for estimating the free energy of forming secondary structure elements. Such methods were typically ~60% accurate in predicting which of the three states (helix/sheet/coil) a residue adopts. A significant increase in accuracy (to nearly ~80%) was made by exploiting multiple sequence alignment
Multiple sequence alignment

A multiple sequence alignment is a sequence alignment of three or more biological sequences, generally protein, DNA, or RNA. In general, the input set of query sequences are assumed to have an evolutionary relationship by which they share a lineage and are descended from a common ancestor....
; knowing the full distribution of amino acids that occur at a position (and in its vicinity, typically ~7 residues on either side) throughout evolution
Evolution

In biology, evolution is change in the heritability trait of a population of organisms from one generation to the next. These changes are caused by a combination of three main processes: variation, reproduction, and selection....
 provides a much better picture of the structural tendencies near that position. For illustration, a given protein might have a glycine
Glycine

Glycine is the organic compound with the chemical formula NH2CH2COOH. It is the smallest of the 20 amino acids commonly found in proteins, coded by codons GGU, GGC, GGA and GGG....
 at a given position, which by itself might suggest a random coil there. However, multiple sequence alignment might reveal that helix-favoring amino acids occur at that position (and nearby positions) in 95% of homologous proteins spanning nearly a billion years of evolution. Moreover, by examining the average hydrophobicity at that and nearby positions, the same alignment might also suggest a pattern of residue solvent accessibility
Accessible surface area

The accessible surface area is the surface area of a biomolecule that is accessible to a solvent. The ASA is usually quoted in square ?ngstrom ....
 consistent with an a-helix. Taken together, these factors would suggest that the glycine of the original protein adopts a-helical structure, rather than random coil. Several types of methods are used to combine all the available data to form a 3-state prediction, including neural network
Neural network

Traditionally, the term neural network had been used to refer to a network or circuit of neuron. The modern usage of the term often refers to artificial neural networks, which are composed of artificial neurons or nodes....
s, hidden Markov model
Hidden Markov model

A hidden Markov model is a statistical model in which the system being modeled is assumed to be a Markov process with unknown parameters; the challenge is to determine the hidden parameters from the observable data....
s and support vector machine
Support vector machine

Support vector machines are a set of related supervised learning methods used for statistical classification and regression analysis. Viewing input data as two sets of vectors in an high-dimensional, an SVM will construct a separating hyperplane in that hyperspace, one which maximizes the margin between the two data sets....
s. Modern prediction methods also provide a confidence score for their predictions at every position.

Secondary-structure prediction methods are continuously benchmarked, e.g., in the experiment. Based on ~270 weeks of testing, the most accurate methods at present are , , , and . Interestingly, it does not seem to be possible to improve upon these methods by taking a consensus of them . The chief area for improvement appears to be the prediction of ß-strands; residues confidently predicted as ß-strand are likely to be so, but the methods are apt to overlook some ß-strand segments (false negatives). There is likely an upper limit of ~90% prediction accuracy overall, due to the idiosyncrasies of the standard method (DSSP
DSSP

DSSP may refer to:*Dessert spoon, a spoon with a capacity of about 2 teaspoons*DSSP , a programming language*DSSP , a method of scanning objects into 3D digital representations...
) for assigning secondary-structure classes (helix/strand/coil) to PDB structures, against which the predictions are benchmarked.

Accurate secondary-structure prediction is a key element in the prediction of tertiary structure
Tertiary structure

In biochemistry and chemistry, the tertiary structure of a protein or any other macromolecule is its three-dimensional structure, as defined by the atomic coordinates....
, in all but the simplest (homology modeling
Protein structure prediction

Protein structure prediction is one of the most important goals pursued by bioinformatics and theoretical chemistry. Its aim is the prediction of the three-dimensional structure of proteins from their amino acid sequences, sometimes including additional relevant information such as the structures of related proteins....
) cases. For example, a confidently predicted pattern of six secondary structure elements ßaßßaß is the signature of a ferredoxin
Ferredoxin

Ferredoxins are iron-sulfur proteins that mediate electron transfer in a range of metabolic reactions. The term "ferredoxin" was coined by D.C....
 fold.

Nucleic acids

Stem Loop
Nucleic acid
Nucleic acid

A nucleic acid is a macromolecule composed of chains of monomeric nucleotides. In biochemistry these molecules carry genetic information or form structures within Cell ....
s also have secondary structure, most notably single-stranded RNA
RNA

Ribonucleic acid is a type of molecule that consists of a long chain of nucleotide units. Each nucleotide consists of a nucleobase, a ribose sugar, and a phosphate....
 molecules. RNA secondary structure is generally divided into helices (contiguous base pairs), and various kinds of loops (unpaired nucleotides surrounded by helices). The stem-loop
Stem-loop

Stem-loop intramolecular base pairing is a pattern that can occur in single-stranded DNA or, more commonly, in RNA. The structure is also known as a hairpin or hairpin loop. It occurs when two regions of the same molecule, usually palindrome in nucleotide sequence, base-pair to form a double helix that ends in an unpaired loop....
 structure in which a base-paired helix ends in a short unpaired loop is extremely common and is a building block for larger structural motifs such as cloverleaf structures, which are four-helix junctions such as those found in transfer RNA
Transfer RNA

Transfer RNA is a small RNA that transfers a specific active amino acid to a growing polypeptide chain at the ribosomal site of protein synthesis during translation ....
. Internal loops (a short series of unpaired bases in a longer paired helix) and bulges (regions in which one strand of a helix has "extra" inserted bases with no counterparts in the opposite strand) are also frequent. Finally, both pseudoknot
Pseudoknot

A pseudoknot is an RNA structure containing at least two stem-loop structures in which half of one stem is intercalated between the two halves of another stem....
s and base triples are present in RNA (though not DNA).

Since it is almost entirely base pair-mediated, RNA secondary structure can be said to define which bases are paired in a molecule or complex. However, the traditional Watson-Crick base pair is not the only type of pairing that is permissible in RNA; Hoogsteen base pair
Hoogsteen base pair

A Hoogsteen base pair is a minor variation of base-pairing in nucleic acids such as the A?T pair. In this manner, two nucleobases on each strand can be held together by hydrogen bonds in the major groove....
ing is also common.

RNA secondary structure prediction


See also RNA structure
RNA structure

The functional form of single stranded RNA molecules frequently requires a specific tertiary structure. The scaffold for this structure is provided by secondary structural elements which are hydrogen bonds within the molecule....


One application of bioinformatics
Bioinformatics

Bioinformatics is the application of information technology to the field of molecular biology. The term bioinformatics was coined by Paulien Hogeweg in 1978 for the study of informatic processes in biotic systems....
 uses predicted RNA secondary structures in searching a genome
Genome

In classical genetics, the genome of a diploid organism including eukarya refers to a full set of chromosomes or genes in a gamete; thereby, a regular somatic cell contains two full sets of genomes....
 for noncoding but functional forms of RNA. For example, microRNA
Mirna

Mirna may refer to:people* Mirna * Mirna Jukic, a bronze medal winner in swimming* Mirna Khayat, a Lebanese music video director* Mirna Macur, a Slovenian social researcher...
s have canonical long stem-loop structures interrupted by small internal loops. A general method of calculating probable RNA secondary structure is dynamic programming
Dynamic programming

In mathematics and computer science, dynamic programming is a method of solving problems that exhibit the properties of overlapping subproblems and optimal substructure ....
, although this has the disadvantage that it cannot detect pseudoknot
Pseudoknot

A pseudoknot is an RNA structure containing at least two stem-loop structures in which half of one stem is intercalated between the two halves of another stem....
s or other cases in which base pairs are not fully nested. More general methods are based on stochastic context-free grammar
Stochastic context-free grammar

A stochastic context-free grammar is a context-free grammar in which each production is augmented with a probability. The probability of a derivation is then the product of the probabilities of the productions used in that derivation; thus some derivations are more consistent with the stochastic grammar than others....
s. A web server that implements a type of dynamic programming is .

For many RNA molecules, the secondary structure is highly important to the correct function of the RNA — often more so than the actual sequence. This fact aids in the analysis of non-coding RNA
Non-coding RNA

A non-coding RNA is a functional RNA molecule that is not Translation into a protein. Less-frequently used synonyms are non-protein-coding RNA , non-messenger RNA , small non-messenger RNA , functional RNA ....
 sometimes termed "RNA genes". RNA secondary structure can be predicted with some accuracy by computer and many bioinformatics
Bioinformatics

Bioinformatics is the application of information technology to the field of molecular biology. The term bioinformatics was coined by Paulien Hogeweg in 1978 for the study of informatic processes in biotic systems....
 applications use some notion of secondary structure in analysis of RNA.

Alignment


Both protein and RNA secondary structures can be used to aid in multiple sequence alignment
Multiple sequence alignment

A multiple sequence alignment is a sequence alignment of three or more biological sequences, generally protein, DNA, or RNA. In general, the input set of query sequences are assumed to have an evolutionary relationship by which they share a lineage and are descended from a common ancestor....
. These alignments can be made more accurate by the inclusion of secondary structure information in addition to simple sequence information. This is sometimes less useful in RNA because base pairing is much more highly conserved than sequence. Distant relationships between proteins whose primary structures are unalignable can sometimes be found by secondary structure.

See also

  • Folding (chemistry)
    Folding (chemistry)

    In chemistry folding is the process by which a molecule assumes its shape or Conformational isomerism. The process can also be described as molecular self-assembly where the molecule is directed to form a specific shape through noncovalent interactions, such as hydrogen bond, metal coordination, hydrophobic effect, van der Waals force...
  • primary structure
    Primary structure

    In biochemistry, the primary structure of a biological molecule is the exact specification of its atomic composition and the chemical bonds connecting those atoms ....
  • tertiary structure
    Tertiary structure

    In biochemistry and chemistry, the tertiary structure of a protein or any other macromolecule is its three-dimensional structure, as defined by the atomic coordinates....
  • quaternary structure
    Quaternary structure

    In biochemistry, quaternary structure is the arrangement of multiple protein folding protein molecules in a multi-subunit complex....
  • translation
  • structural motif
    Structural motif

    In an unbranched, polymer biological molecule, such as a protein or a strand of RNA, a structural motif is a three-dimensional structural element or protein folding within the chain, which appears also in a variety of other molecules....


Further reading

  • C Branden and J Tooze (1999). Introduction to Protein Structure 2nd ed. Garland Publishing: New York, NY.


  • M. Zuker "Computer prediction of RNA structure", Methods in Enzymology, 180:262-88 (1989). (The classic paper on dynamic programming algorithms to predict RNA secondary structure.)


  • L. Pauling and R.B Corey. Configurations of polypeptide chains with favored orientations of the polypeptide around single bonds: Two pleated sheets. Proc. Natl. Acad. Sci. Wash., 37:729-740 (1951). (The original beta-sheet conformation article.)


  • L. Pauling, R.B. Corey and H.R. Branson. Two hydrogen-bonded helical configurations of the polypeptide chain. Proc. Natl. Acad. Sci. Wash., 37:205-211 (1951). (alpha- and pi-helix conformations, since they predicted that helices would not be possible.)


External links

  • Rasmol
    RasMol

    RasMol is a computer program written for molecular graphics visualization intended and used primarily for the depiction and exploration of Structural biology, such as those found in the Protein Data Bank....
     (protein visualization program, implements DSSP)