All Topics  
Protein

 

 

 

 

 

Protein


 
 


Proteins are large organic compoundOrganic compound

An organic compound is any member of a large class of chemical compounds whose molecules contain carbon and hydrogen; theref...
s made of amino acidAmino acid Summary

In chemistry, an amino acid is any molecule that contains both amine and carboxyl functional groups....
s arranged in a linear chain and joined together by peptide bondPeptide bond

A peptide bond is a chemical bond formed between two molecules when the carboxyl group of one molecule reacts with the amino...
s between the carboxyl and amino groups of adjacent amino acid residuesResidue (chemistry)

In chemistry, residue refers to a portion of a larger molecule, such as a methyl group....
. The sequence of amino acids in a protein is defined by a geneFacts About Gene

A gene is the unit of heredity in living organisms....
 and encoded in the genetic codeGenetic code

The genetic code is the set of rules by which information encoded in genetic material is translated into proteins by livin...
. Although this genetic code specifies 20 "standard" amino acids plus selenocysteineSelenocysteine

Selenocysteine is an amino acid that is present in several enzymes ....
 and - in certain archaeaArchaea

Archaea , also called Archaebacteria , is a major division of living organisms....
 - pyrrolysinePyrrolysine

Pyrrolysine is a naturally-occurring genetically-coded amino acid....
, the residues in a protein are sometimes chemically altered in post-translational modification: either before the protein can function in the cellCell (biology)

The cell is the structural and functional unit of all living organisms, and is sometimes called the "building block of life....
, or as part of control mechanisms. Proteins can also work together to achieve a particular function, and they often associate to form stable complexProtein complex

A protein complex is a group of two or more associated proteins....
es.

Like other biological macromolecules such as polysaccharidePolysaccharide

Polysaccharides are relatively complex carbohydrates....
s and nucleic acidNucleic acid

A nucleic acid is a complex, high-molecular-weight biochemical macromolecule composed of nucleotide chains that convey genet...
s, proteins are essential parts of organisms and participate in every process within cellCell (biology) Summary

The cell is the structural and functional unit of all living organisms, and is sometimes called the "building block of life....
s.






Discussion
Ask a question about 'Protein'
Start a new discussion about 'Protein'
Answer questions from other users
Full Discussion Forum






Timeline

1838   Proteins discovered by Jöns Jakob Berzelius






Encyclopedia




Proteins are large organic compoundOrganic compound

An organic compound is any member of a large class of chemical compounds whose molecules contain carbon and hydrogen; theref...
s made of amino acidAmino acid Summary

In chemistry, an amino acid is any molecule that contains both amine and carboxyl functional groups....
s arranged in a linear chain and joined together by peptide bondPeptide bond

A peptide bond is a chemical bond formed between two molecules when the carboxyl group of one molecule reacts with the amino...
s between the carboxyl and amino groups of adjacent amino acid residuesResidue (chemistry)

In chemistry, residue refers to a portion of a larger molecule, such as a methyl group....
. The sequence of amino acids in a protein is defined by a geneFacts About Gene

A gene is the unit of heredity in living organisms....
 and encoded in the genetic codeGenetic code

The genetic code is the set of rules by which information encoded in genetic material is translated into proteins by livin...
. Although this genetic code specifies 20 "standard" amino acids plus selenocysteineSelenocysteine

Selenocysteine is an amino acid that is present in several enzymes ....
 and - in certain archaeaArchaea

Archaea , also called Archaebacteria , is a major division of living organisms....
 - pyrrolysinePyrrolysine

Pyrrolysine is a naturally-occurring genetically-coded amino acid....
, the residues in a protein are sometimes chemically altered in post-translational modification: either before the protein can function in the cellCell (biology)

The cell is the structural and functional unit of all living organisms, and is sometimes called the "building block of life....
, or as part of control mechanisms. Proteins can also work together to achieve a particular function, and they often associate to form stable complexProtein complex

A protein complex is a group of two or more associated proteins....
es.

Like other biological macromolecules such as polysaccharidePolysaccharide

Polysaccharides are relatively complex carbohydrates....
s and nucleic acidNucleic acid

A nucleic acid is a complex, high-molecular-weight biochemical macromolecule composed of nucleotide chains that convey genet...
s, proteins are essential parts of organisms and participate in every process within cellCell (biology) Summary

The cell is the structural and functional unit of all living organisms, and is sometimes called the "building block of life....
s. Many proteins are enzymeEnzyme

Enzymes are proteins that accelerate, or catalyze, chemical reactions....
s that catalyzeFacts About Catalysis

In chemistry and biology, catalysis is the acceleration of a chemical reaction by means of a substance, called a catalyst,...
 biochemical reactions and are vital to metabolismMetabolism

Metabolism is the biochemical modification of chemical compounds in living organisms and cells....
. Proteins also have structural or mechanical functions, such as actinActin

Actin is a globular structural protein that polymerizes in a helical fashion to form an actin filament....
 and myosinMyosin

Myosins are a large family of motor proteins found in eukaryotic tissues....
 in muscle and the proteins in the cytoskeletonCytoskeleton

...
, which form a system of scaffoldingScaffolding

Scaffolding is a temporary framework used to support people and material in the construction or repair of buildings and othe...
 that maintains cell shape. Other proteins are important in cell signalingCell signaling

Cell signaling is part of a complex system of communication that governs basic cellular activities and coordinates cell acti...
, immune responseAntibody

An antibody or immunoglobulin is a large Y-shaped protein used by the immune system to identify and neutralize foreign...
s, cell adhesionCell adhesion

The study of cell adhesion is part of cell biology....
, and the cell cycleCell cycle

The cell cycle, or cell-division cycle , is the series of events in a eukaryotic cell between one cell division and th...
. Proteins are also necessary in animals' diets, since animals cannot synthesize all the amino acids they need and must obtain essential amino acidEssential amino acid

An essential amino acid or 'indispensible amino acid, is an amino acid that cannot be synthesized de novo by the o...
s from food. Through the process of digestionDigestion Summary

For the industrial process see anaerobic digestion...
, animals break down ingested protein into free amino acids that are then used in metabolism.

The word protein comes from the GreekGreek language

Greek has a documented history of 3,500 years, the longest of any single language within the Indo-European family....
 word p??ta ("prota"), meaning "of primary importance." Proteins were first described and named by the Swedish chemist Jöns Jakob BerzeliusJöns Jakob Berzelius

Jns Jakob Berzelius was a Swedish chemist....
 in 1838. However, the central role of proteins in living organisms was not fully appreciated until 1926, when James B. SumnerFacts About James B. Sumner

James Batcheller Sumner was an American chemist....
 showed that the enzyme ureaseUrease

Urease is an enzyme that catalyzes the hydrolysis of urea into carbon dioxide and ammonia....
 was a protein. The first protein to be sequenced was insulinInsulin

Insulin is a polypeptide hormone that regulates carbohydrate metabolism....
, by Frederick SangerFrederick Sanger

Frederick Sanger, OM, CH, CBE, FRS is an English biochemist and a two times Nobel laureate in Chemistry....
, who won the Nobel Prize for this achievement in 1958. The first protein structures to be solved included hemoglobinHemoglobin

Hemoglobin or haemoglobin is the iron-containing oxygen-transport metalloprotein in the red cells of the blood in mam...
 and myoglobinFacts About Myoglobin

Myoglobin is a single-chain protein of 153 amino acids, containing a heme group in the center....
, by Max PerutzMax Perutz

Max Ferdinand Perutz, OM was an Austrian-British molecular biologist....
 and Sir John Cowdery KendrewJohn Kendrew Summary

John Cowdery Kendrew was an English biochemist and crystallographer who shared the 1962 Nobel Prize in Chemistry with Max Pe...
, respectively, in 1958. The three-dimensional structures of both proteins were first determined by x-ray diffraction analysis; Perutz and Kendrew shared the 1962 Nobel Prize in ChemistryNobel Prize in Chemistry Overview

This is a list of Nobel Prize laureates in Chemistry from 1901 to 2005....
 for these discoveries.

Biochemistry





Proteins are linear polymers built from 20 different L-a-amino acidAmino acid

In chemistry, an amino acid is any molecule that contains both amine and carboxyl functional groups....
s. All amino acids possess common structural features, including an a carbonAlpha carbon

The alpha carbon in organic chemistry refers to the first carbon after the carbon that attaches to the functional group....
 to which an amino group, a carboxyl group, and a variable side chainSide chain

A side chain in organic chemistry and biochemistry which is a part of a molecule that is attached to a core structure....
 are bondedChemical bond

A chemical bond is the physical phenomenon of chemical species being held together by attraction of atoms to each other thro...
. Only proline differs from this basic structure as it contains an unusual ring to the N-end amine group, which forces the CO–NH amide moiety into a fixed conformation. The side chains of the standard amino acids, detailed in the list of standard amino acidsList of standard amino acids

This list of standard amino acids details the chemical structures and properties of the twenty standard amino acids used in ...
, have different chemical properties that produce three-dimensional protein structure and are therefore critical to protein function. The amino acids in a polypeptide chain are linked by peptide bondPeptide bond

A peptide bond is a chemical bond formed between two molecules when the carboxyl group of one molecule reacts with the amino...
s formed in a dehydration reactionDehydration reaction

In chemistry, a dehydration reaction is a chemical reaction that involves the loss of water from the reacting molecule....
. Once linked in the protein chain, an individual amino acid is called a residue, and the linked series of carbon, nitrogen, and oxygen atoms are known as the main chain or protein backbone. The peptide bond has two resonanceResonance (chemistry) Summary

Resonance in chemistry is a tool used to represent certain types of molecular structures....
 forms that contribute some double-bond character and inhibit rotation around its axis, so that the alpha carbons are roughly coplanar. The other two dihedral angleDihedral angle Summary

In geometry, the angle between two planes is called their dihedral angle....
s in the peptide bond determine the local shape assumed by the protein backbone.

Due to the chemical structure of the individual amino acids, the protein chain has directionality. The end of the protein with a free carboxyl group is known as the C-terminus or carboxy terminus, whereas the end with a free amino group is known as the N-terminus or amino terminus.

The words protein, polypeptide, and peptidePeptide

Peptides , are the family of short molecules formed from the linking, in a defined order, of various a-amino acids....
are a little ambiguous and can overlap in meaning. Protein is generally used to refer to the complete biological molecule in a stable conformationTertiary structure

In biochemistry, the tertiary structure of a protein is its overall shape, also known as its fold....
, whereas peptide is generally reserved for a short amino acid oligomers often lacking a stable three-dimensional structure. However, the boundary between the two is not well defined and usually lies near 20–30 residues. Polypeptide can refer to any single linear chain of amino acids, usually regardless of length, but often implies an absence of a defined conformationTertiary structure

In biochemistry, the tertiary structure of a protein is its overall shape, also known as its fold....
.

Synthesis


Proteins are assembled from amino acids using information encoded in geneGene

A gene is the unit of heredity in living organisms....
s. Each protein has its own unique amino acid sequence that is specified by the nucleotideNucleotide

A nucleotide is a chemical compound that consists of a heterocyclic base, a sugar, and one or more phosphate groups....
 sequence of the gene encoding this protein. The genetic codeFacts About Genetic code

The genetic code is the set of rules by which information encoded in genetic material is translated into proteins by livin...
 is a set of three-nucleotide sets called codons and each three-nucleotide combination stands for an amino acid, for example AUG stands for methionineMethionine Summary

Methionine is an essential nonpolar amino acid, and a lipotropic....
. Because DNADNA

Deoxyribonucleic acid is a nucleic acid that contains the genetic instructions for the biological development of a cellu...
 contains four nucleotides, the total number of possible codons is 64; hence, there is some redundancy in the genetic code, with some amino acids specified by more than one codon. Genes encoded in DNA are first transcribedTranscription (genetics)

Transcription is the process through which a DNA sequence is enzymatically copied by an RNA polymerase to produce a complem...
 into pre-messenger RNAMessenger RNA

Messenger Ribonucleic Acid is RNA that encodes and carries information from DNA during transcription to sites of protein sy...
 (mRNA) by proteins such as RNA polymeraseFacts About RNA polymerase

RNA polymerase is an enzyme responsible for making RNA from a DNA template....
. Most organisms then process the pre-mRNA (also known as a primary transcript) using various forms of post-transcriptional modificationPost-transcriptional modification

Post transcriptional modification is a genetic process in cell biology by which, in eukaryotic cells precursor messenger RNA...
 to form the mature mRNA, which is then used as a template for protein synthesis by the ribosomeRibosome Overview

A ribosome is an organelle composed of ribosomal RNA and ribosomal proteins ....
. In prokaryoteProkaryote

Prokaryotes are organisms without a cell nucleus , or indeed any other membrane-bound organelles, in most cases unicellula...
s the mRNA may either be used as soon as it is produced, or be bound by a ribosome after having moved away from the nucleoidNucleoid

In prokaryotes, the nucleoid is an irregularly shaped region within the cell where the genetic material is localised....
. In contrast, eukaryoteEukaryote

|-| style = "background: pink; padding: 4px;" | Animalia - Animals...
s make mRNA in the cell nucleusCell nucleus

In cell biology, the nucleus is an organelle found in most eukaryotic cells....
 and then translocate it across the nuclear membrane into the cytoplasmCytoplasm

Cytoplasm is a jelly-like material that fills cells....
, where protein synthesisProtein biosynthesis

Protein biosynthesis is the process in which cells build proteins....
 then takes place. The rate of protein synthesis is higher in prokaryotes than eukaryotes and can reach up to 20 amino acids per second.

The process of synthesizing a protein from an mRNA template is known as translationTranslation (genetics)

Translation is the second process of protein biosynthesis....
. The mRNA is loaded onto the ribosome and is read three nucleotides at a time by matching each codon to its base pairFacts About Base pair

In molecular biology, two nucleotides on opposite complementary DNA or RNA strands that are connected via hydrogen bonds are calle...
ing anticodon located on a transfer RNATransfer RNA

Transfer RNA is a small RNA chain that transfers a specific amino acid to a growing polypeptide chain at the ribosomal site of pro...
 molecule, which carries the amino acid corresponding to the codon it recognizes. The enzyme aminoacyl tRNA synthetaseAminoacyl tRNA synthetase

An aminoacyl tRNA synthetase is an enzyme that catalyzes the esterification of a specific amino acid to its cognate tRNA to form a...
 "charges" the tRNA molecules with the correct amino acids. The growing polypeptide is often termed the nascent chain. Proteins are always biosynthesized from N-terminus to C-terminus.

The size of a synthesized protein can be measured by the number of amino acids it contains and by its total molecular massMolecular mass

The molecular mass of a substance, formerly also called molecular weight and abbreviated as MW, is the mass of ...
, which is normally reported in units of daltons (synonymous with atomic mass unitAtomic mass unit

The unified atomic mass unit , or dalton , is a small unit of mass used to express atomic and molecular masses....
s), or the derivative unit kilodalton (kDa). YeastYeast

Yeasts are single-celled fungi, a few species of which are commonly used to leaven bread, ferment alcoholic beverages, and ...
 proteins are on average 466 amino acids long and 53 kDa in mass. The largest known proteins are the titinTitin

Titin, also known as connectin, is a protein that is important in the contraction of striated muscle tissues....
s, a component of the muscleMuscle

Muscle is contractile tissue of the body and is derived from the mesodermal layer of embryonic germ cells....
 sarcomereSarcomere

A sarcomere is the basic unit of a cross striated muscle's myofibril....
, with a molecular mass of almost 3,000 kDa and a total length of almost 27,000 amino acids.

Chemical synthesis

Short proteins can also be synthesized chemically by a family of methods known as peptide synthesisPeptide synthesis

In organic chemistry, peptide synthesis is the creation of peptides, which are organic compounds in which multiple amino aci...
, which rely on organic synthesisOrganic synthesis

Organic synthesis is the construction of organic molecules via chemical processes....
 techniques such as chemical ligationChemical ligation

Chemical Ligation ist a set of techniques used for creating long peptide or protein chains....
 to produce peptides in high yield. Chemical synthesis allows for the introduction of non-natural amino acids into polypeptide chains, such as attachment of fluorescent probes to amino acid side chains. These methods are useful in laboratory biochemistryBiochemistry

Biochemistry is the study of the chemical processes and chemical transformations in living organisms....
 and cell biologyCell biology

Cell biology is an academic discipline that studies cells....
, though generally not for commercial applications. Chemical synthesis is inefficient for polypeptides longer than about 300 amino acids, and the synthesized proteins may not readily assume their native tertiary structureTertiary structure

In biochemistry, the tertiary structure of a protein is its overall shape, also known as its fold....
. Most chemical synthesis methods proceed from C-terminus to N-terminus, opposite the biological reaction.

Structure of proteins


Most proteins foldProtein folding

Protein folding is the process by which a protein assumes its characteristic functional shape or tertiary structure, also kn...
 into unique 3-dimensional structures. The shape into which a protein naturally folds is known as its native stateNative state

In biochemistry, the native state of a protein is its operative or functional form....
. Although many proteins can fold unassisted, simply through the chemical properties of their amino acids, others require the aid of molecular chaperones to fold into their native states. Biochemists often refer to four distinct aspects of a protein's structure:
  • Primary structurePrimary structure Overview

    In biochemistry, the primary structure of a biological molecule is the exact specification of its atomic composition and the...
    : the amino acid sequencePeptide sequence

    Peptide sequence or amino acid sequence is the order in which amino acid residues, connected by peptide bonds, lie in ...
  • Secondary structureSecondary structure

    Secondary structure in biochemistry and structural biology describes the general three-dimensional form of local segments'...
    : regularly repeating local structures stabilized by hydrogen bondHydrogen bond

    In chemistry, a hydrogen bond is a type of attractive intermolecular force that exists between two partial electric charges ...
    s. The most common examples are the alpha helixAlpha helix

    A common motif in the secondary structure of proteins, the alpha helix is...
     and beta sheetBeta sheet

    The β sheet is a commonly occurring form of regular secondary structure in proteins....
    . Because secondary structures are local, many regions of different secondary structure can be present in the same protein molecule.
  • Tertiary structureTertiary structure

    In biochemistry, the tertiary structure of a protein is its overall shape, also known as its fold....
    : the overall shape of a single protein molecule; the spatial relationship of the secondary structures to one another. Tertiary structure is generally stabilized by nonlocal interactions, most commonly the formation of a hydrophobic core, but also through salt bridgeSalt bridge (protein)

    In protein chemistry, the term salt bridge or salt bond is used to denote chemical bonds between positively an...
    s, hydrogen bonds, disulfide bondFacts About Disulfide bond

    In chemistry, a disulfide bond is a single covalent bond between two sulfur atoms that are themselves not bonded to sulfur....
    s, and even post-translational modifications. The term "tertiary structure" is often used as synonymous with the term fold.
  • Quaternary structureQuaternary structure

    In biochemistry, quaternary structure is the arrangement of multiple folded protein molecules in a multi-subunit complex....
    : the shape or structure that results from the interactionProtein-protein interaction

    Protein-protein interactions refer to the association of protein molecules and the study of these associations from the pers...
     of more than one protein molecule, usually called protein subunitProtein subunit

    In structural biology, a protein subunit or subunit protein is a single protein molecule that assembles with other pro...
    s
    in this context, which function as part of the larger assembly or protein complexProtein complex

    A protein complex is a group of two or more associated proteins....
    .



Proteins are not entirely rigid molecules. In addition to these levels of structure, proteins may shift between several related structures while they perform their biological function. In the context of these functional rearrangements, these tertiary or quaternary structures are usually referred to as "conformations", and transitions between them are called conformational changes. Such changes are often induced by the binding of a substrateSubstrate (biochemistry) Summary

In biochemistry, a substrate is a molecule upon which an enzyme acts....
 molecule to an enzyme's active siteActive site

The active site of an enzyme is the binding site where catalysis occurs....
, or the physical region of the protein that participates in chemical catalysis. In solution all proteins also undergo variation in structure through thermal vibration and the collision with other molecules, see the animation on the right.


Proteins can be informally divided into three main classes, which correlate with typical tertiary structures: globular proteinGlobular protein Summary

Globular proteins, or spheroproteins are one of the two main protein classes, comprising globelike proteins that are m...
s, fibrous proteinFibrous protein

Fibrous proteins, also called scleroproteins, are long filamentous protein molecules that form one of the two main cla...
s, and membrane proteinMembrane protein

A membrane protein is a protein molecule that is attached to, or associated with the membrane of a cell or an organelle....
s. Almost all globular proteins are soluble and many are enzymes. Fibrous proteins are often structural; membrane proteins often serve as receptorsReceptor (biochemistry)

In biochemistry, a receptor is a protein on the cell membrane or within the cytoplasm or cell nucleus that binds to a specif...
 or provide channels for polar or charged molecules to pass through the cell membrane.

A special case of intramolecular hydrogen bonds within proteins, poorly shielded from water attack and hence promoting their own dehydrationDehydration

Dehydration is the removal of water from an object....
, are called dehydronDehydron

A dehydron is an intramolecular hydrogen bond poorly shielded from water attack, with a propensity to promote its own dehydr...
s.

Structure determination

Discovering the tertiary structure of a protein, or the quaternary structure of its complexes, can provide important clues about how the protein performs its function. Common experimental methods of structure determination include X-ray crystallographyX-ray crystallography

X-ray crystallography is a technique in crystallography in which the pattern produced by the diffraction of X-rays through t...
 and NMR spectroscopy, both of which can produce information at atomAtom

In chemistry and physics, an atom is the smallest possible particle of a chemical element that retains its chemical propert...
ic resolution. Cryoelectron microscopy is used to produce lower-resolution structural information about very large protein complexes, including assembled virusFacts About Virus

A virus is a microscopic particle that can infect the cells of a biological organism....
es; a variant known as electron crystallographyElectron crystallography

Electron crystallography is a method to determine the arrangement of atoms in solids using an electron microscope....
 can also produce high-resolution information in some cases, especially for two-dimensional crystals of membrane proteins. Solved structures are usually deposited in the Protein Data BankProtein Data Bank Summary

The Protein Data Bank is a repository for 3-D structural data of proteins and nucleic acids....
 (PDB), a freely available resource from which structural data about thousands of proteins can be obtained in the form of Cartesian coordinates for each atom in the protein.

Many more gene sequences are known than protein structures. Further, the set of solved structures is biased toward proteins that can be easily subjected to the conditions required in X-ray crystallographyX-ray crystallography

X-ray crystallography is a technique in crystallography in which the pattern produced by the diffraction of X-rays through t...
, one of the major structure determination methods. In particular, globular proteins are comparatively easy to crystallize in preparation for X-ray crystallography. Membrane proteins, by contrast, are difficult to crystallize and are underrepresented in the PDB. Structural genomicsStructural genomics

Structural genomics consists in the determination of the three dimensional structure of all proteins of a given organism, by...
 initiatives have attempted to remedy these deficiencies by systematically solving representative structures of major fold classes. Protein structure predictionProtein structure prediction

Protein structure prediction is one of the most significant technologies pursued by computational structural biology and the...
 methods attempt to provide a means of generating a plausible structure for proteins whose structures have not been experimentally determined.

Cellular functions



Proteins are the chief actors within the cell, said to be carrying out the duties specified by the information encoded in genes. With the exception of certain types of RNARNA Summary

Ribonucleic acid is a nucleic acid polymer consisting of nucleotide monomers....
, most other biological molecules are relatively inert elements upon which proteins act. Proteins make up half the dry weight of an Escherichia coliEscherichia coli

Escherichia coli , usually abbreviated to E....
cell, whereas other macromolecules such as DNA and RNA make up only 3% and 20%, respectively. The set of proteins expressed in a particular cell or cell type is known as its proteomeProteome

The term proteome was coined by Mark Wilkins in 1995 and is used to describe the entire complement of proteins in a given b...
.



The chief characteristic of proteins that allows their diverse set of functions is their ability to bind other molecules specifically and tightly. The region of the protein responsible for binding another molecule is known as the binding siteBinding site

In biochemistry, a binding site is a region on a protein, DNA, or RNA to which specific other molecules and ions — in ...
 and is often a depression or "pocket" on the molecular surface. This binding ability is mediated by the tertiary structure of the protein, which defines the binding site pocket, and by the chemical properties of the surrounding amino acids' side chains. Protein binding can be extraordinarily tight and specific; for example, the ribonuclease inhibitorRibonuclease inhibitor

Ribonuclease inhibitor is a large, acidic, leucine-rich repeat protein that forms extremely tight complexes with certain rib...
 protein binds to human angiogeninAngiogenin

Angiogenin is a small polypeptide that is implicated in angiogenesis in tumor growth ....
 with a sub-femtomolar dissociation constantDissociation constant

In chemistry and biochemistry, a dissociation constant is a specific type of equilibrium constant that measures the propensi...
 (<10-15 M) but does not bind at all to its amphibian homolog onconase (>1 M). Extremely minor chemical changes such as the addition of a single methyl group to a binding partner can sometimes suffice to nearly eliminate binding; for example, the aminoacyl tRNA synthetaseAminoacyl tRNA synthetase

An aminoacyl tRNA synthetase is an enzyme that catalyzes the esterification of a specific amino acid to its cognate tRNA to form a...
 specific to the amino acid valineValine Overview

Valine is one of the 20 proteinogenic amino acids....
 discriminates against the very similar side chain of the amino acid isoleucineIsoleucine

Isoleucine is one of the 20 basic amino acids, and forms part of the structure of almost all proteins....
.

Proteins can bind to other proteins as well as to small-molecule substrates. When proteins bind specifically to other copies of the same molecule, they can oligomerOligomer

In chemistry, an oligomer consists of a finite number of monomer units, in contrast to a polymer which, at least in principl...
ize to form fibrils; this process occurs often in structural proteins that consist of globular monomers that self-associate to form rigid fibers. Protein-protein interactionProtein-protein interaction

Protein-protein interactions refer to the association of protein molecules and the study of these associations from the pers...
s also regulate enzymatic activity, control progression through the cell cycleCell cycle

The cell cycle, or cell-division cycle , is the series of events in a eukaryotic cell between one cell division and th...
, and allow the assembly of large protein complexProtein complex

A protein complex is a group of two or more associated proteins....
es that carry out many closely related reactions with a common biological function. Proteins can also bind to, or even be integrated into, cell membranes. The ability of binding partners to induce conformational changes in proteins allows the construction of enormously complex signalingCell signaling

Cell signaling is part of a complex system of communication that governs basic cellular activities and coordinates cell acti...
 networks.

Enzymes

The best-known role of proteins in the cell is their duty as enzymeEnzyme

Enzymes are proteins that accelerate, or catalyze, chemical reactions....
s, which catalyzeCatalysis

In chemistry and biology, catalysis is the acceleration of a chemical reaction by means of a substance, called a catalyst,...
 chemical reactions. Enzymes are usually highly specific catalysts that accelerate only one or a few chemical reactions. Enzymes carry out most of the reactions involved in metabolismMetabolism

Metabolism is the biochemical modification of chemical compounds in living organisms and cells....
 and catabolismCatabolism

Catabolism is the metabolic process that breaks down molecules into smaller units....
, as well as DNA replicationDNA replication

DNA replication or DNA synthesis is the process of copying a double-stranded DNA strand in a cell, prior to cell divis...
, DNA repairDNA repair

DNA repair refers to a collection of processes by which a cell identifies and corrects damage to the DNA molecules that enco...
, and RNA synthesis. Some enzymes act on other proteins to add or remove chemical groups in a process known as post-translational modification. About 4,000 reactions are known to be catalyzed by enzymes. The rate acceleration conferred by enzymatic catalysis is often enormous - as much as 1017-fold increase in rate over the uncatalyzed reaction in the case of orotate decarboxylase (78 million years without the enzyme, 18 milliseconds with the enzyme).

The molecules bound and acted upon by enzymes are known as substrateSubstrate (biochemistry)

In biochemistry, a substrate is a molecule upon which an enzyme acts....
s. Although enzymes can consist of hundreds of amino acids, it is usually only a small fraction of the residues that come in contact with the substrate, and an even smaller fraction - 3-4 residues on average - that are directly involved in catalysis. The region of the enzyme that binds the substrate and contains the catalytic residues is known as the active siteActive site

The active site of an enzyme is the binding site where catalysis occurs....
. This was first suggested by Emil FischerEmil Fischer

Emil Fischer may refer to:* Emil Fischer , famous German dramatic basso...
 in 1894 that both the enzymeEnzyme

Enzymes are proteins that accelerate, or catalyze, chemical reactions....
 and the substrateSubstrate

Substrate may mean:*Substrate, the material used in the bottom of an aquarium....
 must be geometrically compatible for them to bind and perform a certain task. This is referred to as the Lock and Key Theory.

Cell signaling and ligand transport


Many proteins are involved in the process of cell signalingFacts About Cell signaling

Cell signaling is part of a complex system of communication that governs basic cellular activities and coordinates cell acti...
 and signal transductionSignal transduction

In biology, signal transduction is any process by which a cell converts one kind of signal or stimulus into another....
. Some proteins, such as insulinInsulin

Insulin is a polypeptide hormone that regulates carbohydrate metabolism....
, are extracellular proteins that transmit a signal from the cell in which they were synthesized to other cells in distant tissuesBiological tissue

Biological tissue is a collection of interconnected cells that perform a similar function within an organism....
. Others are membrane proteinMembrane protein

A membrane protein is a protein molecule that is attached to, or associated with the membrane of a cell or an organelle....
s that act as receptorsReceptor (biochemistry)

In biochemistry, a receptor is a protein on the cell membrane or within the cytoplasm or cell nucleus that binds to a specif...
 whose main function is to bind a signaling molecule and induce a biochemical response in the cell. Many receptors have a binding site exposed on the cell surface and an effector domain within the cell, which may have enzymatic activity or may undergo a conformational changeConformational change

In molecular biology, a protein may change its shape in order to undertake a new function; each possible shape is called a conform...
 detected by other proteins within the cell.

Antibodies are protein components of adaptive immune systemAdaptive immune system

The adaptive immune system is composed of highly specialized, systemic cells and processes that eliminate or prevent pathoge...
 whose main function is to bind antigenAntigen

An antigen is a substance that stimulates an immune response, especially the production of antibodies....
s, or foreign substances in the body, and target them for destruction. Antibodies can be secreted into the extracellular environment or anchored in the membranes of specialized B cellB cell

B cells are lymphocytes that play a large role in the humoral immune response as opposed to the cell-mediated immune respons...
s known as plasma cellPlasma cell

Plasma cells are cells of the immune system that secrete large amounts of antibodies....
s. Whereas enzymes are limited in their binding affinity for their substrates by the necessity of conducting their reaction, antibodies have no such constraints. An antibody's binding affinity to its target is extraordinarily high.

Many ligand transport proteins bind particular small biomolecules and transport them to other locations in the body of a multicellular organism. These proteins must have a high binding affinity when their ligandLigand

In chemistry, a ligand is an atom, ion, or molecule that generally donates one or more of its electrons through a coordinat...
 is present in high concentrations, but must also release the ligand when it is present at low concentrations in the target tissues. The canonical example of a ligand-binding protein is hemoglobinHemoglobin

Hemoglobin or haemoglobin is the iron-containing oxygen-transport metalloprotein in the red cells of the blood in mam...
, which transports oxygenOxygen

Oxygen is a chemical element with the chemical symbol O and atomic number 8....
 from the lungLung

The lung is the essential respiration organ in air-breathing vertebrates....
s to other organs and tissues in all vertebrateVertebrate

Vertebrata is a subphylum of chordates, specifically, those with backbones or spinal columns....
s and has close homologHomology (biology)

In biology, two or more structures are said to be homologous if they are alike because of shared ancestry....
s in every biological kingdomKingdom (biology)

In biology, a kingdom or regnum is the top-level, or nearly the top-level, taxon of organisms in scientific classifica...
.

Transmembrane proteinTransmembrane protein

A transmembrane protein is an integral membrane protein that spans from the internal to the external surface of the biologic...
s can also serve as ligand transport proteins that alter the permeabilitySemipermeable membrane

A semipermeable membrane, also termed a selectively permeable membrane, a partially permeable membrane or a d...
 of the cell membrane to small molecules and ions. The membrane alone has a hydrophobic core through which polarChemical polarity

Chemical polarity, also known as bond polarity or just polarity, is a concept in chemistry which describes how e...
 or charged molecules cannot diffuseDiffusion

Diffusion, being the spontaneous spreading of matter , heat, or momentum, is one type of transport phenomenon....
. Membrane proteins contain internal channels that allow such molecules to enter and exit the cell. Many ion channelIon channel

Ion channels are pore-forming proteins that help to establish and control the small voltage gradient that exists across the ...
 proteins are specialized to select for only a particular ion; for example, potassiumPotassium

Potassium is a chemical element. It has the symbol K and atomic number 19....
 and sodiumSodium

Sodium is a chemical element which has the symbol Na , atomic number 11, atomic mass 22.9898 g/mol, oxidation number +1....
 channels often discriminate for only one of the two ions.

Structural proteins

Structural proteins confer stiffness and rigidity to otherwise-fluid biological components. Most structural proteins are fibrous proteinFibrous protein

Fibrous proteins, also called scleroproteins, are long filamentous protein molecules that form one of the two main cla...
s; for example, actinActin Overview

Actin is a globular structural protein that polymerizes in a helical fashion to form an actin filament....
 and tubulinTubulin

A Tubulin is one of several members of a small family of globular proteins....
 are globular and soluble as monomers, but polymerPolymer

Polymer is a term used to describe molecules consisting of structural units and a large number of repeating units connected ...
ize to form long, stiff fibers that comprise the cytoskeletonCytoskeleton

...
, which allows the cell to maintain its shape and size. CollagenCollagen Overview

Collagen is the main protein of connective tissue in animals and the most abundant protein in mammals, making up about 40% o...
 and elastinElastin

Elastin, is a protein in connective tissue that is elastic and allows many tissues in the body to resume their shape after s...
 are critical components of connective tissueConnective tissue

Connective tissue is one of the four types of tissue in traditional classifications It is largely a category of exclusion ra...
 such as cartilageCartilage

Cartilage is a type of dense connective tissue....
, and keratinKeratin

Keratins are a family of fibrous structural proteins; tough and insoluble, they form the hard but nonmineralized structures ...
 is found in hard or filamentous structures such as hairHair

Hair is a filamentous outgrowth from the skin, found mainly in mammals....
, nailsNail (anatomy)

In anatomy, a nail is a horn-like piece at the end of an animal finger or toe. See also claw. ...
, featherFeather

Feathers are one of the epidermal growths that form the distinctive outer covering, or plumage, on birds....
s, hoovesHoof

[Image:HoofRearHooves.jpg|thumb|200px|right|Rear hooves of a horse]]...
, and some animal shellAnimal shell

The hard, rigid outer covering of certain animals is called a shell....
s.

Other proteins that serve structural functions are motor proteins such as myosinMyosin

Myosins are a large family of motor proteins found in eukaryotic tissues....
, kinesinKinesin

Kinesin is a class of motor protein dimer found in biological cells....
, and dyneinDynein

Dynein is a motor protein in cells which converts the chemical energy contained in ATP into the mechanical energy of movemen...
, which are capable of generating mechanical forces. These proteins are crucial for cellular motilityMotility

Motility is a biological term which refers to the ability to move spontaneously and independently....
 of single celled organisms and the spermSpermatozoon

A spermatozoon or spermatozoan , from the ancient Greek spe?a and ??? and more commonly known as a sperm ...
 of many sexually reproducing multicellular organisms. They also generate the forces exerted by contracting muscleMuscle

Muscle is contractile tissue of the body and is derived from the mesodermal layer of embryonic germ cells....
s.

Methods of study


As some of the most commonly studied biological molecules, the activities and structures of proteins are examined both in vitroIn vitro

In vitro refers to the technique of performing a given experiment in a test tube, or, generally, in a controlled enviro...
and in vivoIn vivo Summary

In vivo means that which takes place inside an organism....
. In vitro studies of purified proteins in controlled environments are useful for learning how a protein carries out its function: for example, enzyme kineticsEnzyme kinetics

Enzyme kinetics is the study of the rates of chemical reactions that are catalysed by enzymes....
 studies explore the chemical mechanismReaction mechanism

In chemistry, a reaction mechanism is the step by step sequence of elementary reactions by which overall chemical change occ...
 of an enzyme's catalytic activity and its relative affinity for various possible substrate molecules. By contrast, in vivo experiments on proteins' activities within cells or even within whole organisms can provide complementary information about where a protein functions and how it is regulated.

Protein purification

In order to perform in vitroIn vitro Overview

In vitro refers to the technique of performing a given experiment in a test tube, or, generally, in a controlled enviro...
analysis, a protein must be purified away from other cellular components. This process usually begins with cell lysisCytolysis

Cytolysis is the lysis, or death, of cells due to the rupture of the cell membrane....
, in which a cell's membrane is disrupted and its internal contents released into a solution known as a crude lysateCrude lysate

A crude lysate is the solution produced when cells are destroyed by disrupting their cell membranes in a process known as cy...
. The resulting mixture can be purified using ultracentrifugation, which fractionates the various cellular components into fractions containing soluble proteins; membrane lipidLipid

Lipids are a class of hydrocarbon-containing organic compounds essential for the structure and function of living cells....
s and proteins; cellular organelleOrganelle Overview

In cell biology, an organelle is a discrete structure of a cell having specialized functions....
s, and nucleic acidNucleic acid

A nucleic acid is a complex, high-molecular-weight biochemical macromolecule composed of nucleotide chains that convey genet...
s. PrecipitationPrecipitation (chemistry)

Precipitation is the formation of a solid in a solution during a physical reaction, such as evaporation....
 by a method known as salting outSalting out Overview

Salting out is a method of separating proteins based on the principle that proteins are less soluble at high salt concentrat...
 can concentrate the proteins from this lysate. Various types of chromatographyChromatography

Chromatography is the collective term for a family of laboratory techniques for the separation of mixtures....
 are then used to isolate the protein or proteins of interest based on properties such as molecular weight, net charge and binding affinity. The level of purification can be monitored using various types of gel electrophoresisGel electrophoresis

Gel electrophoresis is a group of techniques used by scientists to separate molecules based on physical characteristics such...
 if the desired protein's molecular weight and isoelectric pointIsoelectric point Overview

The isoelectric point is the pH at which a molecule carries no net electrical charge....
 are known, by spectroscopyFacts About Spectroscopy

Spectroscopy is the study of matter by investigating light, sound, or particles that is emitted, absorbed or scattered by th...
 if the protein has distinguishable spectroscopic features, or by enzyme assayEnzyme assay

Enzyme assays are laboratory methods for measuring enzymatic activity....
s if the protein has enzymatic activity. Additionally, proteins can be isolated according their charge using electrofocusing.

For natural proteins, a series of purification steps may be necessary to obtain protein sufficiently pure for laboratory applications. To simplify this process, genetic engineeringGenetic engineering

Genetic engineering, genetic modification and gene splicing are terms for the process of manipulating genes, us...
 is often used to add chemical features to proteins that make them easier to purify without affecting their structure or activity. Here, a "tag" consisting of a specific amino acid sequence, often a series of histidineHistidine

Histidine is one of the 20 most common natural amino acids present in proteins....
 residues (a "His-tag"), is attached to one terminus of the protein. As a result, when the lysate is passed over a chromatography column containing nickelNickel

Nickel is a metallic chemical element in the periodic table that has the symbol Ni and atomic number 28....
, the histidine residues ligate the nickel and attach to the column while the untagged components of the lysate pass unimpeded.

Cellular localization


The study of proteins in vivo is often concerned with the synthesis and localization of the protein within the cell. Although many intracellular proteins are synthesized in the cytoplasmCytoplasm

Cytoplasm is a jelly-like material that fills cells....
 and membrane-bound or secreted proteins in the endoplasmic reticulumEndoplasmic reticulum

The endoplasmic reticulum or ER is an organelle found in all eukaryotic cells that is an interconnected network of tu...
, the specifics of how proteins are targetedProtein targeting

Protein targeting or protein sorting is the mechanisms by which a cell transports proteins to the appropriate position...
 to specific organelles or cellular structures is often unclear. A useful technique for assessing cellular localization uses genetic engineering to express in a cell a fusion proteinFusion protein

A fusion protein is a protein created through genetic engineering from two or more proteins/peptides....
 or chimeraChimera (protein)

A Chimera is a human-engineered protein that is encoded by a nucleotide sequence made by a splicing together of two or more...
 consisting of the natural protein of interest linked to a "reporterReporter gene Summary

In molecular biology, a reporter gene is a gene that researchers attach to another gene of interest in cell culture, animals...
" such as green fluorescent proteinGreen fluorescent protein

The green fluorescent protein is a protein from the jellyfish Aequorea victoria that fluoresces green when exposed to bl...
 (GFP). The fused protein's position within the cell can be cleanly and efficiently visualized using microscopyMicroscopy

Microscopy is any technique for producing visible images of structures or details too small to otherwise be seen by the huma...
, as shown in the figure opposite. In these cases, additional fluorescent chimeric proteins are generally required to prove the inferred localization.

Other methods for elucidating the cellular location of proteins requires the use of known compartmental markers for regions such as the ER, the Golgi, lysosomes/vacuoles, mitochondria, chloroplasts, plasma membrane, etc. With the use of fluorescently-tagged versions of these markers or of antibodies to known markers, it becomes much simpler to identify the localization of a protein of interest. For example, indirect immunofluorescence will allow for fluorescence colocalization and demonstration of location. Fluorescent dyes are used to label cellular compartments for a similar purpose.

Other possibilities exist, as well. For example, immunohistochemistryImmunohistochemistry

Immunohistochemistry or IHC refers to the process of localizing proteins in cells of a tissue section exploiting the p...
 usually utilizes an antibody to one or more proteins of interest that are conjugated to enzymes yielding either luminescent or chromogenic signals that can be compared between samples, allowing for localization information.

Another applicable technique is cofractionation in sucrose (or other material) gradients using isopycnic centrifugationIsopycnic centrifugation

Isopycnic centrifugation or equilibrium centrifugation is a process used to isolate nucleic acids such as DNA....
. While this technique does not prove colocalization of a compartment of known density and the protein of interest, it does increase the likelihood, and is more amenable to large-scale studies.

Finally, the gold-standard method of cellular localization is immunoelectron microscopy. This technique also uses an antibody to the protein of interest, along with classical electron microscopy techniques. The sample is prepared for normal electron microscopic examination, and then treated with an antibody to the protein of interest that is conjugated to an extremely electro-dense material, usually gold. This allows for the localization of both ultrastructural details as well as the protein of interest.

Through another genetic engineering application known as site-directed mutagenesisSite-directed mutagenesis

Site-directed mutagenesis is a molecular biology technique in which a mutation is created at a defined site in a DNA molecul...
, researchers can alter the protein sequence and hence its structure, cellular localization, and susceptibility to regulation, which can be followed in vivo by GFP tagging or in vitro by enzyme kineticsEnzyme kinetics

Enzyme kinetics is the study of the rates of chemical reactions that are catalysed by enzymes....
 and binding studies.

Proteomics and bioinformatics

The total complement of proteins present at a time in a cell or cell type is known as its proteomeProteome

The term proteome was coined by Mark Wilkins in 1995 and is used to describe the entire complement of proteins in a given b...
, and the study of such large-scale data sets defines the field of proteomicsProteomics

Proteomics is the large-scale study of protein, particularly their structures and functions....
, named by analogy to the related field of genomicsGenomics

Genomics is the study of an organism's genome and the use of the genes....
. Key experimental techniques in proteomics include 2D electrophoresisTwo-dimensional gel electrophoresis

Two-dimensional gel electrophoresis, commonly abbreviated as 2-DE or 2-D electrophoresis, is a form of gel elect...
, which allows the separation of a large number of proteins, mass spectrometryMass spectrometry

Mass spectrometry is an analytical technique used to measure the mass-to-charge ratio of ions....
, which allows rapid high-throughput identification of proteins and sequencing of peptides (most often after in-gel digestionIn-gel digestion Overview

The in-gel digestion is part of the sample preparation for the mass spectrometric identification of proteins in course of pr...
), protein microarrayProtein microarray

A protein microarray is a piece of glass on which different molecules of protein have been affixed at separate locations in ...
s, which allow the detection of the relative levels of a large number of proteins present in a cell, and two-hybrid screeningTwo-hybrid screening

Two-hybrid screening is a molecular biology technique used to discover protein-protein interactions by testing for physical ...
, which allows the systematic exploration of protein-protein interactionFacts About Protein-protein interaction

Protein-protein interactions refer to the association of protein molecules and the study of these associations from the pers...
s. The total complement of biologically possible such interactions is known as the interactomeInteractome Summary

Interactome is the whole set of molecular interactions in cells....
. A systematic attempt to determine the structures of proteins representing every possible fold is known as structural genomicsStructural genomics

Structural genomics consists in the determination of the three dimensional structure of all proteins of a given organism, by...
.

The large amount of genomic and proteomic data available for a variety of organisms, including the human genomeHuman genome

The human genome is the genome of Homo sapiens, which is composed of 24 distinct chromosomes with a total of approximate...
, allows researchers to efficiently identify homologousFacts About Homology (biology)

In biology, two or more structures are said to be homologous if they are alike because of shared ancestry....
 proteins in distantly related organisms by sequence alignmentSequence alignment

In bioinformatics, a sequence alignment is a way of arranging the primary sequences of DNA, RNA, or protein to identify regi...
.