Information about Protein Folding

Enlarge picture
Protein before and after folding.
Protein folding is the physical process by which a polypeptide folds into its characteristic three-dimensional structure.[1] Each protein begins as a polypeptide, translated from a sequence of mRNA as a linear chain of amino acids. This polypeptide lacks any developed three-dimensional structure (the left hand side of the neighboring figure). However each amino acid in the chain can be thought of having certain 'gross' chemical features. These may be hydrophobic, hydrophilic, or electrically charged, for example. These interact with each other and their surroundings in the cell to produce a well-defined, three dimensional shape, the folded protein (the right hand side of the figure), known as the native state. The resulting three-dimensional structure is determined by the sequence of the amino acids.[2] The mechanism of protein folding is not completely understood.

Experimentally determining the three dimensional structure of a protein is often very difficult and expensive. However the sequence of that protein is often known. Therefore scientists have tried to use different biophysical techniques to manually fold a protein. That is, to predict the structure of the complete protein from the sequence of the protein.

For many proteins the correct three dimensional structure is essential to function.[3] Failure to fold into the intended shape usually produces inactive proteins with different properties (details found under prion). Several neurodegenerative and other diseases are believed to result from the accumulation of misfolded (incorrectly folded) proteins.[4]

Known facts about the process

The relationship between folding and amino acid sequence

The amino-acid sequence (or primary structure) of a protein predisposes it towards its native conformation or conformations. It will fold spontaneously during or after synthesis. While these macromolecules may be regarded as "folding themselves", the mechanism depends equally on the characteristics of the cytosol, including the nature of the primary solvent (water or lipid), the concentration of salts, the temperature, and molecular chaperones.

Most folded proteins have a hydrophobic core in which side chain packing stabilizes the folded state, and charged or polar side chains on the solvent-exposed surface where they interact with surrounding water molecules. It is generally accepted that minimizing the number of hydrophobic sidechains exposed to water is the principal driving force behind the folding process,[5] although a recent theory has been proposed which reassesses the contributions made by hydrogen bonding.[6]

The process of folding in vivo often begins co-translationally, so that the N-terminus of the protein begins to fold while the C-terminal portion of the protein is still being synthesized by the ribosome. Specialized proteins called chaperones assist in the folding of other proteins.[7] A well studied example is the bacterial GroEL system, which assists in the folding of globular proteins. In eukaryotic organisms chaperones are known as heat shock proteins. Although most globular proteins are able to assume their native state unassisted, chaperone-assisted folding is often necessary in the crowded intracellular environment to prevent aggregation; chaperones are also used to prevent misfolding and aggregation which may occur as a consequence of exposure to heat or other changes in the cellular environment.

For the most part, scientists have been able to study many identical molecules folding together en masse. At the coarsest level, it appears that in transitioning to the native state, a given amino acid sequence takes on roughly the same route and proceeds through roughly the same intermediates and transition states. Often folding involves first the establishment of regular secondary and supersecondary structures, particularly alpha helices and beta sheets, and afterwards tertiary structure. Formation of quaternary structure usually involves the "assembly" or "coassembly" of subunits that have already folded. The regular alpha helix and beta sheet structures fold rapidly because they are stabilized by intramolecular hydrogen bonds, as was first characterized by Linus Pauling. Protein folding may involve covalent bonding in the form of disulfide bridges formed between two cysteine residues or the formation of metal clusters. Shortly before settling into their more energetically favourable native conformation, molecules may pass through an intermediate "molten globule" state.

The essential fact of folding, however, remains that the amino acid sequence of each protein contains the information that specifies both the native structure and the pathway to attain that state. This is not to say that identical amino acid sequences always fold similarly. Conformations differ based on environmental factors as well; similar proteins fold differently based on where they are found. Folding is a spontaneous process independent of energy inputs from nucleoside triphosphates. The passage of the folded state is mainly guided by hydrophobic interactions, formation of intramolecular hydrogen bonds, and van der Waals forces, and it is opposed by conformational entropy, which must be overcome by extrinsic factors such as chaperones.

Disruption of the native state

In certain solutions and under some conditions proteins will not fold into their biochemically functional forms. Temperatures above (and sometimes those below) the range that cells tend to live in will cause proteins to unfold or "denature" (this is why boiling makes the white of an egg opaque). High concentrations of solutes, extremes of pH, mechanical forces, and the presence of chemical denaturants can do the same. A fully denatured protein lacks both tertiary and secondary structure, and exists as a so-called random coil. Under certain conditions some proteins can refold; however, in many cases denaturation is irreversible.[8] Cells sometimes protect their proteins against the denaturing influence of heat with enzymes known as chaperones or heat shock proteins, which assist other proteins both in folding and in remaining folded. Some proteins never fold in cells at all except with the assistance of chaperone molecules, which either isolate individual proteins so that their folding is not interrupted by interactions with other proteins or help to unfold misfolded proteins, giving them a second chance to refold properly. This function is crucial to prevent the risk of precipitation into insoluble amorphous aggregates.

Incorrect protein folding and neurodegenerative disease

Misfolded proteins are responsible for prion-related illnesses such as Creutzfeldt-Jakob disease, bovine spongiform encephalopathy (mad cow disease), amyloid-related illnesses such as Alzheimer's Disease, and a number of other forms of proteopathy such as cystic fibrosis. These diseases are associated with the multimerization of misfolded proteins into insoluble, extracellular aggregates and/or intracellular inclusions; it is not clear whether the plaques are the cause or merely a symptom of illness.

Kinetics and the Levinthal Paradox

The entire duration of the folding process varies dramatically depending on the protein of interest. The slowest folding proteins require many minutes or hours to fold, primarily due to proline isomerizations or wrong disulfide bond formations, and must pass through a number of intermediate states, like checkpoints, before the process is complete.[9] On the other hand, very small single-domain proteins with lengths of up to a hundred amino acids typically fold in a single step.[10] Time scales of milliseconds are the norm and the very fastest known protein folding reactions are complete within a few microseconds.[11]

The Levinthal paradox[12] observes that if a protein were to fold by sequentially sampling all possible conformations, it would take an astronomical amount of time to do so, even if the conformations were sampled at a rapid rate (on the nanosecond or picosecond scale). Based upon the observation that proteins fold much faster than this, Levinthal then proposed that a random conformational search does not occur in folding, and the protein must, therefore, fold by a directed process.

Techniques for studying protein folding

Modern studies of folding with high time resolution

The study of protein folding has been greatly advanced in recent years by the development of fast, time-resolved techniques. These are experimental methods for rapidly triggering the folding of a sample of unfolded protein, and then observing the resulting dynamics. Fast techniques in widespread use include ultrafast mixing of solutions, photochemical methods, and laser temperature jump spectroscopy. Among the many scientists who have contributed to the development of these techniques are Heinrich Roder, Harry Gray, Martin Gruebele, Brian Dyer, William Eaton, Sheena Radford, Chris Dobson, Sir Alan R. Fersht and Bengt Nölting.

Energy landscape theory of protein folding

The protein folding phenomenon was largely an experimental endeavor until the formulation of energy landscape theory by Joseph Bryngelson and Peter Wolynes in the late 1980s and early 1990s. This approach introduced the principle of minimal frustration, which asserts that evolution has selected the amino acid sequences of natural proteins so that interactions between side chains largely favor the molecule's acquisition of the folded state. Interactions that do not favor folding are selected against, although some residual frustration is expected to exist. A consequence of these evolutionarily selected sequences is that proteins are generally thought to have globally "funneled energy landscapes" (coined by José Onuchic) that are largely directed towards the native state. This "folding funnel" landscape allows the protein to fold to the native state through any of a large number of pathways and intermediates, rather than being restricted to a single mechanism. The theory is supported by computational simulations of model proteins and has been used to improve methods for protein structure prediction and design.

Computational prediction of protein tertiary structure

De novo or ab initio techniques for computational protein structure prediction is related to, but strictly distinct from, studies involving protein folding. Molecular Dynamics (MD) is an important tool for studying protein folding and dynamics in silico. Because of computational cost, ab initio MD folding simulations with explicit water are limited to peptides and very small proteins. MD simulations of larger proteins remain restricted to dynamics of the experimental structure or its high-temperature unfolding. In order to simulate long time folding processes (beyond about 1 microsecond), like folding of small-size proteins (about 50 residues) or larger, some approximations or simplifications in protein models need to be introduced. An approach using reduced protein representation (pseudo-atoms representing groups of atoms are defined) and statistical potential is not only useful in protein structure prediction, but is also capable of reproducing the folding pathways.[13]

Because of the many possible ways of folding, there can be many possible structures. A peptide consisting of just five amino acids can fold into over 100 billion possible structures. [1]

Techniques for determination of protein structure

The determination of the folded structure of a protein is a lengthy and complicated process, involving methods like X-ray crystallography and NMR. One of the major areas of interest is the prediction of native structure from amino-acid sequences alone using bioinformatics and computational simulation methods.

See also

References

1. ^ Alberts, Bruce; Alexander Johnson, Julian Lewis, Martin Raff, Keith Roberts, and Peter Walters (2002). "The Shape and Structure of Proteins", Molecular Biology of the Cell; Fourth Edition. New York and London: Garland Science. ISBN 0-8153-3218-1. 
2. ^ Anfinsen C (1972). "The formation and stabilization of protein structure". Biochem. J. 128 (4): 737-49. PMID 4565129. 
3. ^ Jeremy M. Berg, John L. Tymoczko, Lubert Stryer; Web content by Neil D. Clarke (2002). "3. Protein Structure and Function", Biochemistry. San Francisco: W.H. Freeman. ISBN 0-7167-4684-0. 
4. ^ Science of Folding@Home (July 18, 2005). Retrieved on 2007-04-22.
5. ^ Pace C, Shirley B, McNutt M, Gajiwala K (1996). "Forces contributing to the conformational stability of proteins". FASEB J. 10 (1): 75-83. PMID 8566551. 
6. ^ Rose G, Fleming P, Banavar J, Maritan A (2006). "A backbone-based theory of protein folding". Proc. Natl. Acad. Sci. U.S.A. 103 (45): 16623-33. PMID 17075053. 
7. ^ Lee S, Tsai F (2005). "Molecular chaperones in protein quality control". J. Biochem. Mol. Biol. 38 (3): 259-65. PMID 15943899. 
8. ^ Shortle D (1996). "The denatured state (the other half of the folding equation) and its role in protein stability". FASEB J. 10 (1): 27-34. PMID 8566543. 
9. ^ P.S. Kim & R.L. Baldwin (1990). "Intermediates in the folding reactions of small proteins". Annu. Rev. Biochem. 59: 631-660. 
10. ^ S.E. Jackson (Aug 1998). "How do small single-domain proteins fold?". Fold. Des. 3: R81-R91. ISSN 1359-0278. 
11. ^ J. Kubelka, et al. (2004). "The protein folding "speed limit"". Curr. Opin. Struct. Biol. 14: 76-88. DOI:10.1016/j.sbi.2004.01.013. 
12. ^ C. Levinthal (1968). "Are there pathways for protein folding?". J. Chim. Phys. 65: 44-45. 
13. ^ Kmiecik S and Kolinski A (2007). "Characterization of protein-folding pathways by reduced-space modeling". Proc. Natl. Acad. Sci. U.S.A. 104 (30): 12330-12335. PMID 17636132. 

External links

Peptides (from the Greek πεπτίδια, "small digestibles") are short polymers formed from the linking, in a defined order, of α-amino acids.
..... Click the link for more information.
Protein structure, from primary to quaternary structure.]] Biochemistry refers to four distinct aspects of a protein's structure:
  • Primary structure - the amino acid sequence of the peptide chains.

..... Click the link for more information.
Peptides (from the Greek πεπτίδια, "small digestibles") are short polymers formed from the linking, in a defined order, of α-amino acids.
..... Click the link for more information.
Messenger Ribonucleic Acid (mRNA) is a molecule of RNA encoding a chemical "blueprint" for a protein product. mRNA is transcribed from a DNA template, and carries coding information to the sites of protein synthesis: the ribosomes.
..... Click the link for more information.
amino acid is a molecule that contains both amine and carboxyl functional groups. In biochemistry, this term refers to alpha-amino acids with the general formula H2NCHRCOOH, where R is an organic substituent.
..... Click the link for more information.
hydrophobicity (from the combining form of water in Attic Greek hydro- and for fear phobos) refers to the physical property of a molecule (known as a hydrophobe) that is repelled from a mass of water [1].
..... Click the link for more information.
Hydrophile, from the Greek (hydros) "water" and φιλια (philia) "friendship," refers to a physical property of a molecule that can transiently bond with water (H2O) through hydrogen bonding.
..... Click the link for more information.
native state of a protein is its operative or functional form. All protein molecules are simple unbranched chains of amino acids, but it is by assuming a specific three-dimensional shape that they are able to perform their biological function.
..... Click the link for more information.
Biophysics (also biological physics) is an interdisciplinary science that applies the theories and methods of physics to questions of biology.

Biophysics research today is comprised of a lot of specific biological studies, which don't share a unique identifying
..... Click the link for more information.
Prion Diseases (TSEs)
Classification & external resources

ICD-10 A81
ICD-9 046

A prion (IPA: /ˈpriːɒn/[1]
..... Click the link for more information.
Neurodegenerative disease (Greek νέυρο-, néuro-, "nerval" and Latin dēgenerāre, "to decline" or "to worsen") is a condition in which cells of the brain and spinal cord are lost.
..... Click the link for more information.
disease is an abnormal condition of an organism that impairs bodily functions. In human beings, "disease" is often used more broadly to refer to any condition that causes discomfort, dysfunction, distress, social problems, and/or death to the person afflicted, or similar problems
..... Click the link for more information.
primary structure of a biological molecule is the exact specification of its atomic composition and the chemical bonds connecting those atoms (including stereochemistry). For a typical unbranched, un-crosslinked biopolymer (such as a molecule of DNA, RNA or typical intracellular
..... Click the link for more information.
Translation is the second process of protein biosynthesis (part of the overall process of gene expression). Translation occurs in the cytoplasm where the ribosomes are located. Ribosomes are made of a small and large subunit which surrounds the mRNA.
..... Click the link for more information.
macromolecule implies large molecule. In the context of science and engineering, the term may be applied to conventional polymers and biopolymers (such as DNA) as well as non-polymeric molecules with large molecular mass such as lipids or macrocycles.
..... Click the link for more information.
The cytosol (cf. cytoplasm, which also includes the organelles) is the internal fluid of the cell, and a portion of cell metabolism occurs here. Proteins within the cytosol play an important role in signal transduction pathways and glycolysis.
..... Click the link for more information.
A solvent is a liquid that dissolves a solid, liquid, or gaseous solute, resulting in a solution. The most common solvent in everyday life is water. Most other commonly-used solvents are organic (carbon-containing) chemicals. These are called organic solvents.
..... Click the link for more information.
Water is a common chemical substance that is essential to all known forms of life.[1] In typical usage, water refers only to its liquid form or state, but the substance also has a solid state, ice, and a gaseous state, water vapor.
..... Click the link for more information.
Lipids can be broadly defined as any fat-soluble (hydrophobic), naturally-occurring molecules. The term is more-specifically used to refer to fatty-acids and their derivatives (including tri-, di-, and monoglycerides and phospholipids) as well as other fat-soluble sterol-containing
..... Click the link for more information.
Salt is a mineral essential for animal life, composed primarily of sodium chloride. Salt for human consumption is produced in different forms: unrefined salt (such as sea salt), refined salt (table salt), and iodized salt.
..... Click the link for more information.
trillion fold).]]

Temperature is a physical property of a system that underlies the common notions of hot and cold; something that is hotter generally has the greater temperature. Temperature is one of the principal parameters of thermodynamics.
..... Click the link for more information.
chaperones are proteins that assist the non-covalent folding/unfolding and the assembly/disassembly of other macromolecular structures, but do not occur in these structures when the latter are performing their normal biological functions.
..... Click the link for more information.
hydrophobic effect is the property that nonpolar molecules tend to form intermolecular aggregates in an aqueous medium and analogous intramolecular interactions.[1][2] The name arises from the combination of water in Attic Greek hydro-
..... Click the link for more information.
Chemical polarity, also known as bond polarity or simply polarity, is a concept in chemistry which describes how equally bonding electrons are shared between atoms.
..... Click the link for more information.
In vivo (Latin: (with)in the living) means that which takes place inside an organism. In science, in vivo refers to experimentation done in or on the living tissue of a whole, living organism as opposed to a partial or dead one.
..... Click the link for more information.
Translation is the second process of protein biosynthesis (part of the overall process of gene expression). Translation occurs in the cytoplasm where the ribosomes are located. Ribosomes are made of a small and large subunit which surrounds the mRNA.
..... Click the link for more information.
The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) refers to the end of a protein or polypeptide terminated by an amino acid with a free amine group (-NH2).
..... Click the link for more information.
The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal end, or COOH-terminus) of a protein or polypeptide is the end of the amino acid chain terminated by a free carboxyl group (-COOH).
..... Click the link for more information.
Protein biosynthesis (synthesis) is the process in which cells build proteins. The term is sometimes used to refer only to protein translation but more often it refers to a multi-step process, beginning with amino acid synthesis and transcription which are then used for translation.
..... Click the link for more information.
A ribosome is a small, dense, functional structure found in most known cells that assemble proteins and polypeptides used in cell division. It catalyses the assembly of individual amino acids into polypeptide chains by reading messenger RNAs and binding amino acids that are
..... Click the link for more information.


This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.
Herod_Archelaus


page counter