Information about Secondary Structure
A representation of the 3D structure of the myoglobin protein. Alpha helices are shown in colour, and random coil in white, there are no beta sheets shown. This protein was the first to have its structure resolved by X-ray crystallography by Max Perutz and Sir John Cowdery Kendrew in 1958, which led to them receiving a Nobel Prize in Chemistry in 1962.
Secondary structure is formally defined by the hydrogen bonds of the biopolymer, as observed in an atomic-resolution structure. In proteins, the secondary structure is defined by patterns of hydrogen bonds between backbone amide groups (sidechain-mainchain and sidechain-sidechain hydrogen bonds are irrelevant), where the DSSP definition of a hydrogen bond is used. In nucleic acids, the secondary structure is defined by the hydrogen bonding between the nitrogenous bases.
The hydrogen bonding is correlated with other structural features, however, which has given rise to less formal definitions of secondary structure. For example, residues in protein helices generally adopt backbone dihedral angles in a particular region of the Ramachandran plot; thus, a segment of residues with such dihedral angles is often called a "helix", regardless of whether it has the correct hydrogen bonds. Many other less formal definitions have been proposed, often applying concepts from the differential geometry of curves, such as curvature and torsion. Least formally, structural biologists solving a new atomic-resolution structure will sometimes assign its secondary structure "by eye" and record their assignments in the corresponding PDB file.
The rough secondary-structure content of a biopolymer (e.g., "this protein is 40% α-helix and 20% β-sheet.") can often be estimated spectroscopically. For proteins, a common method is far-ultraviolet (far-UV, 170-250 nm) circular dichroism. A pronounced double minimum at 208 and 222 nm indicate α-helical structure, whereas a single minimum at 204 nm or 217 nm reflects random-coil or β-sheet structure, respectively. A less common method is infrared spectroscopy, which detects differences in the bond oscillations of amide groups due to hydrogen-bonding. Finally, secondary-structure contents may be estimated accurately using the chemical shifts of an unassigned NMR spectrum.
Secondary structure was introduced by Kaj Ulrik Linderstrom-Lang in the 1952 Lane medical lectures at Stanford.
Proteins
Secondary structure in proteins consists of local inter-residue interactions mediated by hydrogen bonds. The most common secondary structures are alpha helices and beta sheets. Other helices, such as the 310 helix and π helix, are calculated to have energetically favorable hydrogen-bonding patterns but are rarely if ever observed in natural proteins except at the ends of α helices due to unfavorable backbone packing in the center of the helix. Other extended structures such as the polyproline helix and alpha sheet are rare in native state proteins but are often hypothesized as important protein folding intermediates. Tight turns and loose, flexible loops link the more "regular" secondary structure elements. The random coil is not a true secondary structure, but is the class of conformations that indicate an absence of regular secondary structure.Amino acids vary in their ability to form the various secondary structure elements. Proline and glycine are sometimes known as "helix breakers" because they disrupt the regularity of the α helical backbone conformation; however, both have unusual conformational abilities and are commonly found in turns. Amino acids that prefer to adopt helical conformations in proteins include methionine, alanine, leucine, glutamate and lysine ("MALEK" in amino-acid 1-letter codes); by contrast, the large aromatic residues (tryptophan, tyrosine and phenylalanine) and
-branched amino acids (isoleucine, valine, and threonine) prefer to adopt β-strand conformations. However, these preferences are not strong enough to produce a reliable method of predicting secondary structure from sequence alone.
The DSSP code
The DSSP code is frequently used to describe the protein secondary structures with a single letter code. DSSP is an acronym for "Dictionary of Protein Secondary Structure", which was the title of the original article actually listing the secondary structure of the proteins with known 3D structure (Kabsch and Sander 1983). The secondary structure is assigned based on hydrogen bonding patterns as those initially proposed by Pauling et al. in 1951 (before any protein structure had ever been experimentally determined).
- G = 3-turn helix (310 helix). Min length 3 residues.
- H = 4-turn helix (alpha helix). Min length 4 residues.
- I = 5-turn helix (pi helix). Min length 5 residues.
- T = hydrogen bonded turn (3, 4 or 5 turn)
- E = beta sheet in parallel and/or anti-parallel sheet conformation (extended strand). Min length 2 residues.
- B = residue in isolated beta-bridge (single pair beta-sheet hydrogen bond formation)
- S = bend (the only non-hydrogen-bond based assignment)
DSSP H-bond definition
Secondary structure is defined by hydrogen bonding, so the exact definition of a hydrogen bond is critical. The standard H-bond definition for secondary structure is that of DSSP, which is a purely electrostatic model. It assigns charges of to the carbonyl carbon and oxygen, respectively, and charges of to the amide nitrogen and hydrogen, respectively. The electrostatic energy is
is less than -0.5 kcal/mol. Although the DSSP formula is a relatively crude approximation of the physical H-bond energy, it is generally accepted as a tool for defining secondary structure.
Protein secondary-structure prediction
Early methods of secondary-structure prediction were based on the helix- or sheet-forming propensities of individual amino acids, sometimes coupled with rules for estimating the free energy of forming secondary structure elements. Such methods are typically ~60% accurate in predicting which of the three states (helix/sheet/coil) a residue adopts. A significant increase in accuracy (to nearly ~80%) was made by exploiting multiple sequence alignment; knowing the full distribution of amino acids that occur at a position (and in its vicinity, typically ~7 residues on either side) throughout evolution provides a much better picture of the structural tendencies near that position. For illustration, a given protein might have a glycine at a given position, which by itself might suggest a random coil there. However, multiple sequence alignment might reveal that helix-favoring amino acids occur at that position (and nearby positions) in 95% of homologous proteins spanning nearly a billion years of evolution. Moreover, by examining the average hydrophobicity at that and nearby positions, the same alignment might also suggest a pattern of residue solvent accessibility consistent with an α-helix. Taken together, these factors would suggest that the glycine of the original protein adopts α-helical structure, rather than random coil. Several types of methods are used to combine all the available data to form a 3-state prediction, including neural networks, hidden Markov models and support vector machines. Modern prediction methods also provide a confidence score for their predictions at every position.Secondary-structure prediction methods are continuously benchmarked, e.g., in the EVA experiment. Based on ~270 weeks of testing, the most accurate methods at present are PsiPRED, SAM, PORTER, PROF and SABLE. Interestingly, it does not seem to be possible to improve upon these methods by taking a consensus of them [citation needed]. The chief area for improvement appears to be the prediction of β-strands; residues confidently predicted as β-strand are likely to be so, but the methods are apt to overlook some β-strand segments (false negatives). There is likely an upper limit of ~90% prediction accuracy overall, due to the idiosyncrasies of the standard method (DSSP) for assigning secondary-structure classes (helix/strand/coil) to PDB structures, against which the predictions are benchmarked.
Accurate secondary-structure prediction is a key element in the prediction of tertiary structure, in all but the simplest (homology modeling) cases. For example, a confidently predicted pattern of six secondary structure elements βαββαβ is the signature of a ferredoxin fold.
Nucleic acids
Nucleic acids also have secondary structure, most notably single-stranded RNA molecules. RNA secondary structure is generally divided into helices (contiguous base pairs), and various kinds of loops (unpaired nucleotides surrounded by helices). The stem-loop structure in which a base-paired helix ends in a short unpaired loop is extremely common and is a building block for larger structural motifs such as cloverleaf structures, which are four-helix junctions such as those found in transfer RNA. Internal loops (a short series of unpaired bases in a longer paired helix) and bulges (regions in which one strand of a helix has "extra" inserted bases with no counterparts in the opposite strand) are also frequent. Finally, both pseudoknots and base triples are present in RNA (though not DNA). Extremely complex DNA secondary structures have been designed and produced in DNA origami and other products of DNA nanotechnology.Since it is almost entirely base pair-mediated, RNA secondary structure can be said to define which bases are paired in a molecule or complex. However, the traditional Watson-Crick base pair is not the only type of pairing that is permissible in RNA; Hoogsteen base pairing is also common.
RNA secondary structure prediction
See also RNA structureOne application of bioinformatics uses predicted RNA secondary structures in searching a genome for noncoding but functional forms of RNA. For example, microRNAs have canonical long stem-loop structures interrupted by small internal loops. A general method of calculating probable RNA secondary structure is dynamic programming, although this has the disadvantage that it cannot detect pseudoknots or other cases in which base pairs are not fully nested. More general methods are based on stochastic context-free grammars. A web server that implements a type of dynamic programming is Mfold.
For many RNA molecules, the secondary structure is highly important to the correct function of the RNA — often more so than the actual sequence. This fact aids in the analysis of non-coding RNA sometimes termed "RNA genes". RNA secondary structure can be predicted with some accuracy by computer (for example, the RNAsoft web server), and many bioinformatics applications use some notion of secondary structure in analysis of RNA.
Alignment
Both protein and RNA secondary structures can be used to aid in multiple sequence alignment. These alignments can be made more accurate by the inclusion of secondary structure information in addition to simple sequence information. This is sometimes less useful in RNA because base pairing is much more highly conserved than sequence. Distant relationships between proteins whose primary structures are unalignable can sometimes be found by secondary structure.See also
- Folding (chemistry)
- primary structure
- tertiary structure
- quaternary structure
- translation
- structural motif
References
- C Branden and J Tooze (1999). Introduction to Protein Structure 2nd ed. Garland Publishing: New York, NY.
- W. Kabsch and C. Sander. Dictionary of Protein Secondary Structure: Pattern Recognition of Hydrogen Bonded and Geometrical Features. Biopolymers 22: 2577-2637 (1983). http://dx.doi.org/10.1002/bip.360221211
- M. Zuker "Computer prediction of RNA structure", Methods in Enzymology, 180:262-88 (1989). (The classic paper on dynamic programming algorithms to predict RNA secondary structure.)
- L. Pauling and R.B Corey. Configurations of polypeptide chains with favored orientations of the polypeptide around single bonds: Two pleated sheets. Proc. Natl. Acad. Sci. Wash., 37:729-740 (1951). (The original beta-sheet conformation article.)
- L. Pauling, R.B. Corey and H.R. Branson. Two hydrogen-bonded helical configurations of the polypeptide chain. Proc. Natl. Acad. Sci. Wash., 37:205-211 (1951). (alpha- and pi-helix conformations, since they predicted that helices would not be possible.)
- (1983) "Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features". Biopolymers 22: 2577-2637. PMID 6667333.
External links
| Protein secondary structure | ||
|---|---|---|
| Helices: | α-helix | 310 helix | π-helix | β-helix | Polyproline helix | Collagen helix | |
| Extended: | β-strand | Turn | Beta hairpin | Beta bulge | α-strand | |
| Supersecondary: | Coiled coil | Helix-turn-helix | EF hand | |
| Secondary structure propensities of amino acids | ||
| Helix-favoring: | Methionine | Alanine | Leucine | Glutamic acid | Glutamine | Lysine | |
| Extended-favoring: | Threonine | Isoleucine | Valine | Phenylalanine | Tyrosine | Tryptophan | |
| Disorder-favoring: | Glycine | Serine | Proline | Asparagine | Aspartic acid | |
| No preference: | Cysteine | Histidine | Arginine | |
| ←Primary structure | Tertiary structure→ | |
Biochemistry is the study of the chemical processes in living organisms.[1] The word "biochemistry" comes from the Greek word βιοχημεία biochēmeia, which means "the chemistry of life.
..... Click the link for more information.
..... Click the link for more information.
Structural biology is a branch of molecular biology concerned with the study of the architecture and shape of biological macromolecules—proteins and nucleic acids in particular—and what causes them to have the structures they have.
..... Click the link for more information.
..... Click the link for more information.
Biopolymers are a class of polymers produced by living organisms. Starch, proteins and peptides, DNA, and RNA are all examples of biopolymers, in which the monomer units, respectively, are sugars, amino acids, and nucleic acids.
..... Click the link for more information.
..... Click the link for more information.
Proteins are large organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid residues.
..... Click the link for more information.
..... Click the link for more information.
A nucleic acid is a macromolecule composed of nucleotide chains. In biochemistry these molecules carry genetic information or form structures within cells. The most common nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA).
..... Click the link for more information.
..... Click the link for more information.
In biochemistry and chemistry, the tertiary structure of a protein or any other macromolecule is its three-dimensional structure, as defined by the atomic coordinates.[1]
..... Click the link for more information.
Relationship to primary sequence
..... Click the link for more information.
hydrogen bond is a special type of dipole-dipole bond that exists between an electronegative atom and a hydrogen atom bonded to another electronegative atom. This type of bond always involves a hydrogen atom, thus the name.
..... Click the link for more information.
..... Click the link for more information.
In protein structure, the DSSP algorithm is the standard method for assigning secondary structure to the amino acids of a protein, given the atomic-resolution coordinates of the protein.
..... Click the link for more information.
..... Click the link for more information.
dihedral or torsion angle.
The dihedral angle of two planes can be seen by looking at the planes "edge on", i.e., along their line of intersection. The dihedral angle between two planes denoted A and B is the angle between their two normal unit vectors and
..... Click the link for more information.
The dihedral angle of two planes can be seen by looking at the planes "edge on", i.e., along their line of intersection. The dihedral angle between two planes denoted A and B is the angle between their two normal unit vectors and
..... Click the link for more information.
Ramachandran plot (also known as a Ramachandran map or a Ramachandran diagram), developed by Gopalasamudram Narayana Ramachandran, is a way to visualize dihedral angles φ against ψ of amino acid residues in protein structure.
..... Click the link for more information.
..... Click the link for more information.
In mathematics, differential topology is the field dealing with differentiable functions on differentiable manifolds. It arises naturally from the study of the theory of differential equations. Differential geometry is the study of geometry using differential calculus (cf.
..... Click the link for more information.
..... Click the link for more information.
In mathematics, curvature is any of a number of loosely related concepts in different areas of geometry. Intuitively, curvature is the amount by which a geometric object deviates from being flat, but this is defined in different ways depending on the context.
..... Click the link for more information.
..... Click the link for more information.
- In mathematics, the term torsion usually has one of two unrelated meanings. The first is geometrical, measuring curvature of some sort. The second is algebraic, relating to the set of elements that vanish (e.g., elements of finite order).
..... Click the link for more information.
Spectroscopy is the study of the interaction between radiation (electromagnetic radiation, or light, as well as particle radiation) and matter. Spectrometry is the measurement of these interactions and an instrument which performs such measurements is a spectrometer or
..... Click the link for more information.
..... Click the link for more information.
Circular dichroism (CD) is a form of spectroscopy based on the differential absorption of left- and right-handed circularly polarized light. It can be used to determine the structure of macromolecules (including the secondary structure of proteins and the handedness of DNA).
..... Click the link for more information.
..... Click the link for more information.
Infrared (IR) radiation is electromagnetic radiation of a wavelength longer than that of visible light, but shorter than that of radio waves. The name means "below red" (from the Latin infra, "below"), red being the color of visible light with the longest wavelength.
..... Click the link for more information.
..... Click the link for more information.
In nuclear magnetic resonance (NMR), the chemical shift describes the dependence of nuclear magnetic energy levels on the electronic environment in a molecule.[1][2][3]
..... Click the link for more information.
..... Click the link for more information.
Nuclear magnetic resonance (NMR) is a physical phenomenon based upon the quantum mechanical magnetic properties of an atom's nucleus. NMR also commonly refers to a family of scientific methods that exploit nuclear magnetic resonance to study molecules.
..... Click the link for more information.
..... Click the link for more information.
Kaj Ulrik Linderstrøm-Lang (November 29, 1896 - May 25, 1959) was a Danish protein scientist, who was the director of the Carlsberg Laboratory from 1939 until his death.
..... Click the link for more information.
..... Click the link for more information.
Leland Stanford Junior University, commonly known as Stanford University or simply Stanford, is a private university located approximately 37 miles (60 kilometers) southeast of San Francisco and approximately 20 miles (32 km) northwest of San Jose in Stanford,
..... Click the link for more information.
..... Click the link for more information.
alpha helix (α-helix) is a right-handed coiled conformation, resembling a spring, in which every backbone N-H group donates a hydrogen bond to the backbone C=O group of the amino acid four residues earlier ( hydrogen bonding). (See also helix.
..... Click the link for more information.
..... Click the link for more information.
β sheet (also β-pleated sheet) is the second form of regular secondary structure in proteins — the first is the alpha helix — consisting of beta strands
..... Click the link for more information.
..... Click the link for more information.
310-helix is a type of secondary structure found (rarely) in proteins.
..... Click the link for more information.
Structure
The amino acids in a 310-helix are arranged in a right-handed helical structure. Each amino acid corresponds to a 120° turn in the helix (i.e...... Click the link for more information.
pi helix (or π-helix) is a type of secondary structure found (rarely) in proteins.
..... Click the link for more information.
Standard structure
The amino acids in a standard π-helix are arranged in a right-handed helical structure. Each amino acid corresponds to a 87° turn in the helix (i.e...... Click the link for more information.
In proteins, a left-handed polyproline II helix (PPII, poly-Pro II) is formed when sequential residues all adopt (φ,ψ) backbone dihedral angles of roughly (-75°, 150°) and have trans isomers of their peptide bonds.
..... Click the link for more information.
..... Click the link for more information.
The alpha sheet (also known as an alpha pleated sheet or a polar pleated sheet) is a hypothetical secondary structure in proteins, first proposed by Linus Pauling and Robert Corey in 1951.
..... Click the link for more information.
..... Click the link for more information.
native state of a protein is its operative or functional form. All protein molecules are simple unbranched chains of amino acids, but it is by assuming a specific three-dimensional shape that they are able to perform their biological function.
..... Click the link for more information.
..... Click the link for more information.
Protein folding is the physical process by which a polypeptide folds into its characteristic three-dimensional structure.[1] Each protein begins as a polypeptide, translated from a sequence of mRNA as a linear chain of amino acids.
..... Click the link for more information.
..... Click the link for more information.
A turn is an element of secondary structure in proteins.
According to the most common definition, a turn is defined by the close approach of two atoms (
..... Click the link for more information.
According to the most common definition, a turn is defined by the close approach of two atoms (
..... Click the link for more information.
A random coil is a polymer conformation where the monomer subunits are oriented randomly while still being bonded to adjacent units. It is not one specific shape, but a statistical distribution of shapes for all the chains in a population of macromolecules.
..... Click the link for more information.
..... Click the link for more information.
This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.
Herod_Archelaus