Documente Academic
Documente Profesional
Documente Cultură
PROTEIN STRUCTURE
Note: this document represent a more detailed discussion of information given in the syllabus. I. CHARACTERISTICS OF PROTEINS In conjunction with their diversity in function, there are many structures of proteins. Nevertheless, several general features are shared. A. Composition and Size Proteins are complex macromolecules of high molecular weight, ranging from 5,000 to several million Daltons. They are composed of carbon, hydrogen, nitrogen, oxygen, and usually sulfur; some contain small amounts of other elements. Proteins may be simple or may be conjugated with other non-protein substances (for example, with lipids to form lipoproteins). B. The Amino Acids Proteins are polymers; the monomeric units are amino acids. Most proteins are formed from the same set of 20 amino acids. A few proteins contain non-standard amino acids that are derived from the standard set. Proteins contain from about 50 to several thousand amino acid residues. The characteristics and structures of amino acids are important to understanding protein function and many disease processes.
H H3 N+ C R
O C O-
All the amino acids except glycine are chiral and therefore proteins are chiral. The chiral amino acids contain one or more chiral carbons, in which four different substituents are attached to the chiral center. Amino acids are normally present in the L configuration; some amino acids of the D configuration are found in the cell walls of some bacteria and other sources. Note that the D or L refers to configuration in relation to reference compounds such as glyceraldehyde. By another convention, chiral compounds are called R (Latin: rectus, right) or S !%# !%# (Latin: sinister, left) R R compounds. Again, R and S H H refer to the spatial C C configuration. A chiral compound is optically active in that a solution containing the compound can rotate the plane of polarized light.
-OOC !$# NH3+ !"# +H3N !"# COO!$#
Fig. 1 The -amino acid. R refers to the side chain that distinguishes the amino acids.
The carbons are labeled using Greek letters starting at the first or carbon. Attached to the carbon is an amino group and an acid group; hence the name -amino acid. The carbons in the side-chain, R, are labeled serially with Greek letters ( , , etc.). Sometimes amino acids are written in the un-ionized form (for example, the NH3+ is shown as NH2, and COO- is shown as COOH).
S amino acid (or L) R amino Acid (or D) Fig. 2 The chiral character of amino acids.
Rules for assigning the value to substituents for a chiral molecule are as follows: Of the four atoms attached to an asymmetric carbon, the atom of higher atomic number is assigned a higher value. If two atoms are the same, comparison of the next atoms attached to each is made. A double (or triple) bond counts as two (or three) of the atomic number for the attached atom. To determine the configuration of the amino acid alanine (R=CH3), for example, values are assigned to the atoms that are attached to the asymmetric carbon: NH2-1, COOH-2, R-3, and H-4. Envision the molecule as a steering wheel with H-4 in back. Go from 1 to 2 to 3. If the steering wheel turns to the right, it is an R amino acid; if it turns to the left, it is an S amino acid. The key to deciding the polar character of the amino acids is to examine their atomic structure. The parts of amino acids with hydrocarbons (CH, CH2, CH3) are hydrophobic; the parts with oxygens and nitrogens are hydrophilic. There are different ways to classify the amino acids. Here they are classified by their chemistry. They are sometimes grouped by their partition coefficient between an organic solvent (chloroform) and water, with more hydrophilic amino acids dissolving more readily in water. They can also be classified by their appearance in globular proteins, with more polar amino acids appearing at the surface. 1. Amino acids with mostly nonpolar side-chains Alanine
H -OOC C +NH3 CH3
-OOC
Valine
H C +NH3 CH CH3 CH3
Leucine
H -OOC C +NH3 C H2 CH CH3 CH3
Isoleucine
H -OOC C +NH3 CH3 C H C H2 CH3
Methionine
H -OOC C +NH3 C H2 C S H2 CH3
Phenylalanine
H -OOC C +NH3 C H2
Tryptophan
H -OOC C C H2 +NH3 HC C
Tyrosine
H -OOC C +NH3 C H2 OH
N H
Proline
-OOC +H 2 N
H C CH 2 CH 2 C H2
Note that tryptophan and tyrosine have polar portions of their side-chains and that the nonpolar portions of glycine and alanine are small. 2. Polar with neutral side-chains (at pH 7). Threonine
H -OOC C +NH3 OH C H CH3 -OOC
Serine
H C +NH3 C OH H2
Asparagine
H -OOC C +NH3 C C H2 NH2
Glutamine
H -OOC O C +NH3 C C H2 H2 C O NH2
Note the relationship of aspartate to asparagine and that of glutamate to glutamine. Lysine
H -OOC C H3N+ CH2 CH2 CH2 CH2 NH3+
Arginine
H -OOC C H3N+ CH2 CH2 CH2 N H C NH2+ NH2
Histidine
4.
-OOC
Cysteine
H C +NH3 C SH H2
S S
Cysteine behaves as nonpolar when unS charged, and polar when charged (usually above pH 8). Under oxidizing conditions (typically outside the cell), cys- Fig. 3 Cysteine residues can form disulfide bonds between polypeptides. Each SS indicates the linkage between teine residues form covalent bonds the sulfurs of two cysteines. through disulfide bridges. The disulfide can cross link different chains or within the same chain, and imparts increased stability to protein structures. 5. Glycine, has a small side-chain that allows it to get into tight spaces.
H -OOC C +NH3 H
SS
C. The Peptide Bond The amino acids of a protein are linked by peptide or amide bonds in a head-to-tail fashion to form one or more chains of amino acids. A chain of amino acids is called a polypeptide. The amino acid sequence, determined by the mRNA sequence is specific for each protein. Note that the formation of the peptide bond between an amino and a carboxyl group has eliminated their charges. Only the most amino terminal amine and the most carboxyl terminal carboxyl groups are still present:
Peptide bond H
+
H CO2+
+
H CO2+
O C N H
H C R2 CO2+ H2O
H3N
C R1
H3N
C R2
H3N
C R1
Fig. 4 Formation of the peptide bond. The carboxyl group of one amino acid combines with the amino group of another.
The functional properties existing in a protein are contained mainly in the R-groups or side-chains. The three-dimensional structure or conformation of proteins is also determined by the properties of the amino acid residues. Each protein has a unique amino acid sequence and a distinct three-dimensional structure. Some chains of amino acids will form a protein domain that is globular. These proteins, which include enzymes and transport proteins, are usually watersoluble. Non-polar amino acids predominate on the inside of globular water-soluble proteins; polar ones predominate on the outside. Other chains of amino acids can form aggregates and are relatively insoluble in many aqueous solutions. These fibrous proteins are often elongated and include proteins such as collagen, which is abundant in skin. The polarity of the side-chain of an amino acid can dictate its role within a protein. A watersoluble protein has mostly amino acids with hydrophilic (water-loving) side-chains on the surface interacting with water. The interior of such proteins is hydrophobic (water-fearing) with the non-polar side-chains interacting with each other. On the other hand, the part of a protein that penetrates a membrane is composed mostly of amino acids with hydrophobic side-chains (membranes are hydrophobic). Likewise, regions of proteins found associated with negatively charged molecules (such as DNA) have surface amino acids that are positively charged. Note that the atoms N, O, and S can accept hydrogen bonds, whereas hydrogens attached to these atoms (NH, OH, SH) can donate hydrogen bonds. Proteins are rich in hydrogen bonds both the side-chains of polar B-chain amino acids and the peptide bond are involved. Hydrogen bonds form when a hydrogen atom is shared between two electronegative atoms. The strength of a hydrogen bond is distance dependent, and maximum strength occurs at short distances. The strongest hydrogen bonds contain atoms that are collinear (in the same line). The local environment may also influence the strength of a hydrogen bond; nonpolar regions favor hydrogen bond formation.
X H X
A-chain Fig. 5. Model of insulin. The hydrogen bonds are shown as dotted lines.