Documente Academic
Documente Profesional
Documente Cultură
Email:
hyu@uow.edu.au
Location: 18.G26
Consultation Times:
Tuesday 2:30-4:30pm
Email appointment preferred
References:
Introduction to Protein Structure, Branden & Tooze, 2nd Ed,
Garland Science 1999
Library eReading
Any classical Biochemistry textbooks (Stryer, Lehninger, Voet &
Voet etc.)
Lecture materials and past exam papers.
CHEM320: Bioinformatics
Hierarchy of Protein Structures (L1)
Haibo Yu
School of Chemistry
University of Wollongong
Building 18, Room G26
hyu@uow.edu.au
JWCL281_c05_082-128.qxd
6/3/10
10:00 AM
Page 95
Proteins
Why Do We Care?
95
DNA
Transcription
handling. Before 1960, when this was first realized, the measured molecular masses of DNA were no higher than !10 million D (!15 kb, where
1 kb ! 1 kilobase
pair ! 1000 bp).
3
UOW CHEM320 2016, Haibo Yu
DNA fragments of uniform molecular mass and as small as a
few hundred base pairs may be generated by shear degrading
DNA in a controlled manner; for instance, by forcing the
Protein
RNA
Translation
Why Do We Care?
Objectives (L1-2)
NH3 2
NH3
COO
COO
H2C
the highest to the lowest
priority
is
+
N
C
COO
N+
L isomer
D isomer
COO
counterclockwise.
configuration
is
HThe
2
FIGURE 3.9 Cyclic structure of proline.
H2
called R, from
the
Latin
rectus
for
FIGURE 3.4 The
refers chain
to the side
H L and D isomers of amino acids.
TheRside
is chain.
joined to both the !
Prolineright, if theTheprogression
L and D isomersis
areclockwise.
mirror images of each other.
7552dc03_41-76 4/17/01 7:22 AM Page 44
(Pro, P)
carbon and the amino group.
H2
C
Proline, Pro, P
L: CORN
Only
acids
are constituents
ofalmost
natural
proteins!
OnlyLLamino
amino acids
are constituents
of proteins. For
all amino
acids,
the
L
isomer
has
S
(rather
than
R)
absolute
configuration
(Figure
3.5).
Al+
+
No
satisfactory
explanation
has
been
arrived
at
why
NH3
NH3
COO
COO
though considerable effort has gone into understanding why amino acids in
Three amino acids with relatively simple aromaticamino
side chains
partconfiguration,
of
acidthis
inare
natural
proteins have
this absolute
proteins
have
absolute
no satisfactory
explanation has
e.g.
been
arrived
at.
It
seems
plausible
that
the
selection
of
L over D was arbithe fundamental
as
its
name
indiL isomer
D isomer repertoire (Figure 3.10). Phenylalanine,
configurations.
trary but, once made, was fixed early in evolutionary history.
Glycine
primary
cates,R=H:
contains
a phenyl ring attached in place of one
of the hydrogens of
carboxylic acid
Amino acids in solution at neutral pH exist predominantly as dipolar
FIGURE
3.4group
The L and D isomers of amino acids.R=CH
R refers
to the side chain.
(1)
3: Alanine
amino
alanine.
The
aromatic
ring
of
tyrosine
contains
a
hydroxyl
This hyions
(also group.
called zwitterions).
In the dipolar form, the amino group is protongroup
Question:
#
"Which amino acid is not chiral?
The L and D isomers are mirror images
of each other.
NH
ated (NH
and the carboxyl
3 )chains
droxyl
group is reactive, in contrast with the rather inert
side
of thegroup is deprotonated (COO ). The ionization
state of an amino acid varies with pH (Figure 3.6). In acid solution
UOW CHEM320 2016, Haibo Yu
9
UOW CHEM320 2016, Haibo
Yu
10
other amino acids discussed thus far. Tryptophan has(e.g.,
anpH
indole
ring group
joined
1), the amino
is protonated (NH3") and the carboxyl group
Only L amino acids are constituents oftoproteins.
For
almost
all
amino
acids,
is
not
dissociated
(COOH).
As the pH is raised, the carboxylic acid is the
a methylene (CH2) group; the indole group comprises two fused rings
first
group
to
give
up
a
proton,
inasmuch as its pKa is near 2. The dipolar
FIGURE 3.5 Onl
the L isomer has S (rather than R) absolute
configuration
3.5). Aland an
NH group. (Figure
Phenylalanine
is purely hydrophobic,
whereas
tyrosine
form persists
until the
pH approaches 9, when the protonated amino group
found in protein
(3)
R
though considerable effort has gone into
understanding
why
amino
acids
in
44
acids have an S a
and tryptophan are less so because of their hydroxyl and NH groups. The
(from the Latin si
proteins have this
absolute
configuration,
no satisfactory
explanation
has
rings of tryptophan
and
tyrosine contain delocalized
! electrons
The counterclockw
1-2
Genes
and aromatic
Proteins
R
R H (4)
R
H
H
H
H
H
arrow from highe
been arrived at. It seems plausible thatthat
the strongly
selectionabsorb
of L over
D was arbiultraviolet
light (Figure 3.11).
substituents indic
H N
COO
COOH
H N
H N
COO
trary but, once made, was fixed early in evolutionary
history.
center is of the S
A
compounds
extinction
coefficient
indicates
its
ability
to
absorb
There
is
a
linear
relationship
between
the
DNA
base
sequence
Hlight. of a
H
mRNA codon -> amino acid
Amino acids in solution at neutralBeers
pH exist
as amino-acid
dipolar
(2)
and the
sequence
of thewavelength:
protein
it encodes
law predominantly
gives gene
the absorbance
(A) of light
at a(1)
given
C
position form, the amino group is protonions (also called zwitterions). In the2nddipolar
The genetic
hereditary
information from
into proteins.
+
Ist position
# code is the formula that converts
COOgenes
NH3
ated (NH3") and the
(COO
).
The
ionU group
C Ais deprotonated
G 3rd(3'position
(5' carboxyl
end)
end)
Every amino acid in a protein is represented by a codon consisting of three
consecutive
Zwitterionic
form
Both groups
deprotonated
Ser with
Tyr
Cys
U
ization state of an amino acidPhevaries
pH (Figure
3.6).
In acid
solution
nucleotides
in the
gene. DNA contains four different nucleotides, with the bases adenine (A),
Phe
Ser
Tyr
Cys
C"
(G), thymidine
(T) and cytosine (C), whose sequence in a gene spells out the sequence
(e.g., pH 1), the amino group Leu
is protonated
(NHA3 ) andguanine
the carboxyl
group
Ser STOP STOP
Leu
Ser STOP Trp
G
of
the
amino
acids
in
the
protein that it specifies: this is the primary structure
of the protein.
Both groups
is not dissociated (COOH). As the pH is raised, the carboxylic acid is the
protonated
Leu
Pro
His
Arg
U
The nucleotide sequence of the DNA is transcribed into messenger RNA (mRNA),
with uridine
first group to give up a proton,
as its pK
near
2. The
dipolar
Leu inasmuch
Pro
His
Arg
C a is (U)
FIGURE 3.6 Ion
FIGURE
3.5
Only
L amino acids
are
replacing
thymine
(T).
Figure
1-4
shows
the
correspondence
between
the
64 possible
Leu
Pro
Gln
Arg
A
function of pH.
form persists until the pH approaches
9, when
the
found
in
proteins.
Almost
all
L amino
Leu
Pro
Gln
Arg
three-baseamino
codons ingroup
mRNA and the
20
naturally
occurring
amino
acids.
Some
amino
acids
G protonated
These groups are important in hydrophobic interactions;
amino acids is alt
2 by as many
4
6 different
8
10
12
14
The zwitterionic fo
acids have
configuration
are specified by only one codon, whereas
othersancanS absolute
be0specified
as six
Ile
Thr
Asn
Ser
U
Glycine
enables
the
peptide
bond
backbone
to
adopt
more
physiological pH.
pH
Ile
Thr
Asn
Ser
C
(fromThere
the Latin
sinister
meaning
left).
codons:
the
genetic
code
is
degenerate.
are
three
codons
that
do
not
code
for
amino
Ile
Thr
Lys
Arg
A
conformations.
Thepolypeptide
counterclockwise
direction
theprocess by which
acids, but signalR the termination of the
chain (stop
codons).ofThe
Met
Thr
LysR Arg
G
R
H+
H+
H
H sequence of the DNA
the nucleotide
is first
transcribed
RNA and then
into acid side chain: Ala scans
arrow
from
to lowest-priority
Val
Ala H Asp
Gly
Alanine
ishighestoften into
considered
as a translated
null amino
U
Val
Ala
Asp
Gly
C
protein
is
outlined
in
Figure
1-5.
substituents
indicates
that
the
chiral
are used to probe structure-function relationships in proteins.
+H N
Val+H Ala
Glu COO
Gly
A
COOH
H2N
COO
3N
3
44
Glycine
(Gly, G)
Alanine
(Ala, A)
7552dc03_41-76
4/17/01
7:22 AM
Page 44
H H
H+
Amino
acids
COO
H3N
Glycine
(Gly, G)
Valine
(Val, V)
Leucine H
(Leu, L)
CH
Methionine
(Met, M)
COO
CH3
+H
COO
3N
Alanine
(Ala, A)
HC
CH3
H3C
+H N
3
CH3
CH3
H2C
H2C
CH3
*
C H
Methionine
H
CH2
Isoleucine H
(Ile, I)
COO
+H N
3
Isoleucine
(Ile, I)
H 3N
CH3
H3C
CH3
COO
Alanine
(Ala, A)
Leucine
(Leu, L)
Concentration
U
C
A
G
CH3
Glycine
(Gly, G)
COO
H3N
COO
3N
+H N
3
H H
COO
Valine
( Val, V )
+H
Alanine
(Ala,
C A) COO
+H N
3
CH3
COO
(Met, M)
COO
+H N
3
CH2
+H
COO
3N
CH3
CH3
CH3
Val
Ala
Abbreviations
Glu
Gly
Codons
H+
CH3
CH3
CH2
CH2
CH3
CH2
Valine
Leucine
Isoleucine
12
COO
Methionine
7552dc03_41-76
4/17/01
7:22 AM
Page 46
JWCL281_c09_278-322.qxd
HC
H O * CH3
C
H
H
CH2
COO
+H N
3
OH
46
COO
+H N
3
2/24/10
1:17 PM
HC
CH2
COO
HN
Page 285
C
C
H
+H N
3
COO
Tyrosine
(Tyr, Y)
Phenylalanine
(Phe, F)
+H N
3
Phenylalanine
(Phe, F)
20,000
10,000
10,000
OH
5,000
A ! !cl
Tryptophan
(Trp, W)
Serine
(Ser, S)
Threonine
(Thr, T)
8,000
4,000
2,000
Trp
6,000
Tyr
0
220
240
Beers law
+H N
3
CH2
COO
CH2
COO
+H N
3
HC
HO
CH
H
C
HN
Threonine
(Thr, T)
Serine
(Ser, S)
CH
CH
Cysteine
(Cys, C)
47
A Repertoire of 20 10,000
Amino Acids
+H N
3
COO
CH3
H
Glutamine
*
(Gln, Q) C H
H
+H N
3
OH
OH
CH2
+H N
3
COO
COO
CH3
+H N
3
COO
Serine
(Ser, S)
Threonine
(Thr, T)
H2N
8,000
We turn now to amino acids with very polar side chains that render
them
S
H
highly hydrophilic. Lysine and arginine have relatively long side chains
that
CH2
H Lysine
terminate with groups that are positively charged at neutral pH.
6,000 is
capped by a primary amino group and arginine by a guanidinium
group.
+H N
COO
3
Histidine contains an imidazole group, an aromatic ring that also
can be posCysteine 4,000
itively charged (Figure 3.14).
(Cys, C)
10
SH
CH2
+H N
3
Trp
COO
"1
"1
2,000
"1
Tyr
We turn now to amino acids with very polar side chains that render them
0
highly hydrophilic. Lysine and arginine have relatively
chains
220
240long side
260
280 that
300
These residues
and
(nm)
terminate
with groups are
that hydrophilic
are positively charged
atcan
neutralWavelength
pH. Lysine
is
capped by a primary amino group and arginine by a guanidinium group.
form hydrogen bonds (What is a HB?).
Histidine contains an imidazole group, an aromatic ring that also can be positively charged (Figure 3.14).
theC concentration
of a protein
C O of tryptophan and
NH2 in solution if the number
O
tyrosine residues in Othe Cprotein is known. H C
H2C
2
Two amino acids, serine and threonine, contain aliphatic hydroxyl groups
"1
"1
atom fromwhere
a molecule
or a molecular
fragment XH in which X is
! is the extinction coefficient
[in units that are the
reciprocals of molarity and distance in centimeters (M
more electronegative
than
H,
and
an
atom
or a group of atoms in the
cm )], c is the concentration of the absorbing species (in
units of molarity, M), and l is the length through which
same or athedifferent
molecule,
which there is evidence of bond
light passes (in units
of centimeters). in
For tryptophan,
absorption is maximum at 280 nm and the extinction
formation.coefficient is 3400 M cm whereas, for tyrosine, absorption is maximum at 276 nm and the extinction coefInternational
of Pure and Applied Chemistry
ficient is a less-intense
1400 M cm . Union
Phenylalanine
HB Acceptor
320
HB
HB Donor
15
10
"1
O
HAsparagine
CHN)
2
(Asn,
H
320
Glutamate
(Glu, E)
20
300
20
Phe
50
HC
C
C
C
make them much more hydrophilic (water loving) and reactive than alanineHC C C CH2
C
CH2
CH2
H
FIGURE 3.10 Amino acids with
H
H
and valine. Threonine, like isoleucine, contains
anchains.
additional
asymmetric
aromatic side
Phenylalanine,
13
UOW CHEM320
14
+H N
+H N 2016,
+H N
C
COO
C Haibo
COO Yu
C
COO
c. Pulsed H/D Exchange Provides Structural Details
3
3
3
tyrosine,
and
tryptophan
have
hydrophobic
center; again only one isomer is present in proteins.
on How Proteins Fold
character. Tyrosine and tryptophan also
H
H
H
Pulsed H/D exchange, a method devised by Walter EngCysteine is structurally similar to
serine
contains
a sulfhydryl,
or thiol
havebut
hydrophilic
properties
because of
lander and Robert Baldwin, is the only known technique
Tryptophan
Phenylalanine
Tyrosine
SH
(Trp, W)
(Phe, F)
(Tyr, Y)
their OH and NH groups, respectively.
S
that can follow the time course of individual residues in a
(SH), group
H in place of the hydroxyl (OH) group (Figure 3.13). The
folding protein. Weakly acidic protons ( H), such as those
CH
2
sulfhydryl group
Pairs of sulfhydryl groups may
CHis
2 much more reactive.
of amine and hydroxyl groups (XH) , exchange with
H
those of water, a process known as hydrogen exchange that
UOW7:22
CHEM320
Haibo
4/17/01
AM 2016,
Page
47 Yu
e 49
100
COO
HC
CH
200
CH2
+H N
3
H
C
H
C
500
Section 9-
280 nm
HN
1,000
280
2,000
Wavelength (nm)
41-76
260
C
H
CH3
O
H
+H N
CH
2
C
COO
3
H
COO
CH2
C
H
Tyrosine
(Tyr, Y)
40,000
Tryptophan
(Trp, W)
C
H
CH2
C
C
H
+H N
3
16
Figure 9-11 C
Polypeptides in
conformations w
of known X-ray
absorption spec
a small differenc
Wetlaufer, D.B.,
Proteins bear n
those of its bac
are engaged i
solvent and, m
protein are no
Through th
H/D exchange
tein folding. Th
disulfide bond
ride or urea in
peptide nitrog
ing is then initi
the denaturant
neously lower
neutrality, hyd
OH$ and, ther
After a preset
(using a third
called labeling
tide nitrogen a
drogen bonds
that are hydro
hydrogen exch
(10 to 40 ms),
lowering the p
lowed to go to
changeable site
2D proton NM
signed). By rep
at position 93; the other is the site at which an O2 molecule binds. Within this pocket, the accessibility of the
heme group to solvent is highly restricted. This is important for function, because free heme groups in an oxygenated solution are rapidly oxidized from the ferrous
(Fe2!) form, which is active in the reversible binding of
O2, to the ferric (Fe3!) form, which does not bind O2.
Knowledge of the structure of myoglobin allowed
researchers for the first time to understand in detail the
Page 48
eAMtwo
3.4).
Histidine
(His, H)
NH3+
H2C
C
CH
C
CH2
N43
!N
CH
C
C CH3
Fe
CH
CH
(a)
(b)
CH2
O2
H
N
H2C
(3)
C H
+H N
3
CH2
NH3+
and
COO
+
COO
+
FIGURE 3.4 The
NHL3and
The
CH2
H3N
isomer
COO
H3N
pKa=4.0
isomer
NH2
H2N
R refers to the side chain.
isomers of amino acids.
C
NH
CH2
7552dc03_41-76
H C
H+
H
H+
C
N
CH
H
H2N
Imidazole
R
COO
7:22 AM
CH2
C
C
O
ization.
protons near
+H N
Group3
NH2
COO
+H N
3
Acid
O
COO
Typical pK *
Base
3.1
+H N
3
COO
2
18
CH2
CH2
CH2
CH2
+H N
3
COO
Asparagine
Note.
in biological systems, theirGlutamine
pKas can
(Gln, Q)
(Asn, N)
O
Aspartic acid
be
substantially
shifted from its intrinsic
4.1
C
C
H
Glutamic acid
O
O
pKa to
apparent
pKa due to the
FIGURE 3.16 Amino Hacids with side-chain carboxylates
and
carboxamides.
interactions
when
buried in a biomolecular
N
N
6.0
+
Histidine
environment.
N
N
Aspartate
(Asp, D)
O
Glutamate
(Glu,
O E)
pH = log "# H + $%
+ H
H
H
8.0
H
H
glutamateasparagine
and glutamineeach of which contains a terminal
H
Cysteine
8.3
S
S
carboxamide in place
of a carboxylic
acid (Figure 3.16).
H
O
O
Tyrosine
Seven of the 20 amino acids have 10.9
readily ionizable side chains. These
7 amino acids are able
to donate or accept protons to facilitate reactions
H
N
N
Lysine
10.8
H
H
as well as to form ionic
bonds. Table
3.1 gives equilibria and typical pKa
H
H
H of the sideH chains of tyrosine, cysteine, arginine, lyvalues for ionization
N H
N
H
sine, histidine, Hand
glutamic
acids in proteins. Two other
Arginine
12.5
N C aspartic Nand
C
N H
N H
groups in proteinsthe
terminal
!-amino
group and the terminal !H
H
carboxyl groupcan be ionized, and typical pKa values are also included
pK values depend on temperature, ionic strength, and the microenvironment of the
ionizable
group.
in Table
3.1.
Amino
acids are often designated by either a three-letter abbreviation or
UOW CHEM320 2016, Haibo Yu
a one-letter symbol (Table 3.2). The abbreviations for amino acids are the
first three letters of their names, except for asparagine (Asn), glutamine
"# A $%
10
[ HA ]
amino acids is altered by a change in pH. If the pH is 2 units below the pKa, how much will be protonated?
The zwitterionic form predominates near if the pH is equal to the pKa, how much will be protonated?
If the pH is 2 units above the pKa, how much will be protonated?
physiological pH.
"#A $%!"H #$
Ka =
[ AH ]
NH2
CH2
50
With a pKa value near 6, the imidazole group can be uncharged or positively charged near neutral pH, depending on its local environment (Figure
3.15). Indeed, histidineZwitterionic
is often found
in the active sites
of enzymes, where
form
Both groups
a
deprotonated
the imidazole ring can bind and release protons in the
course of enzymatic
reactions.
The set of aminoBoth
acids
also contains two with acidic side chains: aspargroups
tic acid and glutamicprotonated
acid (Figure 3.16). These amino acids are often called
10
FIGURE 3.6 Ionization state as a
aspartate and glutamate to emphasize that their side chains are usually negfunction
of
pH.
The
ionization
state
of
atively charged at physiological pH. Nonetheless, in some proteins these
amino acids is altered by a change in pH.
side0 chains 2do accept
and this
ability
impor4 protons,
6
8
10 is often
12 functionally
14
The zwitterionic form predominates near
pH. = pK
tant. In addition, the set includes
and pH
pH uncharged derivatives of aspartate physiological
+ log
Concentration
Page 50
!!
!
A + H +
HA
!
H2N
pK = log10 K a
CH
pKa=4.5
O
O C
4/17/01
17
Salt bridge
COO
Only L amino
acids are constituents of proteins.
For almost all amino
N acids,
CH2
CH
+2
COO
CH
NH3has
the L isomer
S (rather than R) absolute configuration (Figure
HC 3.5). AlCH
CH
(3)
R
2
2
though
considerable
effort has
acids
in
UOW
CHEM320
2016, Haibo
Yu gone into understanding why amino
C
N
proteins haveCHthis absolute configuration,
CH2
CH2no satisfactory explanation has
2
H (4)
been arrived at. It seems plausible that the selection of L over D was arbiH3N inCevolutionary
COO
C made,
COO was fixed+early
trary+H
but,
history. +H3N C COO
3N once
Amino acids in solution at neutral pH exist predominantly as dipolar
(2)
C
(1)
H
H
H
ions (also called zwitterions). In the dipolar form, the amino group is proton+
#
"
Histidine
Lysine
COO
NH3
ated
(NH(Lys,
) K)
andOnly
the carboxyl
groupArginine
is
deprotonated
(COO
).
The
ionFIGURE
L amino
acids
are
3 3.5
(His, H)
(Arg, R)
ization
an amino acid
varies with
(Figure 3.6). In acid solution
foundstate
inofproteins.
Almost
all pH
L amino
(e.g., pH 1), the amino group is protonated (NH3") and the carboxyl group
have an(COOH).
S absolute
configuration
isacids
not dissociated
As the
pH is raised, the carboxylic acid is the
first
group
give upsinister
a proton,meaning
inasmuch asleft).
its pKa is near 2. The dipolar
(from
theto Latin
FIGURE 3.5 Only L amino acids are
form persists until the pH approaches 9, when the protonated amino group
found in proteins. Almost all L amino
Glutamine
(Gln, Q)
NH3+ +
COO
Asparagine
(Asn, N)
Glutamate
(Glu, E)
H
N
N
H
CH2
Guanidinium
H+
C CH3
C
Fe
pKa=6.5
CH2
H2C
C
N
N!
CH2
polar
otonionution
roup
is the
ino acids
polar
ine.
roup
CH3
NH2
CH
C Acids
C
A Repertoire CH
of 20C Amino
HN
C
CH2
CH2
CH3 C
pKa=12
pKa=10.5
Amino acids are the building blocks of proteins. An !-amino acid consists
of a central carbon atom, called the ! carbon, linked to an amino group, a
carboxylic acid group, a hydrogen atom, and a distinctive R group. The R
group is often referred to as the side chain. With four different groups connected to the tetrahedral !-carbon atom, !-amino acids are chiral; the two
mirror-image forms are called the L isomer and the D isomer (Figure 3.4).
H2N
"
CH2
CH2
acids,
. Alds in
n has
arbi-
19
20
1: Properties of AA: pI
pK1
pK #
1
Protein
2.19
pK2
!
3
(a)
Methyl-substituted
carboxyl and
amino groups
Glycine
(Gly, G)
CH3
CH3
COOH
Alanine
(Ala, A)
H!
CH3 NH3
COO#
CH3 NH2
shared
properties ofof
many
aminomolecules
acids permit some
H
H
Hydrophobic Effect: based The
on
the
tendency
polar
to
Methylamine
Acetic acid
simplifying
generalizations
about
their
acid-base behavThe normal pK for an
The normal pK for a
7552dc03_41-76
4/17/01
7:22
AM
Page
44
exclude non-polar molecules
(oil
and
water
hate
each
amino group
is about
carboxyl
group
is about
4.8.
iors.
First,
all amino
acids
with
a10.6.
singleother).
!-amino group,
a
single
!-carboxyl
group,
and
an
R group that
does not
Hydrophobicity
Index: Hthe relative
hydrophobicity
among
amino
Carboxyl and
H
NH
NH
NH
amino groups
CH
ionize
have
titration
curves
resembling
that
of
glycine
H
H
H
H C COO
C COO
acids. in glycine H C COOH H (Fig.H310).
TheseH Namino
haveH NveryCOOsimilar, alH COO acids
FIGURE
3.7 Structures of glycine and
H
H
H
alanine.!-Amino
(Top) Ball-and-stick
models
!-Amino
acid
(glycine)
acid (glycine)
44
Glycine
Alanine
though
not identical,
pK
a values: pKa of the OCOOH
show the arrangement
of atoms and
pK "
pK " 2.34
(Gly, G)
(Ala,
A) 9.60
!
pKa of the R
3N
COO
H
Alanine
(Ala, A)
Glycine
(Gly, G)
perturbations of pKa are due to intramolecular interactions. Similar efcaused by chemical groups that happen to be positioned
example, in the active site
CH3 of an enzyme.
H
HValine
H
Leucine
Isoleucine
C O
O
representation
(see Chapter 1 Appendix).
C
H
COO
+H N
3
COO
CH2
+H N
3
(Val, V)
+H N
3
C
CH2
H
+
COO
H3N
+H N
3
H
Alanine
(Ala, A)
NH2
H3C
NH2
CH2CH
CH3
Isoleucine
(Ile, I)
H
COO
+
3N
+HHN
3
COO
COO
+H N
3
LeucineO
(Leu, L)
CH2
COO
H3N
CH2
COO
CH2
C
COO
COO
CH3
HC
H
+H
3N
Methionine
(Met, M)
H2C
CH2
COO
H
+H
3N
H2C
CH3
*
C H
COO
+H
3N
CH
Fractions sequentially collected
22
Abbreviation/
symbol
Nonpolar, aliphatic
R groups
Glycine
Gly G
Alanine
Ala A
Titration curves forProline
(a) glutamateProand
P
Valine
Val V
group is designated here
Leucine as pKR. Leu L
Isoleucine
Ile I
Methionine
Met M
Aromatic R groups
Phenylalanine
Phe F
Tyrosine
Tyr Y
Tryptophan
Trp W
Polar, uncharged
R groups
Serine
Ser S
Threonine
Thr T
Cysteine
Cys C
Asparagine
Asn N
Glutamine
Gln Q
Positively charged
R groups
Lysine
Lys K
Histidine
His H
Arginine
Arg R
Negatively charged
R groups
Aspartate
Asp D
Glutamate
Glu E
3.0
Mr
pK1
(OCOOH)
pK2
(ONH!
3)
pKR
(R group)
Hydropathy
index*
Occurrence in
proteins (%)
5.97
6.01
6.48
5.97
5.98
6.02
5.74
!0.4
1.8
1.6
4.2
3.8
4.5
1.9
7.2
7.8
5.2
6.6
9.1
5.3
2.3
pI
75
89
2.34
2.34
9.60
9.69
117
131
131
149
2.32
2.36
2.36
2.28
9.62
9.60
9.68
9.21
165
181
204
1.83
2.20
2.38
9.13
9.11
9.39
10.07
5.48
5.66
5.89
2.8
!1.3
!0.9
3.9
3.2
1.4
105
119
121
132
146
2.21
2.11
1.96
2.02
2.17
9.15
9.62
10.28
8.80
9.13
8.18
5.68
5.87
5.07
5.41
5.65
!0.8
!0.7
2.5
!3.5
!3.5
6.8
5.9
1.9
4.3
4.2
146
155
174
2.18
1.82
2.17
8.95
9.17
9.04
10.53
6.00
12.48
9.74
7.59
10.76
!3.9
!3.2
!4.5
5.9
2.3
5.1
133
147
1.88
2.19
9.60
9.67
3.65
4.25
2.77
3.22
!3.5
!3.5
5.3
6.3
(b)115
histidine.
1.99 The10.96
Average occurrence in more than 1,150 proteins. From Doolittle, R.F. (1989) Redundancies in protein sequences. In Prediction of Protein Structure and the Principles of Protein Conformation (Fasman, G.D., ed.), pp. 599623, Plenum Press, New York.
CH3
Chromatography
column
*A scale combining hydrophobicity and hydrophilicity of R groups; it can be used to measure the tendency of an amino acid to seek an aqueous
environment (! values) or a hydrophobic environment (" values). See Chapter 11. From Kyte, J. & Doolittle, R.F. (1982) A simple method for
displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105132.
H3C
CH3
High salt
10:20 AM
2.0
Ave. Mw:
110 D
CH3
COO
H2N
Glycine
(Gly, CG) O
H2C
CH2
H
NH2
O Valine
UOW CHEM320
2016, HaiboCH
Yu2
C
O
Methionine
(Met, M)
(Ile, I)
COO
H3N
COO
H3N
CH2
+H N
3
(Leu, L)
(Val, V)
H3N
H2C
CH2
1.0
FIGURE 312
CH3
+H
Sample mixture
Low salt
pK1 #
1.82
(b)
High-salt
elution
buffer
Buffer
pH=8
Buffer
pH=5.8
Amino acid
a
CHAPTER 3 Protein Structure and Function
bondsRepulsion
in space. (Middle)
betweenStereochemically
the amino
realistic
formulas
show
the geometrical
group
and the
departing
proton
arrangement
of bonds
around
atoms (see
7552dc03_41-76 4/17/01 7:22 AM Page 49
lowers the
pKa for
the carboxyl
Chapter
1 Appendix).
(Bottom)
Fischer
group,
and oppositely
charged
groupsshow
lower
pKaas
bybeing
stabiprojections
allthe
bonds
lizing
zwitterion.
perpendicular
forthe
a simplified
representation (see Chapter 1 Appendix).
(d)
(c)
OH" (equivalents)
pKR #
6.0 12/23/03
8885d_c03_078
(b)
Low-salt
elution
buffer
pK2 #
9.17
8
pH 6
Buffer
pH=5.8
Buffer
pH=8
10 Histidine
Page 136
"
2:25 PM
"
2/22/10
(a)
Pepsin
#1.0
Egg albumin
4.6
1.0
2.0
3.0
0
Serum
albumin "
4.9
OH (equivalents)
Urease
5.0
"-Lactoglobulin
5.2
Hemoglobin
6.8 COO
COO
COOH
COO
7.0
H N CH
H N CH
CH Myoglobin
H N CH
CH
CH
CH
CH Chymotrypsinogen
9.5
H
H
H
H
N
N
N
C
C cN
C
C
Cytochrome
10.7
CH pK
CH pK
CH pK
Lysozyme C N
11.0 C N
C
N
C
N
"
H3N
136
pI
JWCL281_c06_129-162.qxd
6
Isoelectric Points
TABLE
36pKRThe
#
of Some Proteins
4.25
4
pH
Protein concentration
CH2
COO
23
24
listed in Table 31. Within each class there are gradations of polarity, size, and shape of the R groups.
Nonpolar, Aliphatic R Groups
peptide bonds is an enzymatically controlled process that occurs on the ribosome and is directed
by the mRNA template. Although peptide bond formation can be reversed by the addition of
water (hydrolysis), amide bonds are very stable in water at neutral pH, and the hydrolysis of
peptide bonds in cells is also enzymatically controlled.
water
C
O
Hydrolysis
(protease)
water
amino terminus
(N terminus)
C
O
C
H
carboxyl terminus
(C terminus)
O
R2
122
1.5
116
C
1.3
119.5
1.0
H
R
1.4
111
118.5
Amide
plane
C
O
H
N
ically 1.32
plane, which is between the values expected for
a HCN single bond (1.49 ) and a CPN double bond
(1.27 ), as shown in Figure 3.24. Finally, the peptide
Figure 8-1 The trans-peptide group. The standard dimensions
bond
is uncharged,
allowing polymers of amino acids
(in angstroms, , and degrees, ) of this
planar group
were
derived by averaging the corresponding
quantities
the X-ray bonds to form tightly packed globular
linked
byinpeptide
crystal structures of amino acids and peptides. [After Marsh,
structures.
R.E. and Donohue, J., Adv. Protein Chem.
22, 249 (1967).]
See Kinemage Exercise 3-1
Two configurations are possible for a planar peptide
FIGURE 3.24 Typical bond lengths within a peptide unit.
Main unit
chain
bond. In the trans
configuration, the two !-carbon atoms
The peptide
is shown in the trans configuration.
221
are on opposite sides of the peptide bond. In the cis configuration, these groups are on the same side of the peptide bond. Almost
all peptide bonds in proteins are trans. This preference for trans over cis can
Side chainby the fact that steric clashes between groups attached to the
be explained
!-carbon
atoms hinder
formation
of theofcis form but do not occur in the trans
Figure 8-3 A polypeptide chain in its fully extended
conformation
showing
the planarity
configuration
3.25).
far the most common cis peptide bonds are
each of its peptide groups. [Illustration, Irving Geis.
Image from(Figure
the Irving
Geis By
Collection,
XPro
linkages. Such bonds show less preference for the trans configuration
Howard Hughes Medical Institute. Reprinted with
permission.]
because the nitrogen of proline is bonded to two tetrahedral carbon atoms,
limiting the steric differences between the trans and cis forms (Figure 3.26).
In contrast with the peptide bond, the bonds between the amino group
and the !-carbon atom and between the !-carbon atom and the carbonyl
UOW CHEM320 2016, Haibo Yu
26
group are pure single bonds. The two adjacent rigid peptide units may rotate
about these bonds, taking on various orientations. This freedom of rotation
R1
ior of proteins
and how they
fold to their native
structures very stable in water at neutral pH;
Peptide
Bond
(Amide
Bond):
9.
the lifetime ~ 1,000 years;
1 SECONDARY STRUCTURE
Directionality
is always N-terminus to C-terminus!
A polymers secondary structure (2 structure) is defined
as the local conformation of its backbone. For proteins, this 3.2.2 Polypeptide Chains Are Flexible
has come to mean the specification of regular polypeptide
Yet Conformationally
H
VMD
Prac:
Have a look at the peptide
bond! Restricted
backbone folding patterns: helices, pleated sheets, and
Definitions
ostable Proteins
res
CHAPTER
H
nd NMR Structures
dipole moment: an imaginary vector between two
osition
Synthesis
(ribosome)
-Dimensional
s of Proteins
-76
R2
Page 221
within
a peptide
explains
geometric preference.
bond,
such
as a CC
bond,this
is cylindrically
symmetrical
steric interference, which causes ing
In the 1930s and 1940s,
"1 Linus Pauling and Robert Corey
peptide
bond
about
itsamino
bond
axis,has
so considerable
that we mightdouble-bond
expect such acharacbond to
less The
the cis conformation (Fig. 8-2) to determined
be !8 kJthe
! mol
X-ray structures
of
several
acids
5 Quaternary Structure
and dipeptides
in an effort to ter,
elucidate
the
structural
which
prevents
rotation
thiscase,
bond.
exhibit
free
rotation.
If this about
were the
then in ethane,
stable than the trans conformation (this
energy difference
A. Subunit Interactions
constraints
on the
of
a polypeptide
chain.
B. Symmetry in
for
example, all
torsion angles about the CC bond would
isProteins
somewhat
less inbonds
peptide
bonds
byconformations
a Pro
FIGURE
3.23 Peptide
are planar.
Infollowed
a pair of linked
These studies indicated that the peptide group has a rigid,
C. Determination of Subunit Composition
amino
acids,
sixin
atoms
N, the
H, and
C !residues
) liestructure
in a plane.
be equally
in ethane are
residue
and,
fact,(C!10%
Pro
in(Fig.
proteins
!, C, O, of
planar
8-1), which, Pauling
pointed likely.
out, isHa Yet certain conformations
H
Appendix: Viewing Stereo Pictures
Side
chains
arepeptide
shown asbond,
green whereas
balls.
favored Cdue to Nquantum mechanical
effects
arising from
follow
a cis
cis peptides are otherC
N+
the interactions
ofCits molecular orbitals.
The
wise extremely rare).
C
C staggered
C
O
conformation
180) is ethanes
O R(Fig. 8-5a; torsion angleO(
most stable
whereas
the eclipsed conformaa. Polypeptide Backbone Conformations May Be 1.24
Peptide bond resonance
structures
H arrangement,
tion (Fig. 8-5b; torsion angle ( 0) is least stable. The enDescribed by Their Torsion Angles
123.5
120.5
ergy
difference
between
staggered
and eclipsed
conabove
considerations
they
The properties of aThe
protein
are largely
determined byare
its important because
Peptide
bond
The
inability
of the
bondthe
to rotate
constrains
the conH
C
"1
C
three-dimensional
structure.
One
might
naively
suppose
, a quantity
that
formations
!12 kJ ! mol
indicate that the backbone of a protein is a51linked se- formation
122
of in
theethane
peptideis backbone
and accounts
for the
that since proteins are all composed of the same 20 types
1.
1
1.0groups
H
N+
=180
FIGURE 3.25 Trans and cis peptide
7552dc03_41-76
7:22
Page 55favored
bonds.4/17/01
The trans
formAMis strongly
O
Peptide bond resonance structures
The inability of the bond to rotate constrains the conformation of the peptide backbone and accounts for the
the structural features of
m together, and their hierdouble-bond character is also ex-CH2-NH- bonds planarity.
1.45ThisAmine
mplex structures. This will
pressed
in
the
length
of
the bond between the CO and
he structurefunction re
1.5
-CO-NH- NH groups.
1.32
Amide
end the
1 roles .32 See Kinemage Exercise 3-1
Cbiochemical
The CN distance in a peptide bond is typ1
C
-CH=N- 221ically 1.32
1.25
, whichImine
is between the values expected for
C
a CN single bond (1.49 ) and a CPN double bond
Resonance
- the delocalisation of electrons
several
atoms;
increase
polarity
the
1.24
(1.27over
), as
shown
in Figure
3.24.the
Finally,
theofpeptide
peptide bond in a way that carbonyl oxygen
has
negative
charge
while
amide
nitrogen
has
bond is uncharged, allowing polymers of amino acids
O
positive
charge.
linked by peptide bonds to form tightly packed globular
structures.
Two configurations are possible for a planar peptide
UOW
CHEM320
2016, Haibo
Yu a peptide unit.
27
RE 3.24 Typical
bond
lengths
within
bond. In the trans configuration, the two !-carbon atoms
eptide unit is shown in the trans configuration.
are on opposite sides of the peptide bond. In the cis con-
Cis
Trans
=180
55
Primary Structure
=0
Trans
Cis
about two bonds of each amino acid allows proteins to fold in many different
Isomerisation
0.1-1/sec
(~20
energy
barrier);
ways.
The rotationsrate
about
these bonds
cankcal/mol
be specified
by dihedral
angles
(Figure
The angleinofprotein
rotationfolding;
about the bond between the nitrogen
Rate 3.27).
determining
and the !-carbon atoms is called phi ("). The angle of rotation about the
Catalysed by an enzyme: Proline isomerase;
bond between the !-carbon and the carbonyl carbon atoms is called psi (#).
A clockwise rotation about either bond as viewed from the front of the back
UOW
CHEM320
2016, Haibo
group
corresponds
to aYupositive value. The ! and " angles determine the
path of the polypeptide chain.
Are all combinations of ! and " possible? G. N. Ramachandran recognized that many combinations are forbidden because of steric collisions be-
Dihedral angle
A measure of the rotation about a
bond, usually taken to lie between
#180 and $180. Dihedral angles are
sometimes called torsion angles.
28
7552dc03_41-76
(B)
(A)
N
H
H
C
R
C
O
H
N
O
C
H R
N
H
(a) Staggered
H
H
(C)
(b)
Eclipsed
(a)
(b)
Indicated
thesingle
Ramachandran
Diagram
acid in a polypeptide can be adjusted by rotation
about by
two
bonds. (A)
Phi (!) is the
The sterically allowed values of ! and " can be deterangle of rotation about the bond between the
nitrogen
and
the
%-carbon
atoms,
mined by calculating the distances between whereas
the atoms of a
psi (") is the angle of rotation about the bond
between
the
%-carbon
and
thethe
carbonyl
tripeptide
at all
values
of ! and
" for
central peptide
unit. Sterically
forbidden
as that
carbon atoms. (B) A view down the bond between
the nitrogen
and conformations,
the %-carbon such
atoms,
shown
Fig. 8-6,
are thosethe
in which
any nonbonding
showing how ! is measured. (C) A view down
the inbond
between
%-carbon
and the interatomic distance is less than its corresponding van der Waals
carbonyl carbon atoms, showing how " is measured.
distance. Such information is summarized in a conforma-
1: Ramachandran Plot
Figure 8-7 indicates that 77% of the Ramachandran diagram (most combinations of ! and ") is conformationally
inaccessible to a polypeptide chain. The particular regions
of the Ramachandran diagram that represent allowed conformations depend on the van der Waals radii chosen to
calculate it. But with any realistic set of values, such as that
in Table 8-1, only three small regions of the conformational
map are physically accessible to a polypeptide chain. Nevertheless, as we shall see, all of the common types of regular
secondary structures found in proteins fall within allowed
regions of the Ramachandran diagram. Indeed, the
H
N
C
H H
N
H
Try-Gly-Gly-Phe-Leu
CHAPTERpentapeptide
3 Protein Structure
and Function(YGGFL)4.1 Overview of Protein Structure+180119
shows the sequence from the amino
terminus to
the carboxyl terminus.
This
120
The carbonyl oxygen has a partial negative
O!"
O
O"
charge and the amide nitrogen a partial positive
pentapeptide, Leu-enkephalin, is an opioid
C
C !# C$
C # C$
charge, setting up a small electric dipole.
C$
peptide
that
modulates
the
perception
of
60
Virtually all peptide bonds in proteins occur in
N
C$
C$
N
C$
N
Tyr in
Gly
this trans configuration; an exception is noted
pain. The reverse
pentapeptide,
H
H Leu-PheH
Figure 48b.
0
Amino
Gly-Gly-Tyr (LFGGY), is a different molecule
FIGURE and
3.28shows
A Ramachandran
terminal
no such effects.
residue
diagram
showing the values of ! and
60
O
". Not all # and $R values are possible
Carboxyl
1.24
terminus
without collisions
between atoms. The
120
1.46
1.53
Ca
C
w
R1
most favorable
regions
shownf in dark
FIGURE
3.20 are
Components
of a
O
H
N
H
f
w
f
w
Ca green;
borderline
regions
are
shown
in
180
1.32
polypeptide chain. A polypeptide chain
C 120N 60
C0
180
light
green.
The
structure
on
the
right
is
consists
of
a
constant
backbone
(shown
in
Amino
C
C
N
terminus
H
disfavored
because
steric clashes.
H
black)
and of
variable
side chains (shown in
O
H
R2
green).
NCa
CaC
H H
C
O H2C
H
N
O H2C
Gly
C
H
N
H
Phe
CH3
CH3
H
C
Leu
Carboxyl
terminal residue
R3
H
N
60 C 120 +180
N
H
CN
FIGURE 42 The planar peptide group. (a) Each peptide bond has
some double-bond character due to resonance and cannot rotate.
(b) Three bonds separate sequential ! carbons in a polypeptide
chain. The NOC! and C!OC bonds can rotate, with bond angles
O
designated " and #, respectively. The peptide CON bond is not free
to rotate. Other single bonds in the backbone may also be
rotationally hindered, depending on the size and charge of the R
C
Ca
groups. In the conformation
shown, " and # are 180% (or " 180%).
Dalton
As one looks out from the ! carbon, the # and " angles increase as
A (respectively)
unit of mass
very nearly equal to that
N
the carbonyl or amide nitrogens
rotate clockwise.
(c) By convention, both " andof
# are
as 0% when
the twoNamed after John
a defined
hydrogen
atom.
H
peptide bonds flanking that ! carbon are in the same plane and
Dalton (17661844), who developed
positioned as shown. In a protein, this conformation is prohibited
the atomic
of matter.
by steric overlap between an !-carbonyl
oxygentheory
and an !-amino
hydrogen atom. To illustrate the bonds between atoms, the balls
representing each atom are smaller than the van der Waals radii for
this scale. 1 & 0.1 nm.
HC
R5
O
H
(C= 90, C
= 90)
C
N
C
Disfavored
H
H
O
R4
Ca
good
hydrogen-bond donor. These groups interact with each other and with
3.3 SECONDARY
STRUCTURE:
POLYPEPTIDE
CHAINS
functional groups
from side chains
to stabilize particular
structures, as will
N
CAN FOLD
INTOinREGULAR
STRUCTURES SUCH AS
be
discussed
detail.
O
THE ALPHA
HELIX,
THE BETAchains
SHEET,
ANDbetween
TURNS50 and 2000 amino
C
Most
natural polypeptide
contain
w
residues
and are commonly referred to as proteins. Peptides made of
AND acid
LOOPS
29
Ca
small
numbers of amino acids are called oligopeptides or simply peptides. The
H
f
Can a polypeptide
chain fold
intoofa an
regularly
structure?
In110,
1951,and so the
mean
molecular
weight
amino repeating
acid residue
is about
R
Linus Pauling
and
Robert of
Corey
twobetween
periodic 5500
structures
called We can
molecular
weights
mostproposed
proteins are
and 220,000.
the ! helix
theof"apleated
sheet
(betaispleated
sheet).
Subse-of daltons;
also(alpha
refer helix)
to the and
mass
protein,
which
expressed
in units
quently,one
other
structures
such
theatomic
" turn mass
and omega
loop were
idendalton
is equal
toasone
unit. (#)
A protein
with
a molecular
(c)
tified. Although
periodic,
common
turndaltons,
or loop or
structures
are well30
UOW CHEM320 2016, Haibo Yu
weight ofnot
50,000
has these
a mass
of 50,000
50 kd (kilodaltons).
defined andIn
contribute
with !the
helices
and
" sheets tochain
form the
final proteinThe most
some proteins,
linear
polypeptide
is cross-linked.
Kilodalton (kd)
FIGURE 43 Ramachandran plot for -Ala residues. The
common cross-links are disulfide bonds, formed by the oxidation of a pair of
conformations of peptides are A
defined
the mass
values ofequal
" and #. to 1000 daltons. structure.
unitbyof
Conformations deemed possible are those that involve little or no
#180
cysteine residues (Figure 3.21). The resulting unit of linked cysteines is
steric interference, based on calculations using known van der
L
Exercise 3-1
2:13 PM
b. Allowed Conformations
of Polypeptides
Are
FIGURE 3.27 Rotation about bonds in a polypeptide.
The structure
of each amino
C#N bond (!) and the C#C bond ("). The torsion angles are
both 180 in the conformation shown and increase, as is indicated,
in a clockwise manner when viewed from C#. [Illustration, Irving
Geis. Image from the Irving Geis Collection, Howard Hughes
Medical Institute. Reprinted with permission.]
See Kinemage
12/30/03
H3N
Page 56
The 2016,
only reasonably
UOW CHEM320
Haibo Yufree movements are rotations about the
8:19 PM
H2C
8885d_c04_119
H
H
6/4/01
1: Disulfide Bond
Waals radii and bond angles. The areas shaded dark blue reflect
120
theoverlap
book,
molecular
conformationsthroughout
that involve no steric
andare
thus are
fully
allowed; medium
blue indicates conformations
allowed
at the
modeling-based
tutorials
that
enable you
60
extreme limits for unfavorable atomic contacts; the lightest blue
to review structure and learn what the
area reflects conformations that are permissible if a little flexibility is
0
research
tells ofustheabout
the
workings
allowed in thelatest
bond angles.
The asymmetry
plot results
from
the L stereochemistry
the amino acid residues.
The plots
other
of theof molecule.
To access,
goforto
the
"60
L-amino acid residues with unbranched side chains are nearly
Web site: www.whfreeman.com/biochem5,
identical. The allowed ranges for branched amino acid residues
"120
and
select
the chapter,
Structural
Insights,
such as Val, Ile,
and Thr
are somewhat
smaller than
for Ala. The Gly
residue, whichand
is lessthe
sterically
hindered, exhibits a much broader
title.
"180
"180
range of allowed conformations. The range for Pro residues is
greatly restricted because " is limited by the cyclic side chain to the
range of "35% to "85%.
w (degrees)
f (degrees)
Cysteine
Oxidation
72
31
of RNase B.
Altogether, over 70 sets of threedimensional coordinates related to RNase A have
been deposited in the Brookhaven Protein Data Bank
(www.pdb.bnl.gov).
RNase A is small. The mature enzyme, as secreted
32
such cases, the individual polypeptides are not considered subunits but are commonly referred to simply as
chains.
We can calculate the approximate number of amino
acid residues in a simple protein containing no other
1: 2,3,4,5,6, ...
Dipeptides (2);
Tripeptides (3);
Tetrapeptides (4);
Pentapeptides (5);
... ...
Oligopeptides (a few);
Polypeptides (many, MW< 10,000);
TABLE 32
Cytochrome c (human)
Ribonuclease A (bovine pancreas)
Lysozyme (chicken egg white)
Myoglobin (equine heart)
Chymotrypsin (bovine pancreas)
Chymotrypsinogen (bovine)
Hemoglobin (human)
Serum albumin (human)
Hexokinase (yeast)
RNA polymerase (E. coli)
Apolipoprotein B (human)
Glutamine synthetase (E. coli)
Titin (human)
33
Summary on L1
1. 20 amnio acids and their properties (structure, pKa);
2. Important molecular forces (hydrophobicity/hydrophilicity,
hydrogen bonding, salt bridges);
3. Peptide bond ();
4. Ramachandran plot ( vs ).
Proteins;
For example, the amide bonds in the side chains of asparagine and glutamine are cleaved by acid treatment,
yielding aspartate and glutamate, respectively. The side
chain of tryptophan is almost completely degraded by
acid hydrolysis, and small amounts of serine, threonine,
35
13,000
13,700
13,930
16,890
21,600
22,000
64,500
68,500
102,000
450,000
513,000
619,000
2,993,000
Number of
residues
104
124
129
153
241
245
574
609
972
4,158
4,536
5,628
26,926
Number of
polypeptide chains
1
1
1
1
3
1
4
1
2
5
1
12
1
34