Documente Academic
Documente Profesional
Documente Cultură
www.bba-direct.com
Review
Large-scale protein identification using mass spectrometry
Dayin Lin a, David L. Tabb b, John R. Yates III a,*
a
Department of Cell Biology, The Scripps Research Institute, 10550 North Torrey Pines Rd., La Jolla, CA 92037, USA
b
Department of Genome Sciences, University of Washington, Box 357730 Seattle, WA 98195, USA
Received 2 September 2002; received in revised form 25 November 2002; accepted 3 December 2002
Abstract
Recent achievements in genomics have created an infrastructure of biological information. The enormous success of genomics promptly
induced a subsequent explosion in proteomics technology, the emerging science for systematic study of proteins in complexes, organelles,
and cells. Proteomics is developing powerful technologies to identify proteins, to map proteomes in cells, to quantify the differential
expression of proteins under different states, and to study aspects of protein – protein interaction. The dynamic nature of protein expression,
protein interactions, and protein modifications requires measurement as a function of time and cellular state. These types of studies require
many measurements and thus high throughput protein identification is essential. This review will discuss aspects of mass spectrometry with
emphasis on methods and applications for large-scale protein identification, a fundamental tool for proteomics.
D 2002 Elsevier Science B.V. All rights reserved.
1570-9639/02/$ - see front matter D 2002 Elsevier Science B.V. All rights reserved.
doi:10.1016/S1570-9639(02)00546-0
2 D. Lin et al. / Biochimica et Biophysica Acta 1646 (2003) 1–10
have come about through synergies of genomic data and terminus and incomplete purification [15]. A recent inno-
advanced mass spectrometers. Current high-sensitivity meth- vation in the use of Edman degradation has improved the
ods using mass spectrometry for protein identification have analysis of simple protein mixtures by deconvoluting the
vastly lowered detection limits [7], permitting the facile mixed sequence from each cycle using sequences from
analysis of proteins with low cell copy numbers from fewer databases [16]. Other traditional approaches for protein
numbers of cells [8]. An increasing level of automation is identification include the use of antibodies to perform
allowing the development of high-speed proteomic experi- Western blots [17]. Antibody use, however, is impaired by
ments. In addition, the quality of bioinformatics continues to non-specific binding and by the availability of antibodies to
improve; making predictions of gene products deduced from all proteins [18]. As genome sequence information has
genome information increasingly accurate. With this data- accumulated, the paradigm has shifted from sequencing to
base infrastructure in place, algorithms designed to match identification. This situation has been facilitated by advan-
mass spectrometry data to known protein sequences have ces in ionization and mass analysis techniques for mass
become possible [9,10]. Isolated proteins can be matched to spectrometry and the ability to correlate mass spectrometry
protein sequences by peptide mass fingerprinting, while more data of peptides and proteins to sequences in databases.
complex mixtures of proteins can be analyzed through
tandem mass spectrometry. These improvements in instru-
mentation and automation make possible experiments far 3. Ionization techniques for biomolecules in mass
greater in scale than ever before. spectrometry
Proteomics can be used to help define the functions and
interrelationships of proteins in an organism [11,12]. The Mass spectrometry analysis of peptides and proteins relies
basic experiments to achieve this aim are commonly prac- exclusively on soft ionization techniques that create intact
ticed in biochemistry and molecular biology. In the past, the gas-phase ions from biomolecules. The creation of intact
rate-limiting step has been the analytical process through molecular ions enables accurate measurement of molecular
which the proteins observed in an experiment such as an weight. The electrospray ionization (ESI) and matrix-assis-
immunoprecipitation can be identified. To inventory a cell’s ted laser desorption/ionization (MALDI) techniques were
complete protein content and dissect its protein interaction developed more than 10 years ago and revolutionized the
network requires protein identification technologies that can analysis of biomolecules [19 – 22]. Biomolecules are often
operate at extremely high throughput and sensitivity. Further polar and charged, thus conversion of solution or solid-phase
complicating the analysis of proteins are the chemical ions to gas-phase ions is not an energetically favorable
modifications that may occur either co-translationally or process. ESI and MALDI methods are currently the principal
post-translationally to modify or regulate their functions. methods for peptide/protein ionization and they have been
Covalent modification events include phosphorylation, linked to high-throughput sample preparation techniques
methylation, glycosylation, prenylation, formylation, or [23].
acetylation. Other modifications of proteins include proteo-
lytic cleavage, oxidation of some amino acids, or cross- 3.1. Electrospray ionization
linking events. All processing adds greatly to the complex-
ity of protein analysis. Modifications may also change both ESI operates at atmospheric pressure and produces tiny
chemical and physical properties of individual proteins, charged solvent droplets when a high electric potential (2– 5
such as chromatographic behavior, mass, and ionization kV) is set between a capillary and the inlet to a mass
efficiency. Although the complexity of the proteome is a spectrometer. By using a drying gas or heat in the atmos-
challenge for its analysis, techniques to handle structural pheric pressure interface, the charged droplets shrink and
and modification features have evolved to make mass eventually desolvated ions are desorbed from the droplet
spectrometry an important technology for large-scale protein [24]. The ESI process can be used to produce negative or
identification [9]. positive ions, but typically peptides or proteins are analyzed
as positive ions using the capillary as an anode and the MS
inlet as the cathode. ESI uses a steady stream of solvent to
2. A paradigm shift: from sequencing to identification produce a continuous beam of ions and thus is readily
coupled to HPLC. When operating at high solvent flow
Edman sequencing has been the gold standard for protein rates of 100 Al/min to 1 ml/min, a nebulization gas (or
sequencing for the last 25 years [13]. For unambiguous sheath gas) is needed to assist the solvent dispersion. At low
sequencing, a protein is purified to homogeneity prior to flow rates, electrospray can be induced by electric potential
sequencing. The sample then undergoes cycles of Edman alone. Two operating regimes have been defined at low flow
degradation reactions to remove the N-terminal amino acid rates. The first is microelectrospray, where the flow rate is
which is collected at the end of each cycle for identification approximately 100 –500 nl/min; and the second is nano-
by liquid chromatography [14]. Each cycle requires 30 – 60 electrospray, where a flow rate of 100 nl/min or less is used
min. Edman sequencing is complicated by a blocked N- [25,26]. These low flow rates create smaller droplets and a
D. Lin et al. / Biochimica et Biophysica Acta 1646 (2003) 1–10 3
much reduced Taylor cone, allowing the bulk of the spray to RF to DC voltage selectively stabilizes the trajectory of ions
be directed into the mass spectrometer’s inlet [26]. As a of particular m/z value as they pass through the analyzer. Ion
result, much less sample is required than at higher flow rates current is recorded at a detector as ions exit the analyzer [29].
because of the concentration-sensitive nature of electro- Quadrupole ion trap (IT) analyzers create three-dimensional
spray. Another characteristic of ESI is the production of RF fields to trap ions in the center of a ring electrode [29,30].
multiply charged ions which lowers the m/z values (over The field can be manipulated to selectively eject ions of a
singly charged species, z=1) for higher molecular weight particular m/z value to a detector to record the m/z ratios or to
compounds and thus allows measurement of m/z values on selectively retain a particular m/z value for collision-acti-
mass spectrometers with limited m/z ranges. Furthermore, vated dissociation (CAD). CAD can be used to fragment ions
multiple protonation of peptides and proteins promotes by exciting the trapped ions to increase their motion causing
more facile amide bond fragmentation when the ions are hundreds to thousands of ion-molecular collisions with the
activated for dissociation. These features of ESI have led to helium bath gas. The resulting fragment ions are then
its widespread adoption for proteomics research. sequentially ejected to the detector [30]. TOF mass analyzers
accelerate a packet of ions with a set of electric potentials and
3.2. Matrix-assisted laser desorption/ionization differentiate them by the time they take to traverse a flight
tube [31]. An m/z value can be calculated from the time
MALDI uses energy from lasers rather than electrical required to move from the ion source to the detector. Mag-
potential to ionize biomolecules. MALDI uses small UV netic fields can also be used to measure m/z values by
absorbing molecules to co-crystallize with peptides/proteins trapping ions in a static magnetic field (Ion Cyclotron Mass
on a sample plate [22]. Ionization occurs when these matrix Spectrometry also called Fourier Transform Mass Spectrom-
molecules absorb the energy provided by a laser (usually 337 etry, FTMS). Ions rotate around the magnetic field with a
nm). Release of the energy causes a rapid thermal expansion cyclotron frequency related to their m/z value. By perturbing
of matrix and analyte into the gas phase. Proton transfer from ion motion in the cell, the ion motion goes into coherence
analyte to matrix may result in charge reduction to the singly and an electric signal can be measured between two detector
charged ion observed in the gas-phase [27]. The most plates (detector plates detect an ion current as the packet of
commonly used matrix molecules are a-cyano-4-hydroxyci- ions moves from one side of the cell to the other). As the
naminic acid for peptides and polypeptides less than 5000 motion of the ions decays back to their natural cyclotron
Da, and 3,5-dimethoxy-4-hydroxy-cinnamic acid, or sina- frequencies, the signal deteriorates. By recording the signal
pinic acid, for proteins. MALDI produces predominately decay, frequencies can be calculated using a Fourier trans-
singly charged ions and is less sensitive to salts in the buffer form to calculate very accurate m/z values [32]. Hybrid mass
than ESI, although salt and matrix adducts of analyte ions spectrometers that combine different types of mass analyzers
can form. Because MALDI depends on pulsed laser radia- have been constructed to produce unique capabilities. For
tion, ions are created in bunches or packets, consequently, example, quadrupole-TOFs are instruments that combine a
mass analyzers capable of analyzing ions created in an quadrupole mass analyzer for ion selection in a tandem mass
intermittent fashion are required. The most common mass spectrometer mode and TOF analyzers to record m/z values
analyzers used with MALDI are time-of-flight (TOF) mass with high resolution [33]. Ions are dissociated in a collision
spectrometers with growing interest in using this ionization cell between the two mass analyzers. More recently, TOF
technique with ion trap mass spectrometers (described mass analyzers have been combined to create tandem mass
below). A recent innovation in MALDI sources is the spectrometers (TOF– TOF). In this instrument, m/z values
operation of this device at atmospheric pressure [28]. This are selected by their TOF and all others are deflected from
method of operation simplifies source design and allows the the flight path. The ions then pass into a collision cell and
use of MALDI with sources originally designed for ESI. undergo high-energy collisions with an inert gas such as
Thus, the two different ionization methods can be used helium, causing fragmentation of the ions. This prompt
interchangeably with the same instrument. fragmentation of ions is referred to as collision-induced dis-
sociation (CID). All of the above mass spectrometers are
effective for proteomic studies and differentiate on the basis
4. Mass analyzers for proteomic mass spectrometry of mass range, mass accuracy, sensitivity, resolution and
cost. Ion trap and TOF mass analyzers are compatible with
As ions exit the ion source, they pass into a mass analyzer. MALDI because they trap packets of ions or analyze them
The mass analyzer is responsible for separating ions by their simultaneously, respectively.
mass-to-charge (m/z) ratios. Mass analyzers use electric and/
or magnetic fields to manipulate ions in a mass-dependent
manner. Quadrupole (Q) mass analyzers use a radiofre- 5. Overview of protein identification by mass analysis
quency (RF) voltage applied to four metal rods with RF
voltage of alternate polarity placed on opposite rods. A direct Complete protein identification involves correlating data
current (DC) voltage is overlayed on the rods. The ratio of from a protein with its gene sequence and identifying the
4 D. Lin et al. / Biochimica et Biophysica Acta 1646 (2003) 1–10
site and type of modifications that may be on the protein 41]. Even as ECD improves, protein fragmentation is still
(Fig. 1). To unambiguously correlate an isolated protein’s not as general as peptide fragmentation. Computer algo-
sequence to the sequence predicted through genome rithms to use the data derived from ‘‘top-down’’ methods to
sequencing, data indicative of amino acid sequence must identify protein sequences from databases have been devel-
be derived from the protein. During the last several years, oped [37,42]. However, the observed molecular weight of a
two different approaches for MS-based protein identification protein may differ from the predicted molecular weight
from complex mixtures have been maturing: ‘‘top-down’’ because of sequence errors, post-translational modifications,
and ‘‘bottom-up’’ [34]. or proteolyic processing. The use of protein fragmentation
techniques such as ECD has facilitated protein identification
5.1. Top-down approach using intact proteins and databases [43]. After a protein has
been identified, an accurate molecular weight can be enor-
The ‘‘top-down’’ strategy starts with an intact protein and mously useful in pinpointing the extent of modification of
cleaves the protein in the gas phase rather than in solution. the protein. This technique will have a significant role in the
The protein is fragmented inside the mass spectrometer to identification of small proteins (<10 kDa) because they are
create a ladder of ions indicative of the sequence. The difficult to predict through bioinformatics and there is much
difference in m/z values of fragment ions define the position to be learned about their biological roles.
and sequence of the amino acids in the protein. Several
methods and mass spectrometers are used for ‘‘top-down’’ 5.2. Bottom-up approach
analysis. ESI-FTMS is able to analyze intact proteins
because of its capability for high resolution. Proteins are The ‘‘bottom-up’’ approach identifies proteins by tandem
fragmented in this device using electron capture dissociation mass spectrometry analysis of peptides derived from diges-
(ECD) [35]. Other methods of ion dissociation have been tion of mixtures of intact proteins [44,45]. The resulting
used in FTMS instruments, but ECD has the potential to be peptide mixture is chromatographically fractionated and
a general method for protein fragmentation [36]. FTMS can introduced into a tandem mass spectrometer. Tandem mass
isolate individual m/z values (a specific charge state of a spectrometers can select individual peptide ions for analysis
protein, for example) and thus simple protein mixtures can from mixtures of ions. The fragmentation pattern derived for
be introduced into the mass spectrometer and then frag- each peptide is indicative of the amino acid sequence of the
mented separately. Proteins need not be purified to homo- peptide. This pattern can be used to search sequence data-
geneity and thus have less dependence on chromatographic bases without first interpreting the sequence from the spec-
separation prior to MS analysis [37]. The high mass accu- trum. Amino acid sequences are selected for further analysis
racy of FTMS instruments can detect protein sequence from the database on the basis of the molecular weight of the
errors and protein post-translational modifications [38 – peptide. The predicted fragmentation pattern for each peptide
Fig. 1. Schematic showing the ‘‘top-down’’ and bottom-up approaches to protein analysis and identification. In the ‘‘top-down’’ approach, the intact protein is
fragmented in the gas-phase to create sequence-specific fragment ions to aid in sequence analysis. The bottom-up approach uses proteolytic enzymes to create
peptides for sequence analysis usually by using multidimensional liquid chromagoraphy in combination with tandem mass spectrometry.
D. Lin et al. / Biochimica et Biophysica Acta 1646 (2003) 1–10 5
identify proteins from 2DGE [59]. Proteins are separated to spectrometers to select and analyze peptides from protein
homogeneity, typically by 2DGE, and then trypsin or mixtures which have been proteolytically digested. The
another enzyme digests them into characteristic peptides. peptides then serve as surrogate markers for the protein
A key component to mass mapping is cleavage of the sequence [44]. Proteins are identified by searching the
proteins into predictable peptide fragments, although less resulting tandem mass spectra through sequence databases.
specific proteases such as thermolysin have been used to Because digested protein mixtures create complex mixtures
cleave peptides for mass mapping [60]. Mass spectrometry of peptides, high-performance separation techniques are
(often MALDI-TOF) measures the m/z values of the pep- required to resolve the peptides prior to entering the tandem
tides, and an algorithm searches a protein database for mass spectrometer [44,47,48]. This approach, when com-
proteins that would produce peptides of these molecular bined with multidimensional liquid chromatography and
weights when cleaved with the same protease (see Table 1 database searching, is called Multidimensional Protein Iden-
for several programs used for this purpose). The quality of tification Technology, or MudPIT. This process fully auto-
the result depends on the purity of the sample, the accuracy mates the separation and identification of proteins from
of the measurement, and the number of peptide masses complex mixtures [64,65]. The first step in any MudPIT
obtained. The peptide mass mapping method can be auto- experiment is the enzymatic digestion of a protein mixture to
mated when coupled with robotic equipment that can deliver produce a more complex peptide mixture. A specially
processed samples from spots on 2-D gels. Mass mapping is packed biphasic liquid chromatography column (see Fig.
an attractive choice for proteomic studies involving 2DGE. 2) using a strong cation exchange (SCX) support as the initial
Shevchenko et al. [61] reported the identification of 150 phase and reversed-phase (RP) material as the second phase
yeast proteins by 2DGE using MALDI peptide mass map- is used to separate peptides. The column is constructed from
ping. In 1999, a 2-D yeast reference map was established, fused silica and has an internal diameter of 100 Am and thus
and a combined 374 proteins were identified from several eluent is delivered to the mass spectrometer at low flow rates
different labs [62]. In an analysis of H. pylori proteome, (100 – 200 nl/min). Peptides are stepped from the SCX phase
1863 spots were observed from multiple 2DGE separations onto the RP phase in a series of salt steps that increase in
of prefractionated cellular material. Despite the observation concentration. A subsequent RP gradient separates the pep-
of so many proteins, only 126 of the proteins were identified tides and delivers the peptides into the mass spectrometer
[63]. Single dimension gel electrophoresis can also be used after each salt step. Since each phase separates peptides by
for large-scale protein identification studies. Two recent orthogonal chemical properties, a high-resolution separation
analyses of immunoprecipitated yeast protein complexes among the peptides is achieved with the aim that the mass
by using SDS-PAGE and mass spectrometry identified large spectrometer will analyze a more manageable number of
numbers of protein complexes [11,12]. peptides at any point during the separation [66]. MudPIT
analysis of proteins from a cell can detect proteins over a
6.2. Multidimensional protein identification technology wide range of pI, abundance, and subcellular localization
[65]. The MudPIT method was used to identify proteins from
An alternate approach for the analysis of protein mixtures a yeast whole cell lysate by Washburn et al. [65]. From three
uses a different paradigm to identify the proteins contained in fractions (soluble proteins, peripheral membrane proteins,
a mixture. This strategy uses the ability of tandem mass integral membrane proteins) of the cell lysate, a total of 1484
Fig. 2. Preparation for a biphasic column for LC/LC/MS/MS analysis. A piece of fused silica capillary is pulled to have an opening of 5 Am. The packing of the
column is achieved by using a pressure bomb to force packing materials up to the column. RP and SCX materials are used sequentially to make a biphasic
column.
D. Lin et al. / Biochimica et Biophysica Acta 1646 (2003) 1–10 7
Fig. 3. Schemes for large-scale protein identification methods. The gel-based 2DGE approach separates proteins from cell lysate by molecular weight and
isoelectric point. Each spot can then be excised and enzymatically digested for MS analysis. The liquid-based MudPIT method digests the proteins in the lysate
and separates peptides on a biphasic LC column for MS/MS analysis. Database searches are conducted for both approaches after MS data are collected.
8 D. Lin et al. / Biochimica et Biophysica Acta 1646 (2003) 1–10
chromatography steps. Conrads et al. [72] uses the high Modified peptides, such as those bearing the important
mass accuracy of FTMS instruments to profile peptides phosphorylation modification, may produce fragment ion
from digested protein mixtures. Measurement of m/z values spectra of greater complexity than normal peptides, compli-
with high accuracy allows facile comparison between anal- cating their interpretation. At the same time, augmented
yses and provides in instances a mechanism to identify forms of existing database matching algorithms typically
proteins. To further ensure unambiguous identification of take far longer to run on spectra when modification possi-
the peptides, the complex peptide mixture is analyzed by bilities are taken into account. Progress has been made in
multiple LC/MS/MS experiments using ‘‘gas-phase’’ frac- solving this problem, but much work remains to be done
tionation on an ion trap mass spectrometer [73] (Fig. 3). By [79 – 84].
reducing the scan range of the mass spectrometer (e.g. 500– An increasing emphasis in proteomics is the quantitation
800), the number of ions available for data-dependent data of protein content rather than simple determination of
acquisition is reduced, allowing the acquisition of more presence or absence [85]. Several methods exist to quanti-
peptide ions then normally occurs during a single dimension tate proteins and all rely on labeling proteins with a mass
chromatographic separation. By repeating the process over label to differentiate the same peptides from two states of
different mass ranges (e.g. 800 –1000, 1000– 1200, etc.) the cells for relative quantitation. The first method uses
more thorough coverage of the proteome is possible. In the metabolic labeling to incorporate 15N into peptides [86]. A
study by Lipton et al. [73] of D. radiodurans, an impressive second method uses proteolytic digestion in the presence of
18
61% of the proteome was covered by analyzing cultures O water to incorporate a label [87]. The last method uses
grown under a variety of conditions. covalent labeling to incorporate the mass label [71,88 – 90].
A variation of the ‘‘bottom-up’’ approach that combines Covalent labeling is targeted at reactive amino acid residues
features of the ‘‘top-down’’ approach is employed by Chong and generally the label contains deuterium atoms in place of
et al. [74]. In this approach, proteins are separated by hydrogen atoms. Labels have been developed for amino and
multidimensional liquid chromatography and fractions col- sulfhydryl groups. These methods allow the measurement of
lected for digestion. Liquid isoelectric focusing has been relative quantitative states. Recently, Stemmann et al. [91]
used together with reversed-phase liquid chromatography to have developed a method for the measurement of absolute
effect high-resolution separations [75]. After digestion, quantities of peptides. Because the method requires the
fractions are then analyzed by MALDI-TOF to identify addition of a known quantity of a peptide standard, it is
the proteins present. Alternatively, the molecular weights not suitable for proteome-wide measurements. Biology is
of proteins can be measured directly in the mass spectrom- moving to a more quantitative description of cellular pro-
eter to create three-dimensional maps of proteins based on cesses and mass spectrometry is poised to play a large role,
pI, hydrophobicity and molecular weight to compare cells of but all of this relies on high throughput identification of
different states [76]. proteins.
As challenges in protein identification are steadily over- The authors acknowledge support from NIH grant
come, the focus in proteomics is shifting to other important R33CA8165-01, NIH grant RR11823-05, and National
areas. Three areas of particular emphasis are ‘‘de novo’’ Science Foundation Graduate Research Fellowship.
peptide sequencing, post-translational modification identifi-
cation, and quantitation of peptides by mass spectrometry.
De novo peptide sequencing by mass spectrometry
References
differs substantially from the peptide identification process
as embodied in SEQUEST and other algorithms, which rely [1] I.H.G.S. Consortium, Nature 409 (2001) 860 – 921.
on the presence of the target peptide sequence in a supplied [2] J.C. Venter, M.D. Adams, G.G. Sutton, A.R. Kerlavage, H.O. Smith,
database. The de novo challenge is to infer a peptide M. Hunkapiller, Science 280 (1998) 1540 – 1542.
sequence directly from the spectrum without external hints [3] S.A. Goff, D. Ricke, T.H. Lan, G. Presting, R. Wang, M. Dunn,
as to what the sequence may be. Several attempts have been J. Glazebrook, A. Sessions, P. Oeller, H. Varma, D. Hadley, D. Martin,
C. Martin, F. Katagiri, B.M. Lange, T. Moughamer, Y. Xia, P. Bud-
made but success has been limited by data quality (such as worth, J. Zhong, T. Miguel, U. Paszkowski, S. Zhang, M. Colbert,
mass accuracy and signal to noise) and hindered by the W.L. Sun, L. Chen, B. Cooper, S. Park, T.C. Wood, L. Mao, P. Quail,
complexity of accurately predicting fragment ion spectra R. Wing, R. Dean, Y. Yu, A. Zharkikh, R. Shen, S. Sahasrabudhe,
[77,78]. It is unlikely that de novo sequencing will ever be A. Thomas, R. Cannings, A. Gutin, D. Pruss, J. Reid, S. Tavtigian,
as sensitive as database searching methods. J. Mitchell, G. Eldredge, T. Scholl, R.M. Miller, S. Bhatnagar, N. Adey,
T. Rubano, N. Tusneem, R. Robinson, J. Feldhaus, T. Macalma,
Post-translational modifications play an important role A. Oliphant, S. Briggs, Science 296 (2002) 92 – 100.
in determining protein function, but identifying peptides [4] J. Yu, S. Hu, J. Wang, G.K. Wong, S. Li, B. Liu, Y. Deng, L. Dai, Y.
containing these modifications poses many challenges. Zhou, X. Zhang, M. Cao, J. Liu, J. Sun, J. Tang, Y. Chen, X. Huang,
D. Lin et al. / Biochimica et Biophysica Acta 1646 (2003) 1–10 9
W. Lin, C. Ye, W. Tong, L. Cong, J. Geng, Y. Han, L. Li, W. Li, G. [31] R.J. Cotter, Biomed. Environ. Mass Spectrom. 18 (1989) 513 – 532.
Hu, J. Li, Z. Liu, Q. Qi, T. Li, X. Wang, H. Lu, T. Wu, M. Zhu, P. Ni, [32] A.G. Marshall, C.L. Hendrickson, G.S. Jackson, Mass Spectrom. Rev.
H. Han, W. Dong, X. Ren, X. Feng, P. Cui, X. Li, H. Wang, X. Xu, W. 17 (1998) 1 – 35.
Zhai, Z. Xu, J. Zhang, S. He, J. Xu, K. Zhang, X. Zheng, J. Dong, W. [33] H.R. Morris, T. Paxton, A. Dell, J. Langhorne, M. Berg, R.S. Bordoli,
Zeng, L. Tao, J. Ye, J. Tan, X. Chen, J. He, D. Liu, W. Tian, C. Tian, J. Hoyes, R.H. Bateman, Rapid Commun. Mass Spectrom. 10 (1996)
H. Xia, Q. Bao, G. Li, H. Gao, T. Cao, W. Zhao, P. Li, W. Chen, Y. 889 – 896.
Zhang, J. Hu, S. Liu, J. Yang, G. Zhang, Y. Xiong, Z. Li, L. Mao, C. [34] J.R. Kettman, J.R. Frey, I. Lefkovits, Biomol. Eng. 18 (2001) 207 – 212.
Zhou, Z. Zhu, R. Chen, B. Hao, W. Zheng, S. Chen, W. Guo, M. Tao, [35] F.W. McLafferty, M.W. Senko, Stem Cells 12 (1994) 68 – 73.
L. Zhu, L. Yuan, H. Yang, Science 296 (2002) 79 – 92. [36] E. Mortz, P.B. O’Connor, P. Roepstorff, N.L. Kelleher, T.D. Wood,
[5] D. Kennedy, Science 296 (2000) 13. F.W. McLafferty, M. Mann, Proc. Natl. Acad. Sci. U. S. A. 93 (1996)
[6] A.J.T. George, Trends Immunol. 23 (2002) 351 – 355. 8264 – 8267.
[7] M.E. Belov, M.V. Gorshkov, H.R. Udseth, G.A. Anderson, R.D. [37] J.L. Stephenson, S.A. McLuckey, G.E. Reid, J.M. Wells, J.L. Bundy,
Smith, Anal. Chem. 72 (2000) 2271 – 2279. Curr. Opin. Biotechnol. 13 (2002) 57 – 64.
[8] D.A. Wolters, M.P. Washburn, J.R. Yates III, Anal. Chem. 73 (2001) [38] N.L. Kelleher, S.V. Taylor, D. Grannis, C. Kinsland, H.J. Chiu, T.P.
5683 – 5690. Begley, F.W. McLafferty, Protein Sci. 7 (1998) 1796 – 1801.
[9] J.R. Yates III, J. Mass Spectrom. 33 (1998) 1 – 19. [39] N.L. Kelleher, R.A. Zubarev, K. Bush, B. Furie, B.C. Furie, F.W.
[10] J.R. Yates III, Trends Genet. 16 (2000) 5 – 8. McLafferty, C.T. Walsh, Anal. Chem. 71 (1999) 4250 – 4253.
[11] A.C. Gavin, M. Bosche, R. Krause, P. Grandi, M. Marzioch, A. Bauer, [40] S.K. Sze, Y. Ge, H. Oh, F.W. McLafferty, Proc. Natl. Acad. Sci. U. S. A.
J. Schultz, J.M. Rick, A.M. Michon, C.M. Cruciat, M. Remor, C. 99 (2002) 1774 – 1779.
Hofert, M. Schelder, M. Brajenovic, H. Ruffner, A. Merino, K. Klein, [41] Y. Ge, B.G. Lawhorn, M. ElNaggar, E. Strauss, J.H. Park, T.P. Begley,
M. Hudak, D. Dickson, T. Rudi, V. Gnau, A. Bauch, S. Bastuck, B. F.W. McLafferty, J. Am. Chem. Soc. 124 (2002) 672 – 678.
Huhse, C. Leutwein, M.A. Heurtier, R.R. Copley, A. Edelmann, E. [42] D.M. Horn, R.A. Zubarev, F.W. McLafferty, Proc. Natl. Acad. Sci.
Querfurth, V. Rybin, G. Drewes, M. Raida, T. Bouwmeester, P. Bork, U. S. A. 97 (2000) 10313 – 10317.
B. Seraphin, B. Kuster, G. Neubauer, G. Superti-Furga, Nature 415 [43] F. Meng, B.J. Cargile, L.M. Miller, A.J. Forbes, J.R. Johnson, N.L.
(2002) 141 – 147. Kelleher, Nat. Biotechnol. 19 (2001) 952 – 957.
[12] Y. Ho, A. Gruhler, A. Heilbut, G.D. Bader, L. Moore, S.L. Adams, A. [44] J.K. Eng, A.L. McCormack, J.R. Yates III, J. Am. Soc. Mass Spec-
Millar, P. Taylor, K. Bennett, K. Boutilier, L. Yang, C. Wolting, I. trom. 5 (1994) 976 – 989.
Donaldson, S. Schandorff, J. Shewnarane, M. Vo, J. Taggart, M. Gou- [45] M. Mann, M. Wilm, Anal. Chem. 66 (1994) 4390 – 4399.
dreault, B. Muskat, C. Alfarano, D. Dewar, Z. Lin, K. Michalickova, [46] J.R. Yates, J.K. Eng, A.L. McCormack, D. Schieltz, Anal. Chem. 67
A.R. Willems, H. Sassi, P.A. Nielsen, K.J. Rasmussen, J.R. Andersen, (1995) 1426 – 1436.
L.E. Johansen, L.H. Hansen, H. Jespersen, A. Podtelejnikov, E. Niel- [47] A.L. McCormack, J.K. Eng, J.R. Yates III, Methods 6 (1994) 273 –
sen, J. Crawford, V. Poulsen, B.D. Sorensen, J. Matthiesen, R.C. Hen- 274 (A companion to Methods in Enzymology).
drickson, F. Gleeson, T. Pawson, M.F. Moran, D. Durocher, M. Mann, [48] A.L. McCormack, D.M. Schieltz, B. Goode, S. Yang, G. Barnes, D.
C.W. Hogue, D. Figeys, M. Tyers, Nature 415 (2002) 180 – 183. Drubin, J.R. Yates III, Anal. Chem. 69 (1997) 767 – 776.
[13] J.E. Shively, EXS 88 (2000) 99 – 117. [49] S.E. Ong, A. Pandey, Biomol. Eng. 18 (2001) 195 – 205.
[14] J.M. Walker, Methods Mol. Biol. 32 (1994) 329 – 334. [50] F. Von Eggeling, A. Gawriljuk, W. Fiedler, G. Ernst, U. Claussen, J.
[15] T.C. Farries, A. Harris, A.D. Auffret, A. Aitken, Eur. J. Biochem. 196 Klose, I. Romer, Int. J. Mol. Med. 8 (2001) 373 – 377.
(1991) 679 – 685. [51] J. Klose, U. Kobalz, Electrophoresis 16 (1995) 1034 – 1059.
[16] A.J. Mackey, T.A. Haystead, W.R. Pearson, Mol. Cell Proteomics 1 [52] L. Vuillard, N. Marret, T. Rabilloud, Electrophoresis 16 (1995)
(2002) 139 – 147. 295 – 297.
[17] P. Shalhoub, S. Kern, S. Girard, L. Beretta, Dis. Markers 17 (2001) [53] T. Rabilloud, Electrophoresis 17 (1996) 813 – 829.
217 – 223. [54] T. Rabilloud, C. Adessi, A. Giraudel, J. Lunardi, Electrophoresis 18
[18] D.E. Rebeski, E.M. Winger, Y.K. Shin, M. Lelenta, M.M. Rob- (1997) 307 – 316.
inson, R. Varecka, J.R. Crowther, J. Immunol. Methods 226 [55] N. Galeva, M. Altermann, Proteomics 2 (2002) 713 – 722.
(1999) 85 – 92. [56] S.P. Gygi, G.L. Corthals, Y. Zhang, Y. Rochon, R. Aebersold, Proc.
[19] M. Barber, R.S. Bordoli, R.D. Sedgwick, A.N. Tyler, B.W. Bycroft, Natl. Acad. Sci. U. S. A. 97 (2000) 9390 – 9395.
Biochem. Biophys. Res. Commun. 101 (1981) 632 – 638. [57] P.G. Righetti, A. Castagna, B. Herbert, Anal. Chem. 73 (2001)
[20] H.R. Morris, M. Panico, M. Barber, R.S. Bordoli, R.D. Sedgwick, A. 320A – 326A.
Tyler, Biochem. Biophys. Res. Commun. 101 (1981) 623 – 631. [58] G. Candiano, L. Musante, M. Bruschi, G.M. Ghiggeri, B. Herbert, F.
[21] J.B. Fenn, M. Mann, C.K. Meng, S.F. Wong, C.M. Whitehouse, Antonucci, P.G. Righetti, Electrophoresis 23 (2002) 292 – 297.
Science 246 (1989) 64 – 71. [59] J.S. Cottrell, Pept. Res. 7 (1994) 115 – 124.
[22] F. Hillenkamp, M. Karas, R.C. Beavis, B.T. Chait, Anal. Chem. 63 [60] S.J. Bark, N. Muster, J.R. Yates III, G. Siuzdak, J. Am. Chem. Soc.
(1991) 1193A – 1203A. 123 (2001) 1774 – 1775.
[23] G. Siuzdak, Proc. Natl. Acad. Sci. U. S. A. 91 (1994) 11290 – 11297. [61] A. Shevchenko, O.N. Jensen, A.V. Podtelejnikov, F. Saglicco, M.
[24] A.P. Jonsson, Cell. Mol. Life Sci. 58 (2001) 868 – 884. Wilm, O. Vorm, P. Mortensen, A. Shevchenko, H. Boucherie, M.
[25] M.R. Emmett, R.M. Caprioli, J. Am. Soc. Mass Spectrom. 5 (1994) Mann, Proc. Natl. Acad. Sci. U. S. A. 93 (1996) 14440 – 14445.
605 – 613. [62] M. Perrot, F. Sagliocco, T. Mini, C. Monribot, U. Schneider, A.
[26] M.S. Wilm, M. Mann, Int. J. Mass Spectrom. Ion Process. 136 (1994) Shevchenko, M. Mann, P. Jeno, H. Boucherie, Electrophoresis 20
2 – 3. (1999) 2280 – 2298.
[27] R. Kruger, A. Pfenninger, I. Fournier, M. Gluckmann, M. Karas, [63] P.R. Jungblut, D. Bumann, G. Haas, U. Zimny-Arndt, P. Holland, S.
Anal. Chem. 73 (2001) 5812 – 5821. Lamer, F. Siejak, A. Aebischer, T.F. Meyer, Mol. Microbiol. 36
[28] V.V. Laiko, M.A. Baldwin, A.L. Burlingame, Anal. Chem. 72 (2000) (2000) 710 – 725.
652 – 657. [64] A.J. Link, J. Eng, D.M. Schieltz, E. Carmack, G.J. Mize, D.R.
[29] K. Busch, G.L. Glish, S.A. McLuckey, Mass Spectrometry/Mass Morris, B.M. Garvik, J.R. Yates III, Nat. Biotechnol. 17 (1999)
Spectrometry: Techniques and Applications of Tandem Mass Spec- 676 – 682.
trometry, 1st ed., VCH, New York, 1988. [65] M.P. Washburn, D. Wolters, J.R. Yates III, Nat. Biotechnol. 19 (2001)
[30] K.R. Jonscher, J.R. Yates III, Anal. Biochem. 244 (1997) 1 – 15. 242 – 247.
10 D. Lin et al. / Biochimica et Biophysica Acta 1646 (2003) 1–10
[66] D. Lin, A.J. Alpert, J.R. Yates III, Am. Genomic/Proteomic Technol. [78] J.A. Taylor, R.S. Johnson, Rapid Commun. Mass Spectrom. 11
1 (2001) 38 – 46. (1997) 1067 – 1075.
[67] R.G. Sadygov, J. Eng, E. Durr, A. Saraf, H. McDonald, M.J. Mac- [79] K.R. Jonscher, J.R. Yates, J. Biol. Chem. 272 (1997) 1735 – 1741.
Coss, J.R. Yates III, J. Proteome Res. 1 (2002) 211 – 215. [80] M.G. de Carvalho, A.L. McCormack, E. Olson, F. Ghomashchi, M.H.
[68] D.L. Tabb, W.H. McDonald, J.R. Yates III, J. Proteome Res. 1 (2002) Gelb, J.R. Yates, C.C. Leslie, J. Biol. Chem. 271 (1996) 6987 – 6997.
21 – 26. [81] Y. Oda, T. Nagasu, B.T. Chait, Nat. Biotechnol. 19 (2001) 379 – 382.
[69] E.C. Koc, W. Burkhart, K. Blackburn, M.B. Moyer, D.M. Schlatzer, A. [82] A. Schlosser, R. Pipkorn, D. Bossemeyer, W.D. Lehmann, Anal.
Moseley, L.L. Spremulli, J. Biol. Chem. 276 (2001) 43958 – 43969. Chem. 73 (2001) 170 – 176.
[70] S.P. Gygi, B. Rist, T.J. Griffin, J. Eng, R. Aebersold, J. Proteome Res. [83] H. Zhou, J.D. Watts, R. Aebersold, 2000.
1 (2002) 47 – 54. [84] M.J. MacCoss, W.H. McDonald, A. Saraf, R. Sadygov, J.M. Clark,
[71] S.P. Gygi, B. Rist, S.A. Gerber, F. Turecek, M.H. Gelb, R. Aebersold, J.J. Tasto, K.L. Gould, D. Wolters, M. Washburn, A. Weiss, J.I. Clark,
Nat. Biotechnol. 17 (1999) 994 – 999. J.R. Yates III, Proc. Natl. Acad. Sci. U. S. A. 99 (2000) 7900 – 7905.
[72] T.P. Conrads, G.A. Anderson, T.D. Veenstra, L. Pasa-Tolic, R.D. [85] M.P. Washburn, R. Ulaszek, C. Deciu, D.M. Schieltz, J.R. Yates III,
Smith, Anal. Chem. 72 (2000) 3349 – 3354. Anal. Chem. 74 (2002) 1650 – 1657.
[73] M.S. Lipton, L. Pasa-Tolic, G.A. Anderson, D.J. Anderson, D.L. [86] Y. Oda, K. Huang, F.R. Cross, D. Cowburn, B.T. Chait, Proc. Natl.
Auberry, J.R. Battista, M.J. Daly, J. Fredrickson, K.K. Hixson, H. Acad. Sci. U. S. A. 96 (1999) 6591 – 6596.
Kostandarithes, C. Masselon, L.M. Markillie, R.J. Moore, M.F. Ro- [87] X. Yao, A. Freas, J. Ramirez, P.A. Demirev, C. Fenselau, Anal. Chem.
mine, Y. Shen, E. Stritmatter, N. Tolic, H.R. Udseth, A. Venkateswar- 73 (2001) 2836 – 2842.
an, K.K. Wong, R. Zhao, R.D. Smith, Proc. Natl. Acad. Sci. U. S. A. [88] M. Munchbach, M. Quadroni, G. Miotto, P. James, Anal. Chem. 72
99 (2002) 11049 – 11054. (2000) 4047 – 4057.
[74] B.E. Chong, R.L. Hamler, D.M. Lubman, S.P. Ethier, A.J. Rosenspire, [89] M. Geng, J. Ji, F.E. Regnier, J. Chromatogr., B 870 (2000) 295 – 313.
F.R. Miller, Anal. Chem. 73 (2001) 1219 – 1227. [90] J. Ji, A. Chakraborty, M. Geng, X. Zhang, A. Amini, M. Bina, F.
[75] D.B. Wall, M.T. Kachman, S.S. Gong, S.J. Parus, M.W. Long, D.M. Regnier, J. Chromatogr., B, Biomed. Sci. Appl. 745 (2000) 197 – 210.
Lubman, Rapid Commun. Mass Spectrom. 15 (2001) 1649 – 1661. [91] O. Stemmann, H. Zou, S.A. Gerber, S.P. Gygi, M.W. Kirschner, Cell
[76] D.B. Wall, S.J. Parus, D.M. Lubman, J. Chromatogr., B, Anal. Tech- 107 (2001) 715 – 726.
nol. Biomed. Life Sci. 774 (2002) 53 – 58.
[77] V. Dancik, T.A. Addona, K.R. Clauser, J.E. Vath, P.A. Pevzner,
J. Comput. Biol. 6 (1999) 327 – 342.