Documente Academic
Documente Profesional
Documente Cultură
26501–26507, 2002
© 2002 by The American Society for Biochemistry and Molecular Biology, Inc. Printed in U.S.A.
Received for publication, April 3, 2002, and in revised form, May 3, 2002
Published, JBC Papers in Press, May 6, 2002, DOI 10.1074/jbc.M203183200
Toby H. Richardson‡, Xuqiu Tan, Gerhard Frey, Walter Callen, Mark Cabell,
David Lam, John Macomber, Jay M. Short, Dan E. Robertson, and Carl Miller
From the Diversa Corporation, San Diego, California 92121
High throughput screening of microbial DNA libraries rium Bacillus licheniformis (1–3). This enzyme operates opti-
was used to identify ␣-amylases with phenotypic char- mally at 90 °C and pH 6, and it requires addition of calcium
acteristics compatible with large scale corn wet milling (Ca2⫹) for its thermostability (4), conditions that are substan-
process conditions. Single and multiorganism DNA li- tially different from those encountered in the various industrial
braries originating from various environments were processes where the enzyme is utilized. The disparity between
targeted for activity and sequence-based screening ap- these industrial requirements and the native environment for
proaches. After initial screening, 15 clones were desig- the ␣-amylase results in sub-optimal enzymatic performance in
nated as primary hits based upon activity at pH 4.5 or many applications.
95 °C without addition of endogenous Ca2ⴙ. After fur- Corn wet milling is an example of a multistep industrial
ther characterization, three enzyme candidates were
have been collected from a myriad of natural environments, designated BD5031, BD5064, and BD5063 (internal classifica-
many of which display environmental variables of extreme pH, tion labels) were selected. These enzymes each had near-opti-
temperature, pressure, organic contaminants, salt concentra- mal characteristics for the corn wet milling process. The se-
tion, etc. The DNA representing the microbial constituents of quences of these three proteins are shown in Fig. 1. BD5031
the sample is processed into complex gene libraries. These and BD5064 (both 461 amino acids in length) showed high
libraries, made from DNA extracted from soils or aqueous levels of sequence identity to other hyperthermophilic ␣-amy-
environments, are screened for genes coding for enzymes of lases and were most similar (95 and 85% amino acid identity,
desired phenotypes using targeted expression assays and se- respectively) to the ␣-amylase of Pyrococcus sp. KOD1 (12).
quence homology-based methods (21). BD5063 shared significant identity to both BD5031 and
To target discovery of new ␣-amylases applicable to the corn BD5063 (88 and 90% amino acid identity, respectively). Phylo-
wet milling process, libraries containing the genomes of be- genetic analysis revealed that all three enzymes were members
tween 1 and 15,000 organisms were screened either by enzy- of the glycosyl hydrolase family 13 (17) and are likely to be from
matic activity or sequence homology to known ␣-amylases. organisms closely related to the order of Thermococcales.
These technologies enable screening of 105–109 clones/day from Characterization of Wild-type ␣-Amylases—␣-Amylases were
gene libraries using pH and temperature conditions approxi- expressed in a P. fluorescens host using a proprietary cloning
mating those of the corn wet milling process. ␣-Amylase ex- vector. BD5031 was produced with and without its signal pep-
pression screening was performed in semi-solid agar plates or tide sequence.
in liquid-based format in microtiter plates using azo-dyed In order to simulate the first step of the corn wet milling
starch as the substrate (see “Experimental Procedures”). process, heat-treated cell lysates were tested for their ability to
Clones with the highest activity on the azo-dyed starch were liquefy starch under pH conditions ranging from 4.25 to 6.25
sequenced. Open reading frames were subjected to homology (Fig. 2). The pH optima of the three wild-type ␣-amylases,
searches (BLAST) using the non-redundant gene data base at BD5031, BD5063, and BD5064, were compared with the com-
the National Center for Biotechnology Information. monly used B. licheniformis ␣-amylase. Each of these three
Libraries also were screened using degenerate PCR primers; newly discovered ␣-amylases had superior characteristics com-
in brief these primers were designed to incorporate sequences pared with the Bacillus enzyme, but BD5063 had the greatest
found at the N and C termini of known hyperthermophilic activity at pH 4.5 and a temperature of 105 °C.
␣-amylases. By using these probes, single organism or environ- The relative level of ␣-amylase expression was found to be
mental DNA libraries were screened for sequence homologues. BD5031 ⬎ BD5064 ⬎ BD5063 (Table I). As with other heter-
Homologous DNA inserts were sequenced and re-cloned for ologously expressed archaeal amylases reported previously
expression, and their ␣-amylase activity was verified. (22), the majority of the ␣-amylase protein expressed in the
Following this concerted discovery effort, three enzymes, Pseudomonas host was found as insoluble aggregates.
26504 Discovery of a Low pH Thermostable ␣-Amylase
TABLE II
Analysis of fragments of reassembled ␣-amylase clones
25 random clones and 20 up-mutants were sequenced. The composi-
tion of the clones from the three different wild-type enzymes is shown
(in percent). In fragments I–VII, one parent is preferred over the other
two in the improved clones (ⱖ50%; marked in gray). The derived con-
sensus composition as well as the composition of two improved clones
discussed in the text is shown at the bottom.
Expressiona ⬎30% ⬃20% ⬃5–10% Primary, Secondary, and Tertiary Screening of Reassembled
Relative activityb 15% 21% 100% ␣-Amylases—To identify enhanced ␣-amylases from the pool of
pH optimum 5.25 5.25 4.5
Growth in fermenterc ⬎200 ⬃30 ⬃80 reassembled clones, a screening strategy was designed to iden-
a
tify clones that exhibited high activity at pH 4.5 based upon
The levels of expression are the % of total protein expressed in a
normalized cell density. The protocol for the primary high
Pseudomonas host.
b
The relative activity represents whole cell activity at a per cell level throughput assay is shown schematically in Fig. 3. In brief, the
at pH 4.5 relative to BD5063. reassembled clones were individually distributed into 384-well
c
Final A575. plates at an average inoculum of 1 cell per well and incubated
for 24 h. The cells were then lysed with 6 M urea for 1 h to
solubilize all ␣-amylase protein. An aliquot of cell lysate was
After scaling to 10-liter fermenters, the pH and protein ex- transferred to a well in the screening plate containing Remazol
pression characteristics of all three enzymes were similar com- Brilliant Blue (RBB)-labeled cornstarch in 50 mM NaOAc
pared with those measured previously in smaller shake flasks buffer at pH 4.5. The assay plates were incubated at 90 °C for
(25 ml of media in a 250-ml flask). In the larger fermenter, 20 min. Following an ethanol precipitation step, enzymatic
BD5063 continued to maintain the best pH optimum (meas- activity (release of RBB) was assessed by measuring absorb-
ured as liquefaction activity at the target pH) but was produced ance at 595 nm.
at much lower levels than the other two enzymes. Detectable amylase activity was found in ⬃40% of the reas-
Reassembly of Amylases—In order to optimize the ␣-amylase sembled clones. 145 daughter clones showed an increased ac-
productivity and expression phenotypes to better suit the com- tivity on a per cell basis at pH 4.5, when compared with the
mercial process, gene reassembly was performed using the three parental ␣-amylases in the primary screen. Seventy one
three wild-type genes (Fig. 1) as parental sequence. Pooled of the best up-mutants were grown in shake flasks and sub-
fragments were ligated to produce chimeric ␣-amylase genes. jected to a secondary screen for starch liquefaction. From the
Ligation of fragments from different parents with identical liquefaction screen a total of 34 clones was confirmed to have
overhangs was predicted to produce all possible combinations higher liquefaction activity at pH 4.5 when compared with the
with equal probability; the possible number of combinations of three wild-type enzymes on a per cell basis (data not shown).
nine fragments from three different parents is 39 ⫽ 19,683, The ␣-amylase activity in each of these 34 clones was also
with only 3 of the 19,683 combinations regenerating the wild- tested for improved thermostability at pH 4.5.
type sequences. Twenty two of the 34 reassembled enzymes exhibited im-
The reassembled genes were ligated into an E. coli/Pseudo- proved thermostability at 90 °C in the absence of added cal-
monas shuttle vector and introduced into E. coli by transfor- cium when compared with BD5063, the most stable of the
mation. Plasmid DNA from ⬃21,000 pooled colonies was iso- wild-type parental ␣-amylases. Similarly, 20 reassembled
lated and introduced into P. fluorescens also by transformation. clones exhibited greater thermostability at 100 °C in the pres-
Twenty five random clones from the primary reassembled li- ence of 40 mg/liter calcium when compared with the wild-type
brary were sequenced to determine the distribution of the 27 ␣-amylase BD5063 (data not shown).
parental fragments among the daughter clones and to verify Twenty seven of the 34 clones described above were further
the efficacy of the reassembly protocol (Table II). Each clone evaluated for their capacity to express ␣-amylase activity
had a unique sequence and exhibited an apparently random during large scale fermentation. Two of the reassembled
distribution of all parental sequence fragments with the excep- clones, BD5088 and BD5096, were robust and grew to high
tion of fragment IV. cell density (⬎200 A575). By using these two clones, the ␣-am-
Discovery of a Low pH Thermostable ␣-Amylase 26505