Documente Academic
Documente Profesional
Documente Cultură
The rational design of artificial enzymes, either by applying physico–chemical intuition of protein structure and function or with
the aid of computational methods, is a promising area of research with the potential to tremendously impact medicine, indus‑
trial chemistry and energy production. Designed proteins also provide a powerful platform for dissecting enzyme mechanisms
of natural systems. Artificial enzymes have come a long way from simple α‑helical peptide catalysts to proteins that facilitate
multistep chemical reactions designed by state-of-the-art computational methods. Looking forward, we examine strategies
employed by natural enzymes that could be used to improve the speed and selectivity of artificial catalysts.
I
n the nineteen-fifties and sixties, the advent of the semiconductor remarkable selectivity, rate-enhancements and product specificity
transistor and the integrated circuit transformed digital comput- of natural enzymes under aqueous conditions warrants more work
ers from powerful curiosities into pragmatic, cost-effective tools. in developing powerful molecular design technologies.
Along with advances in numerical methods, computers revolution- The complexities of enzyme design can be quite daunting.
ized the design and construction of aircraft, allowing engineers to Examining high-resolution structures of natural enzyme–substrate
simulate complex, nonlinear systems that integrated aerodynamics, complexes reveals that the conformations of active-site amino acids
propulsion, control and so on, thereby pushing aircraft technology are poised to facilitate catalysis. Second-shell interactions tune the
well beyond what was possible with previous analytical models. reactivity of the active site through networks of direct interactions with
Today, a Boeing 747 is an incredibly complex machine with over primary ligands and long-range electrostatic forces. Designing such
6,000,000 parts. As such, computers have become indispensable in molecules from scratch presents a host of computational challenges
the aerospace industry. Although much smaller in size, the mecha- (see Box 1). Accurate modelling of important forces in the active site
nistic complexity of enzymes and challenges associated with their requires quantum mechanical (QM) calculations. Unfortunately, it is
design (Box 1) suggest that they are as sophisticated as passenger not feasible to perform QM calculations on molecules the size of even
airliners, and it is expected that computational methods in chemistry the smallest enzymes. Design also requires the rapid evaluation of a
and biology will promote a similar revolution in the design of artifi- large number of candidate structure/sequence combinations and QM
cial catalysts. calculations are very demanding on computational resources. Effective
The promise of constructing enzymes that are capable of effi- treatment of water molecules and their interactions with active-site
ciently catalysing virtually any chemical reaction is a tremendous residues and reactants can also significantly increase the complexity
motivator for researchers in the protein-design field. Enzymes cata- of calculations. Along with high-resolution design of the active-site
lyse difficult chemical reactions in mild, aqueous environments, residues, candidate enzymes must maintain overall structural integ-
often with a speed and specificity unrivalled by synthetic catalysts. rity, and in some cases, incorporate large-scale protein motions that
Designing an enzyme from scratch is also the most rigorous way of may support catalysis. Integrating all of these factors into a design is a
testing our understanding of how natural enzymes function. Several formidable challenge.
recent designs have been stripped-down or rebuilt versions of natu- It is reasonable to ask what one gains through such sophisticated
ral enzymes, providing powerful tools for dissecting molecular con- computation. After all, a number of novel folds and catalytically
tributions to enzyme structure and reactivity. active proteins have been built without such tools. In this review, we
Enzyme design is inextricably linked with the design of protein survey several designed, artificial enzymes that have been developed
structure. Advances in protein design are often rapidly followed by with varying degrees of computational involvement. These include
attempts to apply new technologies to artificial enzymes. Therefore, de novo enzymes, where both the protein topology and the active
this is as much a review of protein fold design as of catalyst design. site are built from scratch, and active-site design, where surfaces and
However, it should be noted that complex protein topologies are not cavities on existing proteins are repurposed for catalysis. We build
a prerequisite for catalysis. Proline alone can catalyse a remarkable on previous reviews of this field (for example, see ref. 3) by includ-
array of reactions, including aldolase-like formations of carbon– ing a deeper discussion of computational challenges associated with
carbon bonds through enamine intermediates with high yields and enzyme design. Additionally, we look to the future of design, such as
substantial product enantiomeric excess. Other processes includ- introducing multiple substrates, protein motion, allostery into arti-
ing asymmetric epoxidations and acylations are achievable using ficial enzymes, and expanding protein design principles to a broader
short peptides. The impressive catalytic properties of proline and class of structured, catalytically active polymers.
small peptides have been extensively reviewed previously 1,2 and are
not covered here. Few designed enzymes have achieved the cata- The helix and the enzyme
lytic utility of such small peptides, and much remains to be done A fundamental paradigm in biochemistry is the link between a
before designer enzymes find practical applications. However, the protein’s function and its three-dimensional fold, which in turn
1
Robert Wood Johnson Medical School - UMDNJ Biochemistry, Center for Advanced Biotechnology and Medicine, 679 Hoes Lane West, Piscataway, New
Jersey 08854, USA; 2City College of New York Physics, 160 Convent Avenue, New York, New York 10031, USA. *e-mail: nanda@cabm.rutgers.edu
dro
amino acids are nonpolar ( ) and the rest polar ( ), will charac-
phobic fa
c
amphipathic α‑helices; nonpolar side-chains associate between heli-
ce
Motif III O
Asp
H
Hydrophobic
pocket N
O OH N His O O
Lys H +
NH O
O O
His
Asp
Lys
Figure 6 | Assembly line for the ROSETTA Enzymes. a, Sketch of a reaction motif, outlining key intermediates, general acid/base ligands and strategies
for modulating catalytic residue pKas. b, QM calculations are used to optimize the geometry of a transition-state model including truncated active-site
residues. c, Elaboration of catalytic residue rotamers creates an ensemble of active sites. d, These sites are matched to complementary surfaces on a
family of target protein scaffolds. e, Promising designs are synthesized and characterized for activity. Figure reproduced from ref. 111, © 2008 NPG.
rests on the assumption that structure and stability information is intermediates to cleave the substrate and regenerate the active site.
implicitly encoded locally within each fragment 34. The global stabil- The reaction proceeded through an imine intermediate involving
ity of a design is evaluated based on a scoring potential that com- a lysine as a Schiff base, similar to the oxaldie decarboxylation of
bines physics-based energy terms such as van der Waals packing oxaloacetate. In oxaldie, the lysine nucleophile was stabilized through
and knowledge-based energy terms derived from statistical analysis electrostatic interactions with other charged side chains or the helix
of amino acid interactions with the PDB66. Recently, ROSETTA was macrodipole. The same mechanism was used in the first of four reac-
used to develop artificial enzymes that catalysed a retro-aldol reac- tion motifs attempted by placing a second lysine in the vicinity of
tion (Fig. 6) and a Kemp elimination67,68. These designs were impres- the first. The other three used a hydrophobic pocket to lower the
sive in the extent to which the relationship between structure and pKa of the lysine. A general acid/base was included to trigger cleav-
reactivity was modelled and characterized. age of the carbon–carbon bond. Each motif used a different base:
In the retro-aldolase, the goal was to break a carbon–carbon bond I, a Lys/Asp dyad; II, tyrosine; III, a His/Asp; and IV, a water mol-
in a non-natural substrate, 4‑hydroxy‑4-(6-methoxy‑2-napthyl)‑2- ecule. Attempting designs based on several reaction motifs not only
butanone69. The intended reaction was significantly more complex increases the chance of a successful outcome, but also demonstrates
than previous designs, involving multiple transition states and how design can be used to test various hypotheses for catalysis.
Timing in multiple substrate reactions. An important facet of Allostery. Cooperative phenomena are fundamental in many bio-
the mechanism of many of nature’s more complicated catalysts is logical functions such as oxygen transport, metabolic and transcrip-
the temporal control of substrate binding and transfer events. One tional regulation97. The ability to incorporate allosteric regulation
merely has to look at the exquisite engineering apparent in the into artificial enzymes will enable the creation of medicinally useful
multiple electron and proton transfers in photosynthesis, respira- in vivo catalysts that can be regulated either by metabolites or using
tion and nitrogen fixation to see the advantages conferred by the exogenous small-molecule effectors. Such behaviour is in most
ability to control the timing and energetics of the intermediates in cases a more complicated version of the large-scale conformational
multiple-substrate reactions88. As these reaction mechanisms would switching described above, only in this case the large-scale protein
require the simultaneous optimization of several different catalytic motion is actuated by the binding of an effector molecule.
events at several different sites, it is unlikely that such would arise
from laboratory timescale evolution. A more likely situation is one Keeping energetic intermediates from the cellular environment.
where an initial enzymatic scaffold is designed explicitly and then Anyone who has observed a bioinorganic chemist at work in a dry
the kinetics and thermodynamics of the intermediates are further glovebox can appreciate the fragility of metalloenzyme active sites.
optimized using directed evolution. Such enzymes employ reactive intermediates that must be screened
from water, oxygen and reactive species in the cellular environment
Small-scale enzyme motions. The analysis of hydrogen tunnelling such as glutathione and superoxide. In fact, the failure of many ini-
in enzymes has established unequivocally that protein dynamics tial attempts at metalloenzyme design can be attributed to the desire
can play a large part in rates of enzymatic catalysis89. Computational to make these model proteins as small as possible98. Larger proteins
analysis of enzyme–substrate complexes using molecular mechanics have sufficiently sized hydrophobic cores that they can completely
has estimated the role of the dynamics of ‘near attack complexes’ in encapsulate these intermediates in a non-reactive environment,
catalytic function in the absence of atomic tunnelling to be as high screening them from solution and lengthening their lifetimes.
as 103 (ref. 90). However, these nanosecond timescale motions are
an intrinsic property of all biopolymers. It is not clear to what extent Catalytic foldamers
choreographed fast dynamics promotes catalysis in different natural Despite the intensive focus on protein enzymes, the first catalytic
enzymes. A recent report of the dynamic analysis of each interme- biomolecules may have been based on RNA rather than amino
diate state in the catalytic mechanism of dihydrofolate reductase91 acids99. Nucleotides can carry out metal-assisted reactions, act as
demonstrated that in each kinetic intermediate, the protein accessed general acid–bases and function to orient substrates and isolate
only the conformations present in the current state and the states them from solvent, much like the most sophisticated catalytic pro-
immediately before and after. These findings indicate that for at least teins100. Catalytic activity has also been found in certain bacterial
some enzymes such dynamics are considerably choreographed. carbohydrates101. Evidently, proteins do not have a monopoly on
Because the origins and even the consequences of these motions biological catalysis. Although the successful design of a new pro-
are not well understood, this contribution to catalysis will be dif- tein is a rewarding experience in itself, often real advances are in
ficult to reproduce. As the optimization brought about by directed our understanding of the basic molecular forces that guide struc-
evolution techniques is often a result of mutations distal to the active ture and function. Given that catalysis is not limited to proteins,
site that may impact protein motions, it seems that this method is it is important for us to ask whether insights gained from current
currently the best approach for increasing catalytic efficiency with de novo proteins are idiosyncratic to proteins, or whether we are