Documente Academic
Documente Profesional
Documente Cultură
DOI 10.1007/s00253-012-3920-8
MINI-REVIEW
Received: 8 December 2011 / Revised: 19 January 2012 / Accepted: 20 January 2012 / Published online: 7 February 2012
# Springer-Verlag 2012
Abstract Studies in the structural biology of the multicomponent protein complex, metabolic engineering, and synthetic
biology frequently rely on the efficient over-expression of
these subunits or enzymes in the same cell. As a first step,
constructing the multiple expression cassettes will be a complicated and time-consuming job if the classic and conventional digestion and ligation based cloning method is used.
Some more efficient methods have been developed, including
(1) the employment of a multiple compatible plasmid expression system, (2) the rare-cutter-based design of vectors, (3) in
vitro recombination (sequence and ligation independent cloning, the isothermally enzymatic assembly of DNA molecules
in a single reaction), and (4) in vivo recombination using
recombination-efficient yeast (in vivo assembly of overlapping fragments, reiterative recombination for the chromosome
integration of foreign expression cassettes). In this review, we
systematically introduce these available methods.
Keywords Synthetic biology . Simultaneous expression .
Pathway construction . Yeast recombination . Cre-loxP sitespecific recombination . Acembl system
T. Wang : G. Du : J. Chen
State Key Laboratory of Food Science and Technology,
Jiangnan University,
Wuxi 214122 Jiangsu, China
X. Ma (*) : A. Li
School of Biotechnology, and State Key Laboratory of Bioreactor
Engineering, East China University of Science and Technology,
Shanghai 214122, China
e-mail: maxy@ecust.edu.cn
H. Zhu
Center for Bioengineering and Biotechnology,
China University of Petroleum (East China),
Qingdao 266555, China
Introduction
The necessity of heterogeneously expressing more than one
gene spontaneously can be understood in the following two
aspects. First, structural and functional studies of many multiprotein complexes depend on the efficient over-expression of
these recombinant proteins in suitable hosts (Yokoyama 2003;
Cowieson et al. 2008; Bieniossek et al. 2009; Perrakis et al.
2011); second, investigations in metabolic engineering and
synthetic biologyan active research field that is attracting
increasing attention from biological scientistsfrequently
involves the expression of metabolic enzymes in the pathway
of interest (Kaznessis 2007; Purnick and Weiss 2009; Na et al.
2010; Nandagopal and Elowitz 2011; Vick et al. 2011). With
the help of restriction endonuclease (RE) EcoR I and ligase
from E. coli, Cohen and colleagues performed the first plasmid construction in 1973 (Cohen et al. 1973). Since then,
digestionligation-based genetic operations have been one of
the most straightforward and effective practices in recombinant plasmid construction. One can easily insert a gene of
interest into any available plasmid vector if a restriction
enzyme can cut the vector and cannot cut the heterogeneous
gene. The usual incompatibility of the ends generated by
different restriction enzymes makes it possible to insert a
foreign fragment with the desired orientation in the plasmid
by digesting the gene of interest and vector with two different
restriction enzymes. However, the increase in the length of the
inserted sequences will lead to a significant decrease in the
number of usable restriction enzyme sites. Therefore, a plasmid vector suitable for the expression of one gene will be less
effective (if not entirely useless) due to the availability of
restriction enzyme sites in studies of structural biology, metabolic engineering, and synthetic biology, in which the spontaneous expression of multiple genes in one host is frequently
involved.
1854
Therefore, in metabolic engineering and synthetic biology study, methods that can efficiently assemble multiple
expression cassettes for the construction of the desired pathway in model microorganisms are highly desired (Wingler
and Cornish 2011). This conclusion can be justified by the
following facts: First, the traditional classic digestion and
ligation method depending on the available restriction
enzyme sites cannot fulfill the task of constructing recombinant plasmids with multiple inserts (Busso et al. 2011).
Second, at least currently, the investigations in metabolic
engineering and synthetic biology can mainly follow a trial
and error process (Carrera et al. 2009; McArthur and Fong
2010; Lee et al. 2011). It is difficult to succeed in constructing a pathway, which is tested to be efficient by following
only one theoretically designed reasonable pathway
(Purnick and Weiss 2009). From this sense, the current
metabolic engineering and synthetic biology are much similar to the case of protein engineering: Rational design of
mutant protein is more promising, and it is the direction that
protein engineering will go; however, many enzymes with
improved properties are frequently obtained by directed
evolution in which an essentially identical trial and
error technical route is followed. Therefore, several
trials (optimization) might be necessary. Although a
reasonable design will make it possible to get the
expected construct with fewer trials and errors, materializing the design by rationally putting suitable genetic elements together is always an indispensible step and cannot be
circumvented in investigations in metabolic engineering and
synthetic biology nowadays (over-expression of metabolic
enzymes is frequently involved) and in the future (suitable
expression for robust circuit operation and balanced metabolic
redistribution are more popular). If there is no effective method for constructing the different designs of the pathways, it
will be impossible to obtain a satisfactory one (Wingler and
Cornish 2011).
In this review, we will summarize the available methods,
which can be used for assembling multiple expression cassettes for metabolic engineering and synthetic biology
research, with the intention of providing a comprehensive
reference for readers. These methods include the employment of a compatible multiple plasmid system, rarecutter-based digestion and ligation, in vitro recombination embodied by sequence and ligation independent
cloning, and in vivo recombination utilizing yeast or
the Cre-loxP system.
1855
(III). Mixing the two DNA samples (with different molar ratios) in
buffer favoring the annealing between complementary single-stranded
DNA promotes the formation of recombinant plasmids with nicks.
These are repaired by the DNA repairing system in E. coli cells,
resulting in covalently closed circular DNA plasmids. In the figure,
the four small solid arrows indicate where nicks have been formed in
the DNA molecule. The two parts inside the box (dotted line) depict
regions that stabilize the recombinant plasmid before being transformed into competent cells
In vitro recombination
Ligation independent cloning
1856
product of the target gene (6). Mixing the DNA samples (3) and (6) in
ligation buffer enables the formation of recombinant plasmids with
nicks (7), which are subsequently repaired by E. coli enzymes after
transfer into competent cells (Li and Elledge 2007). b Spontaneous
construction of recombinant plasmids containing multiple fragments
with SLIC. Multiple fragments to be inserted into the same plasmid
vectors are amplified as products with ends homologous to each other,
using specially designed primers. Treatment with T4 DNA polymerase
generates single-stranded overhangs compatible with intermolecular
annealing. Mixing these products with linearized plasmid vectors in a
suitable buffer encourages the formation of recombinant plasmid in
which the homologous ends combine to ensure that all the fragments
assemble in the correct order
selective agar plate due to the self-circularization of the plasmid vector, although linearization with one enzyme is also
workable (Fig. 2). In SLIC, if the plasmid vector is linearized
with restriction enzyme digestion, incomplete digestion and
the self-circularization of the linearized plasmid might be the
main factors that lead to false-positive resistant colonies on
selective plates.
If PCR amplification is used to generate a linearized
plasmid vector, the target gene can be inserted into the
vector at any desirable site. However, the expression of
foreign genes inserted in the region beyond the ranges
defined by the designed MCS will require the necessary
regulatory elements, such as promoter, transcription terminator, etc. Because the plasmid is used as the template in
PCR, a subsequent treatment with Dpn I that will specifically degrade the methylated DNA should be carried out to
eliminate the false-positive transformants resulting from the
template plasmid.
With the SLIC method, the cloning of foreign sequences
into any sites of a vector can be realized. Thus, this can be
used to express more genes in one plasmid, which is routinely
used for the expression of one gene. This is very important,
especially for synthetic biology study. As reported by Li and
Elledge (2007), the PCR product amplified with a modified
PCR thermocycling program in favor of the generation of an
incomplete PCR product by omitting the last extension step
for several minutes also provided the ready materials for
SLIC. Meanwhile, an acceptable target gene fragment with
single-stranded ends could also be obtained by mixing and
reannealing the PCR products of the target gene amplified
with two set of primers, which would generate a product with
a 5- or 3 additional sequence, respectively (Tillett and Neilan
1999) (Fig. 3).
SLIC can be used to insert multiple fragments of DNA
into a desirable plasmid vector in an expected order when
these fragments were prepared in such a way as to make
them capable of carrying sequence homology to each other.
One can use necessities from different suppliers to carry out
the SLIC. In addition, several commercial kits have been
developed by different companies, for example, In-fusion
from Clontech, CloneEZTM from GenScript, GeneArt seamless cloning and assembly from Invitrogen, etc. The necessities of SLIC were provided as a mixture in the form of dry
powder or solution in an optimized setting for high efficiency.
The recA enzyme can promote the annealing between DNA
samples of low concentration ready for SLIC; improved efficiency can also be obtained by increasing the concentration of
each DNA fragment.
Notably, simply by placing the encoding sequences one
after another, it is very easy seamlessly to clone them together
in a defined order with these commercialized SLIC kits
because the 5-terminal of one gene is usually different from
the 3-terminal of another gene. The homology part of these
1857
1858
In vivo recombination
Yeast-based in vivo recombination
Unlike in vitro recombination, which requires the artificially
generated single-stranded ends of the substrate DNA molecules, in vivo recombination needs only the homology parts
with a reasonable length among the DNA fragments to be
joined together, greatly simplifying the process. The expression of multiple proteins can be achieved by constructing
the desired DNA sequences through in vivo recombination
from smaller fragments. For its relative simplicity and the availability of various methods for genetic operations,
recombination-efficient microorganism yeast is the first choice
for in vivo recombination(Krivoruchko et al. 2011). As one of
the model organisms, it is regarded as the E. coli of eukaryotes in
researches. Many shuttle vectors can rapidly propagate in E. coli
and efficiently integrate into the genome. In 1987, Ma et al.
reported the method for plasmid construction by homologous
recombination in yeast (Ma et al. 1987), a useful method that
was later generalized by Raymond et al. (Raymond et al. 1999).
In 2001, Gunyuzlu et al. proposed the linker-assisted homologous recombination for plasmid construction in yeast (Gunyuzlu
et al. 2001). The super recombination capability of yeast makes
it possible to assemble multiple fragments of it into one plasmid
spontaneously, greatly simplifying the steps when one gene has
to be expressed in different hosts, and other features of the
1859
al. 2010). These good examples have sufficiently demonstrated the feasibility of using an in vivo recombination
of yeast to DNA fragment that can express multiple
proteins for synthetic biology. As a powerful assembling
machine, the limitations of the assembly methods in
yeast remain unknown (Gibson et al. 2008b). However,
the recombination taking place in vivo requires the cell
to take up enough DNA fragments, and the required
amount of DNA for successful assembly will increase
when a larger final assembled product is expected. This
1860
1861
plasmid (MCS1 and MCS2), facilitating digestionligationbased cloning. Expression of foreign genes inserted into these
sites was controlled by the promoter p10 or polh. The two
expression cassettes were flanked by two unique restriction
sites, Pme I and Avr II, used to release the combined expression cassettes together from the recombinant plasmid. The
restriction enzymes have compatible ends (Pme I, BstZ 17I,
and Nur I; Avr II and Spe I), enabling the released double
cassette to be further inserted into another recombinant plasmid that already contains another two genes, digested by
enzymes whose sites are located in the multiplication module
(a sequence with BstZ 17I, Spe I, Cla I, Nru I sites). After this,
a recombinant plasmid containing four genes in independent
expression cassettes (each gene has its own promoter and
terminator) has been constructed. In this new recombinant
plasmid, the sites for Pme I and Avr II are retained and can
be used for transferring this four-gene fragment into a further
recombinant plasmid in the same way. The authors demonstrated that multiple gene expression cassettes constructed
using this method could be inserted into the MultiBac bacmid
by Cre-loxP site-specific recombination (Fitzgerald et al.
2006). In combination with the use of Tn7 transposition, it is
possible to achieve expression of mutant protein complexes
with different subunits, or modification of an expressed protein can be achieved.
In 2009, Bieniossek et al. described the Acembl system:
automated unrestricted multigene recombineering to achieve
(HES, acceptor vectors: I-CeuI and donor vector: PI-SceI) and Bs tXI are
used as flanking regions from promoter to terminator. The ends generated
by the I-Ceu I, PI-Sce I, or Bst XI are compatible, enabling the insertion of
multiple expression cassettes released with the homing endonuclease
enzyme and Bst XI, into the vector linearized by the homing endonuclease enzyme (I-Ceu I, PI-Sce I). Joining of the donor and acceptor vectors
can be achieved by Cre-loxP meditated site-specific recombination. Mixing recombinant donor and acceptor vectors in the presence of the Cre
enzyme promotes the formation of a fused plasmid. Positive fusion
plasmids can be selected based on replication origin (the donor vector
can only be propagated in pir expressing host) and antibiotics resistance
(Bieniossek et al. 2009)
1862
Reference
Aslanidis C, de Jong PJ (1990) Ligation-independent cloning of PCR
products (LIC-PCR). Nucleic Acids Res 18(20):60696074
Aslanidis C, de Jong PJ, Schmitz G (1994) Minimal length
requirement of the single-stranded tails for ligation-independent
cloning (LIC) of PCR products. PCR Methods Appl 4(3):172
177
Berger I, Fitzgerald DJ, Richmond TJ (2004) Baculovirus expression
system for heterologous multiprotein complexes. Nat Biotechnol
22(12):15831587. doi:10.1038/nbt1036
Bieniossek C, Nie Y, Frey D, Olieric N, Schaffitzel C, Collinson I,
Romier C, Berger P, Richmond TJ, Steinmetz MO, Berger I
(2009) Automated unrestricted multigene recombineering for
multiprotein complex production. Nat Methods 6(6):447450.
doi:10.1038/nmeth.1326
Busso D, Peleg Y, Heidebrecht T, Romier C, Jacobovitch Y, Dantes A,
Salim L, Troesch E, Schuetz A, Heinemann U, Folkers GE,
Geerlof A, Wilmanns M, Polewacz A, Quedenau C, Bussow K,
Adamson R, Blagova E, Walton J, Cartwright JL, Bird LE, Owens
RJ, Berrow NS, Wilson KS, Sussman JL, Perrakis A, Celie PH
(2011) Expression of protein complexes using multiple Escherichia
coli protein co-expression systems: a benchmarking study. J Struct
Biol 175(2):159170. doi:10.1016/j.jsb.2011.03.004
Carrera J, Rodrigo G, Jaramillo A (2009) Towards the automated
engineering of a synthetic genome. Mol Biosyst 5(7):733743.
doi:10.1039/b904400k
Cohen SN, Chang AC, Boyer HW, Helling RB (1973) Construction of
biologically functional bacterial plasmids in vitro. Proc Natl Acad
Sci U S A 70(11):32403244
Cowieson NP, Kobe B, Martin JL (2008) United we stand: combining
structural methods. Curr Opin Struct Biol 18(5):617622.
doi:10.1016/j.sbi.2008.07.004
Dennis K, Srienc F, Bailey JE (1985) Ampicillin effects on five recombinant Escherichia coli strains: implications for selection pressure
design. Biotechnol Bioeng 27(10):14901494. doi:10.1002/
bit.260271014
Dymond JS, Richardson SM, Coombes CE, Babatz T, Muller H,
Annaluru N, Blake WJ, Schwerzmann JW, Dai J, Lindstrom
DL, Boeke AC, Gottschling DE, Chandrasegaran S, Bader JS,
Boeke JD (2011) Synthetic chromosome arms function in yeast
and generate phenotypic diversity by design. Nature 477
(7365):471476. doi:10.1038/nature10403
Fitzgerald DJ, Berger P, Schaffitzel C, Yamada K, Richmond TJ,
Berger I (2006) Protein complex expression by using multigene
baculoviral vectors. Nat Methods 3(12):10211032. doi:10.1038/
nmeth983
Gibson DG, Benders GA, Andrews-Pfannkoch C, Denisova EA,
Baden-Tillson H, Zaveri J, Stockwell TB, Brownley A, Thomas DW,
Algire MA, Merryman C, Young L, Noskov VN, Glass JI, Venter JC,
Hutchison CA, Smith HO (2008a) Complete chemical synthesis,
assembly, and cloning of a Mycoplasma genitalium genome. Science
319(5867):12151220. doi:10.1126/science.1151721
Gibson DG, Benders GA, Axelrod KC, Zaveri J, Algire MA, Moodie
M, Montague MG, Venter JC, Smith HO, Hutchison CA (2008b)
One-step assembly in yeast of 25 overlapping DNA fragments to
form a complete synthetic Mycoplasma genitalium genome. Proc
Natl Acad Sci U S A 105(51):2040420409. doi:10.1073/
pnas.0811011106
Gibson DG, Young L, Chuang RY, Venter JC, Hutchison CA 3rd,
Smith HO (2009) Enzymatic assembly of DNA molecules up to
several hundred kilobases. Nat Methods 6(5):343345.
doi:10.1038/nmeth.1318
Gibson DG, Glass JI, Lartigue C, Noskov VN, Chuang RY, Algire MA,
Benders GA, Montague MG, Ma L, Moodie MM, Merryman C,
1863
Purnick PE, Weiss R (2009) The second wave of synthetic biology:
from modules to systems. Nat Rev Mol Cell Biol 10(6):410422.
doi:10.1038/nrm2698
Ramon A, Smith HO (2011) Single-step linker-based combinatorial
assembly of promoter and gene cassettes for pathway engineering.
Biotechnol Lett 33(3):549555. doi:10.1007/s10529-010-0455-x
Raymond CK, Pownder TA,Sexson SL (1999) General method for plasmid construction using homologous recombination. Biotechniques
26(1):134138, 140131.
Scheich C, Kummel D, Soumailakakis D, Heinemann U, Bussow K
(2007) Vectors for co-expression of an unrestricted number of
proteins. Nucleic Acids Res 35(6):e43. doi:10.1093/nar/gkm067
Storici F, Durham CL, Gordenin DA, Resnick MA (2003) Chromosomal
site-specific double-strand breaks are efficiently targeted for repair
by oligonucleotides in yeast. Proc Natl Acad Sci U S A 100
(25):1499414999. doi:10.1073/pnas.2036296100
Tillett D, Neilan BA (1999) Enzyme-free cloning: a rapid method to
clone PCR products independent of vector restriction enzyme
sites. Nucleic Acids Res 27(19):e26
Vick JE, Johnson ET, Choudhary S, Bloch SE, Lopez-Gallego F,
Srivastava P, Tikh IB, Wawrzyn GT, Schmidt-Dannert C (2011)
Optimized compatible set of BioBrick vectors for metabolic pathway engineering. Appl Microbiol Biotechnol 92(6):12751286.
doi:10.1007/s00253-011-3633-4
Wakamori M, Umehara T, Yokoyama S (2010) A series of bacterial coexpression vectors with rare-cutter recognition sequences. Protein
Expr Purif 74(1):8898. doi:10.1016/j.pep.2010.06.016
Wang TW, Zhu H, Ma XY, Zhang T, Ma YS, Wei DZ (2006) Mutant
library construction in directed molecular evolution: casting a
wider net. Mol Biotechnol 34(1):5568. doi:10.1385/MB:34:1:55
Wang HH, Isaacs FJ, Carr PA, Sun ZZ, Xu G, Forest CR, Church GM
(2009) Programming cells by multiplex genome engineering and
accelerated evolution. Nature 460(7257):894898. doi:10.1038/
nature08187
Wingler LM, Cornish VW (2011) Reiterative recombination for the in
vivo assembly of libraries of multigene pathways. Proc Natl Acad
Sci U S A 108(37):1513515140. doi:10.1073/pnas.1100507108
Wladyka B, Puzia K, Dubin A (2005) Efficient co-expression of a
recombinant staphopain A and its inhibitor staphostatin A in Escherichia coli. Biochem J 385(Pt 1):181187. doi:10.1042/BJ20040958
Yang W, Zhang L, Lu Z, Tao W, Zhai Z (2001) A new method for protein
coexpression in Escherichia coli using two incompatible plasmids.
Protein Expr Purif 22(3):472478. doi:10.1006/prep.2001.1453
Yokoyama S (2003) Protein expression systems for structural
genomics and proteomics. Curr Opin Chem Biol 7(1):3943.
doi:S1367593102000194