Documente Academic
Documente Profesional
Documente Cultură
Louise V. Wain,1 Elizabeth Bailes,1 Frederic Bibollet-Ruche,2 Julie M. Decker,2 Brandon F. Keele,2
Fran Van Heuverswyn,3 Yingying Li,2 Jun Takehisa,2 Eitel Mpoudi Ngole,4 George M. Shaw,2
Martine Peeters,3 Beatrice H. Hahn,2 and Paul M. Sharp1
1
Institute of Genetics, University of Nottingham, Queens Medical Centre, Nottingham, United Kingdom; 2Departments of Medicine
and Microbiology, University of Alabama at Birmingham; 3Laboratoire Retrovirus, Institut de Recherche pour le Développement and
Department of International Health, University of Montpellier, 34394 Montpellier cedex 9, France; and 4Projet Prevention du Sida au
Cameroun (PRESICA), Yaounde, Cameroun
Human immunodeficiency virus type 1 (HIV-1) originated from three independent cross-species transmissions of simian
immunodeficiency virus (SIVcpzPtt) infecting chimpanzees (Pan troglodytes troglodytes) in west central Africa, giving
rise to pandemic (group M) and non-pandemic (groups N and O) clades of HIV-1. To identify host-specific adaptations
in HIV-1 we compared the inferred ancestral sequences of HIV-1 groups M, N and O to 12 full length genome sequences
of SIVcpzPtt and four of the outlying but closely related SIVcpzPts (from P. t. schweinfurthii). This analysis revealed
a single site that was completely conserved among SIVcpzPtt strains but different (due to the same change) in all three
groups of HIV-1. This site, Gag-30, lies within p17, the gag-encoded matrix protein. It is Met in SIVcpzPtt, underwent
a conservative replacement by Leu in one lineage of SIVcpzPts but changed radically to Arg on all three lineages leading
to HIV-1. During subsequent diversification this site has been conserved as a basic residue (Arg or Lys) in most lineages
of HIV-1. Retrospective analysis revealed that Gag-30 had reverted to Met in a previous experiment in which HIV-1 was
passaged through chimpanzees. To examine whether this substitution conferred a species specific growth advantage, we
used site-directed mutagenesis to generate variants of these chimpanzee-adapted HIV-1 strains with Lys at Gag-30, and
tested their replication in both human and chimpanzee CD4þ T lymphocytes. Remarkably, viruses encoding
Met replicated to higher titers than viruses encoding Lys in chimpanzee T cells, but the opposite was found in human
T cells. Taken together, these observations provide compelling evidence for host-specific adaptation during the
emergence of HIV-1 and identify the viral matrix protein as a modulator of viral fitness following transmission to the new
human host.
SIVcpzMB897 JTT matrix and gamma distributed rates at sites. The HIV-1
SIVcpzMB66 group N ancestral sequence was obtained by a similar anal-
SIVcpzLB7 ysis of five genome sequences, with SIVcpzPtt/EK505 used
to root the phylogeny.
HIV-1 M
We searched for species-specific signatures in align-
HIV-1 M ments of Gag, Pol, Vif, Vpr, Vpu, Tat, Rev and Nef pro-
SIVcpzEK505 tein sequences of 19 viral strains (twelve SIVcpzPtt, four
HIV-1 N SIVcpzPts, and the three HIV-1 group ancestral sequences).
HIV-1 N We were looking for sites that are highly conserved among
SIVcpzMT145 SIVcpzPtt strains, but since any of the SIVcpz sequences
may have been derived from a defective viral genome,
SIVcpzCAM5
we allowed for some deviation from perfect conservation
SIVcpzCAM3
among all sequences; this same concern does not apply
SIVcpzDP943 to HIV-1 group ancestor sequences. Thus, we searched
SIVcpzUS for sites where the amino acid was conserved among at least
SIVcpzCAM13 nine of the twelve SIVcpzPtt sequences, and identical
SIVcpzGAB1 among all three HIV-1 group ancestors, but differed be-
SIVcpzGAB2 tween HIV-1 and the consensus sequence of SIVcpzPtt.
HIV-1 O
The degree of biochemical dissimilarity among amino
acids was assessed by reference to Grantham’s chemical
HIV-1 O
distances. These distances combine information on chem-
SIVgor ical composition, polarity and molecular volume, and are
SIVcpzTAN1 highly correlated with amino acid replacement frequencies
SIVcpzTAN2 (Grantham 1974). The values for the 190 pair-wise compar-
SIVcpzTAN3 isons among the 20 amino acids range from 5 to 215, with
SIVcpzANT a mean of 100.
FIG. 1.—The three independent origins of HIV-1. Red arrows
indicate the branches on which ape-to-human transmissions can be
inferred to have occurred, and black circles denote the ancestors of HIV-1 Cell Lines, Viruses and Virus Stocks
groups M, N and O. Colours indicate SIVcpz strains from Pan
troglodytes troglodytes (red) and P. t. schweinfurthii (blue). The tree 293T and JC53BL13 cells were maintained in Dulbec-
represents a strict consensus of phylogenies derived from four major co’s modified Eagle’s medium (DMEM) þ 10% heat inac-
regions of the proteome (Keele et al. 2006). The evolutionary relation-
ships among SIVcpzPtt strains, and among SIVcpzPts strains, vary tivated fetal bovine serum (FBS). Replication competent
depending on the region of the genome used for analysis, presumably molecular clones of chimpanzee-passaged HIV-1 (JC16
reflecting recombination events in the past. However, the distinction and NC7) were obtained from Dr. Novembre (Mwaengo
between the SIVcpzPtt and SIVcpzPts lineages, and the positions of HIV- and Novembre 1998). Viral stocks of JC16 and NC7 were
1 groups M (close to SIVcpzPtt strains LB7, MB66 and MB897), N
(close to SIVcpzPtt strain EK505) and O (as an outgroup to the known
generated by transfecting full-length proviral plasmids into
SIVcpzPtt strains) are constant. The dashed line indicates the position of 293T cells using FuGene 6 (Roche Applied Science,
SIVgor based on partial sequences (Van Heuverswyn et al. 2006); note Indianapolis, IN). Seventy-two hours post-transfection,
that HIV-1 group O may have been transmitted from chimpanzees to virus-containing supernatants were clarified by low speed
humans via gorillas. centrifugation and frozen in aliquots at -70C. The infectivity
of virus stocks was determined using the JC53BL13 assay as
described previously (Derdeyn et al. 2000; Wei et al. 2002).
Phylogenetic relationships were inferred by the Bayesian To change the amino acid at Gag-30 in JC16 and NC7
method (Yang and Rannala 1997) implemented in MrBayes from a Met to a Lys, site-directed mutagenesis was per-
(Huelsenbeck and Ronquist 2001), using the JTT model of formed using the QuickChange II XL kit (Stratagene, La
amino acid substitution (Jones et al. 1992) and a gamma Jolla, CA) and the primers JCNC-M30K-F (5’-GGAAA
distribution of among-site rate variation with four categories GAAAAAATATAAGTTAAAACATATAGTATGG-3’)
(Yang 1994); analyses were run for 1 million generations. and JCNC-M30K-R (5’-CCATACTATATGTTTTAACT
The HIV-1 group M ancestral sequence was obtained TATATTTTTTCTTTCC-3’).
from the HIV Sequence Database (www.hiv.lanl.gov). The Plasmids were grown in STBL2 bacteria (Invitrogen,
group O ancestral sequence was inferred from 13 full length Carlsbad, CA) as described by Takehisa et al. (2007).
genome sequences; sequences previously shown to be re-
combinant (Yamaguchi et al. 2003) were excluded from
this analysis. (Accession numbers of all sequences used
Isolation and Activation of Chimpanzee and Human
are provided in the Supplement.) A phylogeny estimated PBMCs
from concatenated Gag, Pol and Env sequences, rooted
by reference to SIVcpzPtt sequences, was used to find Blood was obtained from healthy HIV-1-negative hu-
the maximum likelihood estimate of the ancestral group man volunteers (Research Blood Components, Boston,
O sequence using Codeml from the PAML package (Yang MA), as well as HIV/SIV-uninfected chimpanzees housed
1997), under the assumption of a molecular clock using the at the Yerkes Primate Research Center in accordance with
Adaptation of HIV-1 to its human host 1855
SIVcpzMB897
100
SIVcpzMB66
80
88 SIVcpzLB7
100 100 HIV-1 M
HIV-1 M
SIVcpzEK505
100
100 HIV-1 N
99
HIV-1 N
SIVcpzMT145
100
SIVcpzCAM5
86 SIVcpzCAM3
99
100 SIVcpzDP943
100
100 SIVcpzUS
100 SIVcpzCAM13
SIVcpzGAB1
96 SIVcpzGAB2
100 HIV-1 O
HIV-1 O
SIVcpzTAN1
100 SIVcpzTAN2
FIG. 2.—Species-specific adaptive changes in the matrix protein SIVcpzTAN3
(Gag p17). Sequences from a region in the N-terminal basic domain of SIVcpzANT
0.05
HIV-1/SIVcpz matrix proteins reveal a site (boxed) that differs between
the HIV-1 group ancestors (R, Arg) and chimpanzee viruses (M, Met/L, FIG. 3.—Phylogeny of the SIVcpz/HIV-1 clade derived from anal-
Leu). Dashes indicate amino acid identity to the sequence at the top. ysis of Gag protein sequences. SIVcpzPtt and SIVcpzPts strains are in red
Clones JC16 and NC7 were isolated from two chimpanzees experimen- and blue, respectively. The representative HIV-1 sequences used were
tally infected with HIV-1 (Mwaengo and Novembre 1998). U455 and LAI (group M), YBF30 and YBF106 (group N) and ANT70
and MVP1580 (group O). A posteriori probabilities (expressed as percent)
are shown above branches. The scale bar represents 0.05 amino acid
To assess how unusual the observation at Gag-30 is, replacements per site.
we searched for any sites in the Gag alignment where,
within the SIVcpzPtt/HIV-1 clade, any three sequences
shared one residue while all 12 other sequences shared an- ical distances than the change at Gag-30 (table 2). In light of
other residue; in addition, we required that the residue these data, the finding of a non-conservative Met-to-Arg
found in the three sequences should not be present in change on all three (and only those three) branches leading
any of the SIVcpzPts sequences. After excluding sites with to the human viruses is quite remarkable and highly un-
a gap in any sequence, the alignment contained 450 sites. likely to represent a chance occurrence.
This analysis identified six sites, including Gag-30, the site
already found (table 2). However, consideration of the phy-
logenetic relationships among the gag genes of these viral Evolution of HIV-1 in Experimentally
strains (fig. 3) indicates that four of these sites require fewer Infected Chimpanzees
than three independent substitutions. SIVcpzPtt strains To investigate further whether the observed replace-
GAB1 and CAM13 form a monophyletic pair, and replace- ment at Gag-30 was indicative of host-specific adaptation,
ments at Gag-19 and Gag-380 likely occurred in their com-
mon ancestor. Similarly, a replacement at Gag-52 likely
occurred in the common ancestor of SIVcpzPtt strain Table 2
EK505 and HIV-1 group N. Thus, these three sites require Sites in Gag Differing between Three HIV-1/SIVcpzPtt
only two independent substitutions. Only one replacement Sequences and All Others
is required at site Gag-231, since SIVcpzPtt strains CAM3, Sitea Set 1b AA Set 2b D1c SIVcpzPts D2c
CAM5 and DP943 form a monophyletic clade (fig. 3). The
remaining site, Gag-28, would require three independent Gag-19 GAB1, CAM13, LB7 Val Ile 29 Ile -
Gag-28 GAB1, GAB2, LB7 Arg Lys 26 Lys -
replacements, although it is equally parsimonious to invoke Gag-30 M, N, O Arg Met 91 Met/Leu 15
three independent Lys-to-Arg changes on terminal branches Gag-52 EK505, MB897, N Asp Glu 45 Glu -
of the tree, or two Lys-to-Arg changes and a reversion of Gag-231 CAM3, CAM5, DP943 Ala Pro 27 Pro -
Arg-to-Lys on the terminal branch to CAM13. Thus only Gag-380 GAB1, CAM13, US Lys Arg 26 Arg/Gly 125
one other site, in addition to Gag-30, was found that a
sites numbered according to the reference strain HIV-1/HXB2.
may have undergone the same replacement on three termi- b
Set 1 are the three sequences from the SIVcpzPtt/HIV-1 clade sharing the
nal branches of the tree. These replacements involving ba- unusual residue; M, N and O refer to HIV-1 groups, while the other sequences are
sic residues at Gag-28 are much more conservative (D 5 all SIVcpzPtt strains. Set 2 shows the amino acid found in the remaining 12
sequences from the SIVcpzPtt/HIV-1 clade.
26) than the Met-to-Arg change at Gag-30, and likely reflect c
D values are Grantham’s amino acid distances. D1 is the distance between
recurrent neutral switches. The other sites identified in this the residues found in Set1 and Set 2. D2 is the distance between the two residues
analysis also involve amino acids with much smaller chem- found in SIVcpzPts.
Adaptation of HIV-1 to its human host 1857
FIG. 4.—Replication of variants of JC16 (upper) and NC7 (lower) strains in chimpanzee (left) and human (right) CD4þ T lymphocytes. For each
virus, strains encoding Met (blue) or Lys (red) at Gag-30 are compared. Virus replication was monitored by measuring the level of the p24 antigen in
culture supernatants (note that the scale varies between left and right panels). Replication curves are shown as the average of four independent
experiments in CD4þ T lymphocytes from different chimpanzee and human donors; the error bars represent one standard deviation calculated for each
time point.
we examined the sequences of two molecular clones (JC16 in both clones, indicating selection pressure for this residue
and NC7) that were recovered from chimpanzees experi- in the chimpanzee host (fig. 2).
mentally inoculated with HIV-1 (Mwaengo and Novembre The six other sites initially identified as differing
1998). The rationale for these chimpanzee infections was to between HIV-1 and the majority of SIVcpzPtt strains all
generate pathogenic variants for vaccine and pathogenesis retained the HIV-1 residue after passage in chimpanzees
studies (Novembre et al. 1997; Wei and Fultz 1998). Al- (table 1). At one site, Gag-224, the residue in the
though chimpanzees were subsequently found not to repre- chimpanzee-adapted virus (Ala) differed from that in the
sent a viable animal model, the viral isolates and molecular three HIV-1 group ancestor sequences (Pro). However, this
clones derived from these experiments are of value because site is Ala in most HIV-1 subtype B sequences, including
they represent chimpanzee-adapted strains of HIV-1. Clone the viruses used in the chimpanzee passage experiment.
JC16 was recovered from a chimpanzee 10 years after his
initial infection; clone NC7 was recovered from a second
chimpanzee one month after receiving blood from the first Replication of JC16 and NC7 in Chimpanzee and
chimpanzee (Mwaengo and Novembre 1998). The two
Human T Cells
clones differ at 1.4% of nucleotides; given the rate of
HIV-1 evolution (Li et al. 1988), this indicates that they To assess whether replacements at Gag-30 confer
may have diverged about 1-2 years before they were iso- a host species specific growth advantage, we applied site-
lated. These nucleotide substitutions cause 51 amino acid directed mutagenesis to the chimpanzee adapted JC16
differences across the viral proteome. To determine whether and NC7 clones to change the Met back to Lys, and as-
prolonged propagation in chimpanzees had exerted selec- sessed their replication potential in CD4þ T lymphocytes
tion pressure on the Gag-30 residue, we compared the se- from multiple chimpanzee and human donors. These ex-
quences of JC16 and NC7 to their parental strains, which periments revealed a clear and reproducible correlation be-
represented a mixture of group M subtype B strains (SF2 tween the identity of the amino acid at Gag-30 and the
and LAV) and thus contained a conservative replacement ability of viruses to replicate in cells from the two different
of Arg with Lys at Gag-30 (Wei and Fultz 1998). Remark- species (fig. 4). In chimpanzee cells, JC16 and NC7 with
ably, inspection of JC16 and NC7 revealed Met at Gag-30 Met at Gag-30 (blue) grew, on average, to much higher
1858 Wain et al.
titers than the same viruses with Gag-30 mutated to Lys 30 had reverted to Met, providing independent evidence
(red). However, there was considerable variation among of host-species specific selection on this site. Thus, while
replicates: in tests of the day 9 titers, NC7 with Met was the Met-to-Arg change on the lineages leading to HIV-1
significantly higher than NC7 with Lys (P , 0.05), but could have occurred prior to cross-species transmission,
for JC16 the comparison only bordered on significance possibly predisposing particular chimpanzee viruses to
(P 5 0.054). In human cells, these chimpanzee adapted vi- greater fitness in humans, the reversion to Met in chimpan-
ruses exhibited low replicative fitness (compare the scales zees infected with HIV-1 argues that the adaptive changes
in the left and right panels of fig. 4), but both JC16 and NC7 occurred after the cross-species transmission events.
viruses with Lys at Gag-30 replicated considerably more Previously it has been shown that mutation of HIV-1
efficiently than those with Met (P , 0.005, in both cases). subtype B Gag-30 to a non-basic residue (Thr), in combi-
nation with a similar mutation at Gag-32, greatly reduced
viral replication (Freed et al. 1995). Here we have found
Evolution of Gag-30 in HIV-1 that when the Met at Gag-30 in two chimpanzee-adapted
HIV-1 strains was replaced by Lys, the growth rate of these
Arg at Gag-30 has remained highly conserved during
viruses in chimpanzee CD4þ T cells was reduced by half.
the diversification of HIV-1 groups M, N and O, typically
In contrast, in human cells these same strains with Lys grew
only being conservatively replaced by another basic amino
twice as efficiently as strains with Met. These reciprocal
acid, Lys (D 5 26). Approximately 80% of available group
differences in replication capacity confirm that the nature
O sequences have Arg, while in the other 20% it is Lys.
of the amino acid at Gag-30 can have a major effect on viral
Only six group N virus Gag sequences have thus far been
fitness in cells from different host species. However, the
characterized: five have Arg, and one has Lys. Among the
greatly reduced growth rate of the chimpanzee-adapted vi-
group M subtype consensus sequences compiled at the HIV
ruses in human T cells, even after the replacement of Met
Sequence Database (www.hiv.lanl.gov), Arg is present in
with Lys at Gag-30, indicates that additional sites have
subtypes A1, A2, D, F1, G, H and K, while the subtype
evolved away from being human-adapted.
B consensus sequence has Lys. Interestingly, the only
In this context, it is intriguing that in subtype C, a major
HIV-1 group M subtype that does not contain a basic res-
sublineage within HIV-1 group M, Gag-30 has reverted to
idue at Gag-30 is subtype C. Met is found in both the con-
Met. Subtype C is very widespread: it is the predominant
sensus sequence and the inferred ancestral sequence of
form of HIV-1 in southern Africa and also responsible for
subtype C. Since subtype C is not the basal lineage in
AIDS epidemics in China and India (Hemelaar et al. 2006).
the group M radiation (Korber et al. 2000; Leitner et al.
It is thus clear that HIV-1 strains with Met at Gag-30 can be
2005), this indicates that reversion of Arg to Met occurred
epidemiologically successful. However, subtype C isolates
subsequent to the divergence of the subtype C ancestor
have been shown to replicate less efficiently than other sub-
from the other group M lineages.
types in head-to-head ex vivo competition experiments in
primary CD4þ T cells and macrophages (Ball et al.
Discussion 2003; Arien et al. 2005). This finding has been interpreted
to indicate that disease progression may be slower in sub-
Comparisons of the inferred sequences of the ances- type C infections, providing a longer period when infected
tors of the three independent groups of HIV-1 with those individuals may transmit virus to others and thus explaining
of SIVcpz strains revealed a single site in the HIV-1 pro- why subtype C is so common globally (Arien et al. 2007).
teome which appears to have undergone identical changes Whether there is indeed a causal link between the observed
on each of the three occasions when virus was transmitted in vitro phenotype of subtype C and its ability to spread in
from apes to humans. All available sequences of SIVcpzPtt, the human population requires further studies, especially in
the clade from within which HIV-1 originated, have Met at light of data indicating that in vivo viral loads are higher
Gag-30, while the ancestors of all three HIV-1 groups had in subtype C infections (Dyer et al. 1998). However, it is
Arg. Gag-30 was the only site, among 450 in the Gag align- tempting to speculate that the reduced replication fitness
ment, to show the same non-conservative amino acid re- of subtype C viruses in primary lymphocyte cultures may
placement on three independent terminal branches within be due, at least in part, to the Met residue at Gag-30. Since
the SIVcpzPtt/HIV-1 clade, and so it is remarkable that the host-specificity at this site is otherwise so consistent, it
these changes were found specifically on the branches lead- is also possible that the ancestral subtype C lineage under-
ing to the three HIV-1 groups. The finding of this non- went changes at other sites which removed the need for, and
conservative amino acid replacement on the branches of perhaps even selected against, a basic residue at Gag-30
the tree immediately prior to each of the three clades of hu- during replication in human cells. A number of subtype
man viruses, at a site which is highly conserved among C viruses have Arg at Gag-30, but we could not find any
chimpanzee viruses, strongly suggests host-species specific meaningful correlation between the presence or absence
adaptation. During the subsequent spread and evolution of of Met at Gag-30 and any polymorphisms at other sites. Site
HIV-1 in humans, Gag-30 has been conserved as Arg, or directed mutagenesis of prototypic subtype C strains, as re-
conservatively replaced by another basic amino acid ported here for chimpanzee adapted subtype B strains, may
(Lys), in all lineages except group M subtype C. shed further light on these possibilities.
Retrospective analysis of the result of passaging HIV- The residue present at Gag-30 varies among SIVs in-
1 in chimpanzees for more than 10 years (Mwaengo and fecting different primate species. Interestingly, it is also Met
Novembre 1998) revealed that the basic residue at Gag- in SIVsmm from sooty mangabeys, the progenitor of HIV-2.
Adaptation of HIV-1 to its human host 1859
Eight different groups of HIV-2 have been described, which CA 13148), the Yerkes Regional Primate Research Center
each seem to reflect an independent cross-species transmis- (RR-00165), the Bristol Myers Freedom to Discover Pro-
sion event (Hahn et al. 2000; Damond et al. 2004; Santiago gram, and the Institut de Recherche pour le Développement
et al. 2005), although only two of these groups (A and B) (IRD).
are known to have spread in the human population. Re-
markably, this residue changed to Arg in the inferred ances-
tor of the most widespread form of HIV-2, group A.
However, it has been conserved as Met in HIV-2 group Literature Cited
B, and was not found to have changed in any of the sporadic Arien KK, Abraha A, Quinones-Mateu ME, Kestens L,
forms of HIV-2. These observations provide additional ev- Vanham G, Arts EJ. 2005. The replicative fitness of primary
idence that a basic residue at Gag-30 enhances the replica- human immunodeficiency virus type 1 (HIV-1) group M,
tion potential and possibly the secondary spread of primary HIV-1 group O, and HIV-2 isolates. J Virol. 79:8979–8990.
lentiviruses after transmission to humans. Arien KK, Vanham G, Arts EJ. 2007. Is HIV-1 evolving to a less
Gag-30 lies within the basic N-terminal domain of the virulent form in humans? Nature Rev Microbiol. 5:141–151.
gag-encoded matrix protein (p17). This domain is critical Ball SC, Abraha A, Collins KR, Marozsan AJ, Baird H,
Quinones-Mateu ME, Penn-Nicholson A, Murray M,
for targeting the Gag precursor to the plasma membrane
Richard N, Lobritz M, Zimmerman PA, Kawamura T,
during virus assembly (Hill et al. 1996). Several cellular Blauvelt A, Arts EJ. 2003. Comparing the ex vivo fitness of
factors are known (or believed) to bind this same region. CCR5-tropic human immunodeficiency virus type 1 isolates
These include PI(4,5)P2, a plasma membrane component of subtype B and C. J Virol. 77:1021–1038.
(Ono et al. 2004), TIP47, a recently discovered Gag/Env Bibollet-Ruche F, Bailes E, Gao F, Pourrut X, Barlow K,
connector protein complex (Lopez-Verges et al. 2006), Clewley JP, Mwenda JM, Langat MK, Chege GK,
and AP-3, an adaptor protein complex that is involved in McClure HM, Mpoudi-Ngole E, Delaporte E, Peeters M,
cargo protein sorting to specific membrane components Shaw GM, Sharp PM, Hahn BH. 2004a. New simian
(Dong et al. 2005). NMR studies have shown that Gag- immunodeficiency virus infecting De Brazza’s monkeys
30 is not a part of the hydrophobic cleft that binds (Cercopithecus neglectus): evidence for a Cercopithecus
monkey clade. J Virol. 78:7748–7762.
PI(4,5)P2 (Saad et al. 2006). The TIP47 binding site on
Bibollet-Ruche F, Gao F, Bailes E, Saragosti S, Delaporte E,
the HIV-1 matrix protein has not yet been mapped; how- Peeters M, Shaw GM, Hahn BH, Sharp PM. 2004b. Complete
ever, mutagenesis studies suggest that two highly con- genome analysis of one of the earliest SIVcpzPtt strains from
served matrix regions (S6-S9 and D14-E17) are likely Gabon (SIVcpzGAB2). AIDS Res Human Retrovir.
involved (Lopez-Verges et al. 2006). Finally, the d subunit 20:1377–1381.
of the AP-3 complex is believed to interact with the amino- Damond F, Worobey M, Campa P, Farfara I, Colin G,
terminal a-helical segment of the matrix protein (Dong et al. Matheron S, Brun-Vezinet F, Robertson DL, Simon F.
2005). Thus, there are a number of candidate human pro- 2004. Identification of a highly divergent HIV type 2 and
teins that could bind the SIVcpz matrix proteins with re- proposal for a change in HIV type 2 classification. AIDS Res
duced affinity due to species specific differences in their Human Retrovir. 20:666–672.
Derdeyn CA, Decker JM, Sfakianos JN, Wu X, O’Brien WA,
protein sequence, and the same could be true for chimpanzee
Ratner L, Kappes JC, Shaw GM, Hunter E. 2000. Sensitivity
proteins and their interaction with the HIV-1 matrix protein. of human immunodeficiency virus type 1 to the fusion
In conclusion, although the specific function of Gag- inhibitor T-20 is modulated by coreceptor specificity defined
30 remains to be determined, it is clear that adaptive by the V3 loop of gp120. J Virol. 74:8358–67.
changes at this residue in the SIVcpz matrix protein were Dong X, Li H, Derdowski A, Ding L, Burnett A, Chen X,
required for efficient replication in the human host. These Peters TR, Dermody TS, Woodruff E, Wang J-J, Spearman P.
findings represent the first demonstration of host species- 2005. AP-3 directs the intracellular trafficking of HIV-1 Gag
specific selection pressure in primate lentiviruses and iden- and plays a key role in particle assembly. Cell. 120:663–674.
tify the Gag matrix protein as a modulator of virus fitness Dyer JR, Kazembe P, Vernazza PL, Gilliam BL, Maida M,
following cross-species transmission to a new primate host. Zimba D, Hoffman IF, Royce RA, Schock JL, Fiscus SA,
Cohen MS, Erin JJ. 1998. High levels of human immunode-
ficiency virus type 1 in blood and semen of seropositive men
in sub-Saharan Africa. J. Infect Dis. 177:1742–1746.
Supplementary Material Freed EO, Englund G, Martin MA. 1995. Role of the basic
domain of human immunodeficiency virus type 1 matrix in
A list of the GenBank accession numbers of all se- macrophage infection. J Virol. 69:3949–3954.
quences analyzed is available at Molecular Biology and Gao F, Bailes E, Robertson DL, Chen Y, Rodenburg CM,
Evolution online (http://www.mbe.oxfordjournals.org/). Michael SF, Cummins LB, Arthur LO, Peeters M, Shaw GM,
Sharp PM, Hahn BH. 1999. Origin of HIV-1 in the chim-
panzee Pan troglodytes troglodytes. Nature. 397:436–441.
Grantham R. 1974. Amino acid difference formula to help
Acknowledgments explain protein evolution. Science. 185:862–864.
Hahn BH, Shaw GM, De Cock KM, Sharp PM. 2000. AIDS as
We thank Wes Sundquist, Eric Freed, Mike Emerman, a zoonosis: scientific and public health implications. Science.
Heinrich Gottlinger and Mike Summers for unpublished in- 287:607–614.
formation, and Mike Worobey and John Brookfield for dis- Hemelaar J, Gouws E, Ghys PD, Osmanov S. 2006. Global and
cussion. This work was supported by the National Institutes regional distribution of HIV-1 genetic subtypes and recombi-
of Health (R01 AI50529, R01 AI58715, P30 AI 27767, P30 nants in 2004. AIDS. 20:W13–W23.
1860 Wain et al.
Hill CP, Worthylake D, Bancroft DP, Christensen AM, plasma membrane for virus assembly. Proc Natl Acad Sci
Sundquist WI. 1996. Crystal structures of the trimeric human USA. 103:11364–11369.
immunodeficiency virus type 1 matrix protein: implications Santiago M, Range F, Keele BF, Li Y, Bailes E, Bibollet-
for membrane association and assembly. Proc Natl Acad Sci Ruche F, Fruteau C, Noe R, Peeters M, Brookfield JFY,
USA. 93:3099–3104. Shaw GM, Sharp PM, Hahn BH. 2005. Simian immunode-
Huelsenbeck JP, Ronquist F. 2001. MRBAYES: Bayesian ficiency virus infection in free-ranging sooty mangabeys
inference of phylogenetic trees. Bioinformatics. 17:754–755. (Cercocebus atys atys from the Tai Forest, Cote d’Ivoire:
Jones DT, Taylor WR, Thornton JM. 1992. The rapid generation implications for the origin of epidemic human immunodefi-
of mutation data matrices from protein sequences. Comp Appl ciency virus type 2. J Virol. 79:12515–12527.
Biosci. 8:275–282. Sharp PM, Bailes E, Chaudhuri RR, Rodenburg CM,
Keele BF, Van Heuverswyn F, Li Y, Bailes E, Takehisa J, Santiago MO, Hahn BH. 2001. The origins of AIDS viruses:
Santiago ML, Bibollet-Ruche F, Chen Y, Wain LV, where and when? Phil Trans Roy Soc London B.
Liegeois F, Loul S, Mpoudi Ngole E, Bienvenue Y, 356:867–876.
Delaporte E, Brookfield JFY, Sharp PM, Shaw GM, Sharp PM, Shaw GM, Hahn BH. 2005. Simian immunodefi-
Peeters M, Hahn BH. 2006. Chimpanzee reservoirs ciency virus infection of chimpanzees. J Virol. 79:3891–
of pandemic and non-pandemic HIV-1. Science. 313: 3902.
523–526. Takehisa J, Kraus MH, Decker JM, Li Y, Keele BF, Bibollet-
Korber B, Muldoon M, Theiler J, Gao F, Gupta R, Lapedes A, Ruche F, Zammit KP, Wng Z, Santiago ML, Kamenya S,
Hahn BH, Wolinsky S, Bhattacharya T. 2000. Timing the Wilson ML, Pusey AE, Bailes E, Sharp PM, Shaw GM,
ancestor of the HIV-1 pandemic. Science. 288:1789–1796. Hahn BH. 2007. Generation of infectious molecular clones of
Leitner T, Korber B, Daniels M, Calef C, Foley B. 2005. HIV-1 simian immunodeficiency virus by synthesis of fecal viral
subtype and circulating recombinant form (CRF) reference consensus sequences. J Virol. in press
sequences, 2005. In: Leitner T, Foley B, Hahn B, Marx P, Thompson JD, Higgins DG, Gibson TJ. 1994. ClustalW:
McCutchan F, Mellors J, Wolinsky S, Korber B, editors. HIV improving the sensitivity of progressive multiple sequence
sequence compendium. New Mexico: Los Alamos National alignment through sequence weighting, position-specific gap
Laboratory. p. 41–48. penalties and weight matrix choice. Nucleic Acids Res.
Li W-H, Tanimura M, Sharp PM. 1988. Rates and dates of 22:4673–4680.
divergence between AIDS virus nucleotide sequences. Mol Van Heuverswyn F, Li Y, Neel C, Bailes E, Keele BF, Liu W,
Biol Evol. 5:313–330. Loul S, Butel C, Liegois F, Bienvenue Y, Ngole EM,
Lopez-Verges S, Camus G, Blot G, Beauvoir R, Benarous R, Sharp PM, Shaw GM, Delaporte E, Hahn BH, Peeters M.
Berlioz-Torrent C. 2006. Tail-interacting protein TIP47 is 2006. SIV infection in wild gorillas. Nature. 444:164.
a connector between Gag and Env and is required for Env Wei Q, Fultz PN. 1998. Extensive diversification of human
incorporation into HIV-1 virions. Proc Natl Acad Sci USA. immunodeficiency virus type 1 subtype B strains during dual
103:14947–14952. infection of a chimpanzee that progressed to AIDS. J Virol.
Mwaengo DM, Novembre FJ. 1998. Molecular cloning and 72:3005–3017.
characterization of viruses isolated from chimpanzees with Wei X, Decker JM, Liu H, Zhang Z, Arani RB, Kilby JM,
pathogenic human immunodeficiency virus type 1 infections. Saag MS, Wu X, Shaw GM, Kappes JC. 2002. Emergence of
J Virol. 72:8976–8987. resistant human immunodeficiency virus type 1 in patients
Nerrienet E, Santiago ML, Foupouapouognigni Y, Bailes E, receiving fusion inhibitor (T-20) monotherapy. Antimicrob
Mundy NI, Njinku B, Kfutwah A, Muller-Trutwin MC, Barre- Agents Chemother. 46:1896–905.
Sinoussi F, Shaw GM, Sharp PM, Hahn BH, Ayouba A. Yamaguchi J, Bodelle P, Kaptue L, Zekeng L, Gurtler LG,
2005. Simian immunodeficiency virus infection in wild- Devare SG, Brennan CA. 2003. Near full-length genomes of
caught chimpanzees from Cameroon. J Virol. 79:1312–1319. 15 HIV type 1 group O isolates. AIDS Res Human Retrovir.
Novembre FJ, Saucier M, Anderson DC, Klumpp SA, 19:979–988.
O’Neill SP, Brown CR, Hart CE, Guenthner PC, Yang Z. 1994. Maximum likelihood phylogenetic estimation
Swenson RB, McClure HM. 1997. Development of AIDS from DNA sequences with variable rates over sites:
in a chimpanzee infected with human immunodeficiency virus approximate methods. J Mol Evol. 39:306–14.
type 1. J Virol. 71:4086–4091. Yang Z. 1997. PAML: a program package for phylogenetic
Ono A, Ablan SD, Lockett SJ, Nagashima K, Freed EO. 2004. analysis by maximum likelihood. Comp Appl Biosci.
Phosphatidylinositol (4,5) bisphosphate regulates HIV-1 Gag 15:555–556.
targeting to the plasma membrane. Proc natl Acad Sci USA. Yang Z, Rannala B. 1997. Bayesian phylogenetic inference using
101:14889–14894. DNA sequences: a Markov Chain Monte Carlo Method. Mol
Peeters M, Honore C, Huet T, Bedjabaga L, Ossari S, Bussi P, Biol Evol. 14:717–24.
Cooper RW, Delaporte E. 1989. Isolation and partial
characterization of an HIV-related virus occurring naturally
Robin Bush, Associate Editor
in chimpanzees in Gabon. AIDS. 3:625–630.
Saad JS, Miller J, Tai J, Kim A, Ghanam RH, Summers MF.
2006. Structural basis for targeting HIV-1 Gag proteins to the Accepted May 22, 2007