Somatic populations of PGT135-137 HIV-1-neutralizing antibodies identified by 454 pyrosequencing and bioinformatics

Zhu, Jiang; O’Dell, Sijy; Ofek, Gilad; Pancera, Marie; Wu, Xueling; Zhang, Baoshan; Zhang, Zhenhai; Comparative Sequencing Program, NISC; Mullikin, James C.; Simek, Melissa; Burton, Dennis R.; Koff, Wayne C.; Shapiro, Lawrence; Mascola, John R.; Kwong, Peter D.

doi:10.3389/fmicb.2012.00315

ORIGINAL RESEARCH article

Front. Microbiol., 11 September 2012

Sec. Virology

Volume 3 - 2012 | https://doi.org/10.3389/fmicb.2012.00315

This article is part of the Research Topic Genomics and computational science for virus research View all 18 articles

Somatic populations of PGT135-137 HIV-1-neutralizing antibodies identified by 454 pyrosequencing and bioinformatics

Jiang Zhu¹

Sijy O’Dell^1†

Gilad Ofek^1†

Marie Pancera^1†

Xueling Wu^1†

Baoshan Zhang^1†

Zhenhai Zhang^2†

NISC Comparative Sequencing Program³ James C. Mullikin³

Melissa Simek⁴

Dennis R. Burton^5,6,7

Wayne C. Koff⁴

Lawrence Shapiro^1,2

John R. Mascola¹

Peter D. Kwong¹*

¹ Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
² Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
³ NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
⁴ International AIDS Vaccine Initiative, New York, NY, USA
⁵ Department of Immunology and Microbial Science, The Scripps Research Institute, La Jolla, CA, USA
⁶ International AIDS Vaccine Initiative Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA, USA
⁷ Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA

Select HIV-1-infected individuals develop sera capable of neutralizing diverse viral strains. The molecular basis of this neutralization is currently being deciphered by the isolation of HIV-1-neutralizing antibodies. In one infected donor, three neutralizing antibodies, PGT135–137, were identified by assessment of neutralization from individually sorted B cells and found to recognize an epitope containing an N-linked glycan at residue 332 on HIV-1 gp120. Here we use next-generation sequencing and bioinformatics methods to interrogate the B cell record of this donor to gain a more complete understanding of the humoral immune response. PGT135–137-gene family specific primers were used to amplify heavy-chain and light-chain variable-domain sequences. Pyrosequencing produced 141,298 heavy-chain sequences of IGHV4-39 origin and 87,229 light-chain sequences of IGKV3-15 origin. A number of heavy and light-chain sequences of ∼90% identity to PGT137, several to PGT136, and none of high identity to PGT135 were identified. After expansion of these sequences to include close phylogenetic relatives, a total of 202 heavy-chain sequences and 72 light-chain sequences were identified. These sequences were clustered into populations of 95% identity comprising 15 for heavy chain and 10 for light chain, and a select sequence from each population was synthesized and reconstituted with a PGT137-partner chain. Reconstituted antibodies showed varied neutralization phenotypes for HIV-1 clade A and D isolates. Sequence diversity of the antibody population represented by these tested sequences was notably higher than observed with a 454 pyrosequencing-control analysis on 10 antibodies of defined sequence, suggesting that this diversity results primarily from somatic maturation. Our results thus provide an example of how pathogens like HIV-1 are opposed by a varied humoral immune response, derived from intrinsic mechanisms of antibody development, and embodied by somatic populations of diverse antibodies.

Introduction

Recent years have seen revolutions in both genomics and computational science (Lander et al., 2001; Venter et al., 2001; Chen et al., 2012). In both of these fields, capabilities are advancing exponentially (Kahn, 2011). The impact of this non-linear development on biology is pervasive and multifaceted. With respect to virus research, the influence has been profound and is the focus of this special issue of Frontiers. Medical interest in viruses is focused on pathogens and their infection, and the biological mirror of infection is the host immune response. Advances in genomics and computational science have the potential for an equally profound impact on our understanding of the immune response. Here we focus on the application of new genomic and computational techniques, particularly 454 pyrosequencing of B cell transcripts (Reddy et al., 2009; Reddy and Georgiou, 2011; Wu et al., 2011) and systems-level bioinformatics (Kitano, 2002), to understand the antibody response to infection.

The human immunodeficiency virus type I, HIV-1, is the etiological agent of a global pandemic, which has killed over 30 million people, and currently infects ∼1% of adults worldwide (UNAIDS, 2010). HIV-1 is a retrovirus and member of the lentivirus genus (Gonda et al., 1985; Sonigo et al., 1985). Global genetic diversity of HIV-1 is extraordinarily high (Starcich et al., 1986; Korber et al., 2001), and this is thought to result from the low fidelity of its genome replication (Preston et al., 1988) as well as the persistent nature of the infection: the diversity of HIV-1 virus within a single individual after 6 years of infection is equivalent to the global diversity of H1N1 influenza observed annually (Korber et al., 2001). Infection by HIV-1 elicits many antibodies, but in general these are not capable of neutralization of diverse strains of HIV-1. However, after several years of infection, 10–25% of infected individuals develop broadly neutralizing antibodies (Li et al., 2007; Gray et al., 2009; Sather et al., 2009; Simek et al., 2009; Stamatatos et al., 2009; Doria-Rose et al., 2010; Gnanakaran et al., 2010). These antibodies provide little or no benefit to the infected host, as the evolution of the virus outpaces the immune response (Parren et al., 1999; Poignard et al., 1999; Wei et al., 2003). Nevertheless these antibodies, when tested in humanized mice or macaque models by passive antibody transfer, impart effective immunity to challenge with HIV-1 or simian/human chimeric immunodeficiency viruses (Mascola et al., 1999, 2000; Parren et al., 2001; Mascola, 2003; Veazey et al., 2003; Hessell et al., 2009a,b; Balazs et al., 2011), indicating the potential for their use as targets for re-elicitation by rationally designed vaccines (reviewed in Walker and Burton, 2010; Kwong et al., 2011). Thus, substantial interest has focused on understanding human antibodies that effectively neutralize diverse strains of HIV-1.

A number of techniques have recently been applied to identification of such antibodies. These methods – including antigen-specific B cell sorting (Scheid et al., 2009; Wu et al., 2010) and direct assessment of neutralization by antibodies secreted from individually sorted B cells (Walker et al., 2009, 2011), each coupled to single B cell sequencing techniques – have so far yielded dozens of broadly HIV-1-neutralizing antibodies. These antibodies represent an extraordinarily sparse sampling of the humoral immune response, which typically generates roughly a billion new B cells in a healthy individual each day. We therefore asked whether the revolutionary new capabilities of next-generation sequencing (Mardis, 2008a,b; Boyd et al., 2010; Hawkins et al., 2010) and computational science could expand this sampling to generate a more complete understanding of the humoral immune response. In principle, memory B cells contain a persistent record of the antibody response to infection. As memory B cells are readily attained from blood, they provide a convenient means to access the antibody record, with B cell transcripts in peripheral blood mononuclear cells (PBMCs) providing a genetic representation. Using three antibodies, PGT135–137 from Protocol G donor 39 (Walker et al., 2011) as an example, we used 454 pyrosequencing of PCR-amplified heavy- and light-chain transcripts to capture a more comprehensive genetic record. We used bioinformatics approaches to interrogate this record, to identify populations of neutralizing antibodies, and to characterize their ontogenies. We link these ontogenies to the natural mechanisms of B cell development to provide a view of how somatic populations of antibodies engender a diverse immunological response to infection.

Materials and Methods

Human Specimens

The PBMCs of the HIV-1 infected donor 39 were obtained from the International AIDS Vaccine Initiative (IAVI) protocol G. The same sample was used to isolate broadly neutralizing antibodies PGT135–137 (Walker et al., 2011). Human peripheral blood samples were collected after obtaining informed consent and appropriate Institutional Review Board (IRB) approval.

Sample Preparation for 454 Pyrosequencing

Ten previously described heavy-chain plasmids with known sequences (Wu et al., 2011) were selected to assess 454 pyrosequencing error. Ten plasmids (100 ng each) were combined in 35 μl water, and 1 μl of the ten-plasmid combination was used to template polymerase chain reactions (PCRs). The heavy and kappa chain PCR samples for 454 pyrosequencing from donor 39 were prepared as described (Wu et al., 2011) with minor modifications. Briefly, mRNA was extracted from 20 million PBMCs into 200 μl of elution buffer (Oligotex kit, Qiagen), then concentrated to 10–30 μl by centrifuging the buffer through a 30 kD micron filter (Millipore). The reverse transcription was performed in one or multiple 35 μl-reactions, each composed of 13 μl of mRNA, 3 μl of oligo(dT)_12–18 at 0.5 μg/μl (Invitrogen), 7 μl of 5× first strand buffer (Invitrogen), 3 μl of RNase Out (Invitrogen), 3 μl of 0.1 M DTT (Invitrogen), 3 μl of dNTP mix (each at 10 mM), and 3 μl of SuperScript II (Invitrogen). The reactions were incubated at 42°C for 2 h. The cDNAs from each reaction were combined, applied to the NucleoSpin Extract II kit (Clontech), and eluted in 20 μl of elution buffer. In this way, 1 μl of the cDNA comprised transcripts from 1 million PBMCs. The immunoglobulin gene family-specific PCR was set up in a total volume of 50 μl, using 1 μl of the heavy-chain plasmid mix or 5 μl of the cDNA as template (equivalent of transcripts from 5 million PBMCs). The DNA polymerase systems used was the Platinum Taq High-Fidelity (HiFi) DNA Polymerase System (Invitrogen). According to the instructions of the manufacturer, the reaction mix was composed of water, 5 μl of 10× buffer, and 1 μl of supplied MgSO4, 2 μl of dNTP mix (each at 10 mM), 1–2 μl of primers (Table S1 in Supplementary Material) at 25 μM, and 1 μl of Platinum Taq HiFi DNA polymerase. The primers each contained the appropriate adaptor sequences (XLR-A or XLR-B) for subsequent 454 pyrosequencing. The PCRs were initiated at 95°C for 30 s, followed by 25 cycles of 95°C for 30 s, 58°C for 30 s, and 72°C for 1 min, then incubated at 72°C for 10 min. The PCR products at the expected size (∼500 bp) were gel extracted and purified (Qiagen), followed by further phenol/chloroform purification.

454 Pyrosequencing and Library Preparation

The 454 pyrosequencing was carried out as described previously (Wu et al., 2011). Briefly, PCR products were quantified using Qubit (Life Technologies, Carlsbad, CA, USA). Library concentrations were determined using the KAPA Biosystems qPCR system (Woburn, MA, USA) with 454 pyrosequencing standards provided in the KAPA system. Pyrosequencing of the PCR products was performed on a GS FLX sequencing instrument (Roche-454 Life Sciences, Bradford, CT, USA) using the manufacturer’s suggested methods and reagents. Initial image collection was performed on the GS FLX instrument and subsequent signal processing, quality filtering, and generation of nucleotide sequence and quality scores were performed on an off-instrument linux cluster using 454 application software (version 2.5.3). The amplicon quality filtering parameters were adjusted based on the manufacturer’s recommendations (Roche-454 Life Sciences Application Brief No. 001-2010). Quality scores were assigned to each nucleotide using methodologies incorporated into the 454 application software to convert flowgram intensity values to Phred-based quality scores and as described (Brockman et al., 2008). The quality of each run was assessed by analysis of internal control sequences included in the 454 pyrosequencing reagents. Reports were generated for each region of the PicoTiterPlate (PTP) for both the internal controls and the samples.

Bioinformatics Analysis of 454 Pyrosequencing-Determined Antibody Sequences

Our previously described bioinformatics pipeline (Wu et al., 2011) was refined and currently consists of five steps. Starting from a 454 pyrosequencing-determined antibodyome, each sequence read was (1) reformatted and labeled with a unique index number; (2) assigned to variable (V), diverse (D), and joining (J) gene families and alleles using an in-house implementation of IgBLAST¹, and sequences with E-value > 10⁻³ for V gene assignment were rejected; (3) subjected to a template-based error-correction procedure, in which 454 pyrosequencing homopolymer errors in V, D, and J regions were detected based on the alignment to their respective germline sequences. Note that only insertion and deletion errors of less than three nucleotides were corrected. D and J gene were corrected only when their gene assignment was reliable, indicated by E-value < 10⁻³; (4) compared with the a set of template antibody sequences at both nucleotide level and amino-acid level using a global alignment module in CLUSTALW2 (Larkin et al., 2007); (5) subjected to a multiple sequence alignment (MSA)-based scheme to determine the third complementarity-determining region (CDR H3 or L3), which was further compared with a set of template CDR H3 or L3 sequences at nucleotide level, and to determine the sequence boundary of variable domain. For a large population of highly similar sequences, a “divide-and-conquer” procedure could be used to derive a consensus sequence to represent the population and to reduce random sequencing errors. First, a clustering using BLASTClust (Altschul et al., 1997) with a 95% sequence identity cutoff is performed on the sequence population. Then, the largest cluster is divided into 10–50 sets, for each of which a consensus can be derived from MSA. A final consensus is obtained by averaging over the subset consensuses.

Intra-donor phylogenetic analysis use the same procedure as cross-donor phylogenetic analysis, which has been described in detail in previous study (Wu et al., 2011), except that the template antibodies are from the same donor (intra-donor) rather than added exogenously (cross-donor), and intra-donor phylogenetic analysis is equally applicable to heavy and light chains. Briefly, the computational procedure consists of an iterative analysis based on the neighbor-joining (NJ) method (Kuhner and Felsenstein, 1994) implemented in CLUSTALW2 (Larkin et al., 2007) and a final analysis based on the maximum-likelihood (ML) method with molecular clock implemented in DNAMLK² in the PHYLIP package v3.69³. In the NJ-based analysis, donor sequences of a particular germline origin were first randomly shuffled and divided into subsets of no more than 5,000 sequences. Then, PGT135–137 and respective germline sequence, IGHV4-39*07 for heavy chain and IGKV3-15*01 for light chain, were added to each subset. A NJ tree was constructed for each subset using the “Phylogenetic trees” option in CLUSTALW2 (Larkin et al., 2007). The donor sequences that clustered in the smallest branch that contains PGT135–137 were extracted from each NJ tree and combined into a new data set for the next round of analysis. The analysis was repeated until convergence, where all the donor sequences resided within a subtree containing PGT135–137 and no other sequences resided between this subtree and the root, and where further repeat of the analysis did not change the NJ tree. The ML-based analysis was used to confirm the intra-donor dendrogram derived from the NJ-based analysis. Starting from the data set obtained from the last iteration of NJ analysis, the MSA generated by CLUSTALW2 (Larkin et al., 2007) was provided as input to construct a phylogenetic tree using DNAMLK. Usually, any sequences outside the ML-defined subtree were discarded, but in this study we tested light chains identified by NJ method but immediately outside the rooted ML-defined PGT135–137 subtree. The displayed phylogenetic trees were generated using Dendroscope (Huson et al., 2007), ordered to ladderize right and rooted at the germline genes.

A description of the antibodyomics software (Antibodyomics1.0) utilized in this paper is being prepared for publication.

Antibody Expression and Purification

Antibody production followed previously described procedures (Wu et al., 2011). Briefly, sequences were selected using the respective bioinformatics procedure and checked for sequencing errors using an automatic error-correction procedure followed by manual inspection. The corrected antibody sequences were synthesized (GenScript USA Inc. and Blue Heron Biotech, LLC.) and cloned into the CMV/R expression vector (Barouch and Nabel, 2005) containing the constant regions of IgG1. All synthesized heavy chains were paired with PGT137 light-chain DNA, and synthesized light chains were paired with PGT137 heavy-chain DNA for transfection. Full-length IgGs were expressed from transient transfection of 293F cells and purified using a recombinant protein-A column (Pierce).

HIV-1 Neutralization

Neutralization was measured using HIV-1 Env-pseudoviruses to infect TZM-bl cells as described (Li et al., 2005; Wu et al., 2009; Seaman et al., 2010). Neutralization curves were fit by non-linear regression using a five-parameter hill slope equation as described (Seaman et al., 2010). The 50% and 80% inhibitory concentrations (IC₅₀ and IC₈₀) were reported as the antibody concentrations required to inhibit infection by 50% and 80% respectively.

Results

Experiments involving both sequencing technologies and computational analyses are described. Because variable region transcripts of antibodies are over 300 nucleotides in length and because the high similarity between different antibody transcripts precludes assembly of full sequences from fragments, we used 454 pyrosequencing, which is currently one of the few next-generation sequencing technologies to provide reads of sufficient length (Reddy et al., 2009; Reddy and Georgiou, 2011; Wu et al., 2011). However, 454 pyrosequencing is known to suffer from high error rates (Prabakaran et al., 2011). We therefore begin by characterizing the accuracy of 454 pyrosequencing applied to a set of plasmid standards consisting of known HIV-neutralizing antibodies. We then describe 454 pyrosequencing of antibody heavy-chain transcripts from donor 39 (Walker et al., 2011), and analyze these data bioinformatically and functionally. We follow this with a similar analysis of donor 39 light-chain transcripts.

Characterization of 454 Pyrosequencing Errors on Antibody Transcripts

To investigate the extent of 454 pyrosequencing errors on the antibodyome analysis, we carried out a sequencing experiment on the heavy chains of 10 selected antibodies (Wu et al., 2011), including five from B cell sorting-based isolation, VRC01, VRC03, VRC-PG04, VRC-CH31, and VRC-CH33, one codon-optimized version of inferred reverted unmutated ancestor of VRC-PG04 (termed VRC-PG04_cog), and four identified from previous 454 pyrosequencing study, gVRC-H3_d74, gVRC-H6_d74, gVRC-H12_d74, and gVRC-H15_d74. The plasmid sequencing data was processed with the same bioinformatics pipeline used for donor sequencing data (Figure S1 in Supplementary Material). Sequence reads were subjected to an error-correction procedure, which was aimed to fix deletion and insertion errors that cause protein translation problems (see Materials and Methods). Results obtained with and without error correction were compared to examine the effect of error correction on observed sequence variation.

A divergence/identity analysis was first carried out on the 10 plasmid data set, obtained without (Figure 1) and with error correction (Figure S2 in Supplementary Material). Since divergence and identity were calculated at the nucleotide level, error correction appeared to have little effect on the sequence distribution. Ideally, if the 454 pyrosequencing did not produce any errors, especially mutations, the distribution – irrespective of the antibody being used as template – would yield, on divergence/identity plots, 10 discrete points, each corresponding to one of the input sequences. In contrast, divergence/identity plots revealed broad islands centered around each of these 10 antibody sequences (Figure 1). The shape and area of each island provide a visual representation of the extent of the 454 pyrosequencing errors. As shown in Table 1, 5 of the 10 antibodies – those with an identity gap of 25% or greater to the next most closely related sequence – were easily distinguished from each other, while other more closely related variants, e.g., VRC-CH31 and VRC-CH33, overlapped (Figure 1). Based on identity considerations (Table 1) and the scope of each island in divergence/identity plots (Figure 1), a single cutoff of 75% was applied to group 454 pyrosequencing-determined sequences for VRC01, VRC03, VRC-PG04_cog, gVRC-H3_d74, and gVRC-H6_d74.

FIGURE 1

Figure 1. Sequence variation as a consequence of 454 pyrosequencing for ten plasmid-control antibodies. To quantify sequencing error, ten antibodies, input as purified plasmid DNA, were subjected to 454 pyrosequencing. Tested plasmid antibodies included VRC01, VRC03, VRC-PG04, VRC-CH31, VRC-CH33, a codon-optimized version of inferred, reverted unmutated ancestor of VRC-PG04 (termed VRC-PG04_cog), gVRC-H3_d74, gVRC-H6_d74, gVRC-H12_d74, and gVRC-H15_d74. Heavy chain sequences are plotted as a function of sequence identity to the plasmid antibody (vertical axes) and of sequence divergence from their germline gene allele, IGHV1-2*02 (horizontal axes). The sequencing data used for divergence/identity analysis was processed by the standard bioinformatics pipeline without the error-correction step. Color coding indicates the number of sequences. For VRC01 and VRC03, additional contour plots displaying the estimated mutational error range (one root-mean-square deviation, 1.38% for VRC01 group and 1.26% for VRC03 group) have been shaded red around the input antibody.

TABLE 1

Table 1. Percent sequence-identity matrix of 10 plasmid antibody heavy-chain variable domains tested by 454 pyrosequencing.

Each of these five 454 pyrosequencing-determined sequence groups was analyzed for mutations, insertions, and deletions relative to the input plasmid sequence, as well as total number of reads and their redundancy (Table 2). For four of the plasmids ∼50,000 reads were obtained; for gVRC-H6_d74, however, only about one fourth as many were obtained, which may relate to a lower efficiency of the primer used for gVRC-H6_d74. In terms of redundancy, for three of the plasmids between one fifth and one half of the reads were identical to the input plasmid, whereas for VRC01 and gVRC-H6_d74, only a small fraction (<1 and <10%) of the reads were identical to the input plasmid, a result of insertions in most of the sequences. Note that after error correction, 20–3254 more sequences became identical to the input antibodies (Table 2). Overall, for an antibody of typical length, ∼5-nucleotide mutations were observed between 454 pyrosequencing reads and corresponding input sequences. Error correction appeared to cause an increased count of mutation errors while decreasing insertion and deletion errors that produce stop codons and nonsense codons in protein translation. Currently used correction procedure was able to improve the identity of translated protein sequence to respective germline gene by an average of 14.1% (Figures S1C,D in Supplementary Material).

TABLE 2

Table 2. Statistical analysis of 454 pyrosequencing-induced errors for five plasmid antibodies.

We then examined the accuracy of bioinformatically selected representative sequences for these five antibody groups. Note that all these sequences have been subjected to a template-based error-correction procedure in the pipeline processing. A “divide-and-conquer” procedure (See Materials and Methods) was used for sequence calculation. Remarkably, the representative sequence was 100% identical to the “true” sequence used as input for 454 pyrosequencing for VRC-PG04_cog, gVRC-H3_d74, and gVRC-H6_d74, while having one 1-nucleotide deletion and two 1-nucleotide insertions for VRC01 and VRC03, respectively. None had mutation errors. Such consensus-based sequence picking procedure may prove useful in the cases where a population of closely related sequences is observed on the divergence/identity plot, as indicated by a densely populated island.

454 Pyrosequencing of Donor 39 IGHV4 Family and Bioinformatics Analysis of Heavy Chains

We next performed 454 pyrosequencing of PGT135–137-related heavy-chain transcripts from donor 39 PBMCs. mRNA from ∼5 million PBMCs was used for reverse transcription to produce template cDNA, and PCR was used to amplify IgG and IgM heavy-chain sequences from the IGHV4 family using forward primers that overlapped the end of the V gene leader sequence and the start of the V region and reverse primers covering the start of the constant domain (Table S1 in Supplementary Material).

Next-generation pyrosequencing provided 918,298 reads, which were processed with a bioinformatics pipeline that involved assignment of germline origin genes, 454 pyrosequencing-error correction, and extraction of CDR H3 regions for lineage assignment. Overall about 85.3% of the raw reads spanned over 400 nucleotides, covering the entire variable domain. After computational assignment of V, D, and J gene components, 142,842 sequences were assigned to IGHV4-39 germline family, accounting for ∼16% of the expressed VH4 antibodyome. Each sequence was subjected to an automatic error-correction scheme. For donor 39 heavy chains, the correction procedure improved the accuracy of protein translation, measured by protein sequence identity to inferred gemline gene, by an average of 20.4%. The results for pipeline processing of heavy-chain data set are listed in Figure S3 in Supplementary Material.

First, germline family analyses were performed using two standard methods – IMGT (Brochet et al., 2008) and IgBLAST (see text footnote 1; Table 3). These analyses assigned PGT135–137 gene origins to IGHV4-39 with two possible alleles (*03 or *07), to three potential D genes, and the J gene IGHJ5*02. An analysis of the third complementarity-determining region of the heavy chain (CDR H3) showed 80–90% sequence identity between PGT135–137, suggestive of a common lineage. The likely clonal origin of PGT135–137 indicates that they will all have the same V(D)J origin, with the different origin gene assignments by IMGT and IgBLAST likely due to their high divergence of ∼20% from ancestral gene.

TABLE 3

Table 3. Recombination origins of antibodies PGT135–137.

Second, a divergence/identity analysis of 454 pyrosequencing-derived sequences assigned to IGHV4-39 origin was performed (Figure 2). The IGHV4-39-related sequences revealed a maximum divergence of 30.4% and an average divergence of 7.7% from germline. An island of sequences was observed at ∼90% identity to PGT137 with divergence of 20–25% from VH4-39, indicative of PGT137-related antibodies with similar evolutionary distance from the origin.

FIGURE 2

Figure 2. Repertoire of donor 39 heavy-chain variable-domain sequences of IGHV4-39 origin determined by 454 pyrosequencing. After processing by a standard bioinformatics pipeline (see Materials and Methods), 1,41,298 full-length, heavy-chain variable-domain sequences from IGHV4-39 germline family were obtained. These are plotted as a function of sequence identity to the heavy-chain variable domain of PGT135 (A), PGT136 (B), and PGT137 (C) and of sequence divergence from inferred IGHV4-39 germline allele. Color coding indicates the number of sequences. The 10%-identity gap indicates that the sequences within the upper island in 2C are somatic variants of PGT137 and not caused by sequencing errors.

Third, intra-donor phylogenetic analysis (see Materials and Methods) was applied to identify the somatic variants of PGT135–137 from the donor 39 heavy-chain sequencing data. In this analysis, a set of clonally related template antibodies is used to interrogate sequences from the same donor using phylogenetic analysis. Phylogenetic analysis, using a tree rooted by the inferred germline gene IGHV4-39*07, produced a ML dendrogram with 202 heavy-chain variable-domain sequences identified by their co-segregation with PGT135–137 (Figure 3). Most of the intra-donor-identified sequences clustered with PGT137, and one sequence clustered with PGT136.

FIGURE 3

Figure 3. Evolutionary similarity of PGT135–137 to donor 39 heavy-chain variable-domain sequences. Germline-rooted maximum-likelihood tree of PGT135–137 and 202 sequences identified by the iterative intra-donor phylogenetic analysis of donor 39 heavy-chain variable domain sequences determined by 454 pyrosequencing. The iterative intra-donor phylogenetic analysis was based on an implementation of neighbor-joining (NJ) method. Collapsed branches are indicated by Collapse {N: M}, in which N is the branch depth (number of intermediate nodes) and M is the number of sequences within the branch. All sequences are on the PGT137 branch except for 174091, which is somatically related to PGT136.

Fourth, CDR H3 variation was analyzed for the 202 PGT135–137-related heavy-chain variable-domain sequences. One hundred seven were found to have identical CDR H3 sequences, as the same as the nucleotide-sequence consensus. With a maximum of five mutations from the consensus, the average CDR H3 variation was 1.2, indicative of a rather conserved signature of PGT135–137 lineage.

PGT135–137 Somatic Heavy-Chain Populations and Functional Characterization

To gain insight into the functional diversity of the antibodies identified by 454 pyrosequencing and bioinformatics methods, a clustering procedure was used to analyze the 202 identified heavy chains and to select representative sequences for further characterization. We used BLASTClust (Altschul et al., 1997) clustering function and an identity cutoff of 95% to sample the natural variation. We chose this cutoff to be greater than the ∼1.6% “false” sequence variation induced by 454 pyrosequencing errors (Table 2). A total of 15 clusters emerged. In the BLASTClust output, the first sequence of each cluster was selected to “represent” the cluster (Figure 4A) and were synthesized and reconstituted with the PGT137 light chain for functional assessment of HIV-1 neutralization, which was carried out on two viruses sensitive to PGT135–137 antibodies. Out of 15 tested heavy-chain variable domain sequences, when paired with PGT137 light chain, 11 reconstituted antibodies showed neutralization to different extents (Table 4).

TABLE 4

Table 4. Neutralization titers of 21 chimeric antibodies derived from 454 pyrosequencing of donor 39 against HIV-1 pseudoviruses from clade A and clade D.

FIGURE 4

Figure 4. Sequence selection for functional characterization of heavy chains from donor 39. (A) Divergence/identity analysis of 15 heavy-chain variable-domain sequences obtained from the clustering analysis of 202 sequences identified by intra-donor phylogenetic analysis. Sequences of IGHV4-39 origin are plotted as a function of sequence identity to PGT137 heavy chain and sequence divergence from inferred germline allele, with 15 selected sequences shown as black triangles and their amino-acid consensus as red triangle. (B) Percent population of 15 clusters obtained using a sequence identity cutoff of 95%. Each cluster is indicated by its representative sequence. “Frequency” refers to the total number of sequences observed for each cluster. (C) Protein sequences of 15 cluster representatives and their amino-acid consensus. Sequences are aligned to the inferred germline gene, IGHV4-39*07. Framework regions (FR) and complementarity-determining regions (CDRs) are based on Kabat nomenclature. Amino acids mutated from the germline gene are shown in red.

The two largest clusters, with 136 and 46 sequences, respectively, accounted for ∼90% of the sequences (Figure 4B), while 10 of the 15 clusters contained only a single member. A consensus sequence (ConsAA), calculated from the alignment of 15 representative sequences (Figure 4C), was also synthesized. Notably, the reconstituted amino-acid consensus displayed neutralization almost on par with wild-type PGT137 (Table 4).

Despite their apparent clonality, the clustering procedure reveals 15 clusters. The topology of the dendrogram produced from phylogenetic analysis indicates that these 15 clusters represent populations of somatically related antibodies evolving along distinct branches by standard mechanisms of hypermutation (Figure 3). We analyzed these 15 somatic populations for prevalence of mutations, insertions, and deletions (Table S2 in Supplementary Material). Note that the representative sequence of cluster 1 (#844305) contained two insertions in the CDR H3 region which were not seen in other members of the cluster, suggesting that these insertions might be sequencing errors. Indeed, this heavy chain could not be expressed when reconstituted with PGT137 light chain. We also analyzed each of these populations by divergence/identity plot (Figure 5). Overall, sequences chosen to represent the 15 somatic populations showed diverse neutralization characteristics (Table S2 in Supplementary Material). Some antibodies, for example from clusters 2, 3, 14, and 15 (gVRC-H1_d39, gVRC-H2_d39, gVRC-H9_d39, and gVRC-H10_d39), neutralized clade A – RW020.2 and clade D – UG024.2 with roughly equal potency. Some antibodies, for example from clusters 4, 5, 8, and 10 (gVRC-H3-H6_d39), neutralized clade A-RW020.2 25-150-fold more potently than clade D. While the antibody from cluster 13 (gVRC-H8_d39) neutralized clade D – UG024.2 with at least 100-fold greater potency than clade A. These results provide an example for how somatically related antibodies can significantly differ in their neutralization specificities. This begins to provide insight into how populations of somatically related antibodies can engender neutralization breadth significantly different than any individual member.

FIGURE 5

Figure 5. Divergence/identity analysis of heavy-chain neutralization. (A) The expressed heavy-chain sequences color-coded based on the neutralization potency of reconstituted antibodies, with IC50 <1.0 for both viruses shown in red (effective neutralizers), IC50 >50.0 for both viruses in black (non-neutralizers), and other cases in gray (weak neutralizers). The amino-acid consensus, when reconstituted with PGT137 light chain, neutralized both viruses with an IC50 <1.0 and is shown as a red hollow star. (B) The three largest clusters are displayed on the enlarged divergence/identity plot, with 136, 46, and 7 members, respectively.

454 Pyrosequencing of Donor 39 IGKV3 Family and Bioinformatics Analysis of Light Chains

We next performed 454 pyrosequencing of PGT135–137-related light-chain transcripts from donor 39 PBMCs. mRNA from ∼5 million PBMCs was used for reverse transcription to produce template cDNA, and PCR was used to amplify light-chain sequences from the IGKV3 family.

The 454 pyrosequencing provided 971,165 reads, which were then processed using a pipeline adapted for κ-chain analysis. For donor 39, about 83.3% of the raw reads were 400 nt or longer, effectively covering the light-chain variable domain. After V and J gene assignment, 91,951 sequences were determined to belong to IGKV3-15 germline family, accounting for 10% of the light chain reads obtained. After error correction, the accuracy of protein translation measured by the protein sequence identity to inferred gemline gene was improved by an average of 16.5%. The results for pipeline processing of light-chain data set are listed in Figure S4 in Supplementary Material.

First, the recombination origins of PGT135–137 light chains were analyzed (Table 3). PGT135–137 light chains were assigned to the same germline V gene allele, IGKV3-15*01, recombined with the same J gene, IGKJ1*01, supporting the notion that the discrepancy in heavy-chain germline assignment was likely an artifact caused by their high divergence.

Second, the divergence/identity analysis of 454 pyrosequencing-derived sequences assigned to the IGKV3-15*01 origin was performed (Figure 6). The IGKV3-15*01-related sequences revealed a maximum divergence of 20.9% and an average divergence of 6.3% from germline. Distinct sequence islands were observed at ∼100% identity to PGT136 and 95% identity to PGT137 – both with divergence of 10–15% from IGKV3-15*01. No distinct sequence island was observed that was closely related to PGT135.

FIGURE 6

Figure 6. Repertoire of donor 39 light-chain variable-domain sequences of IGKV3-15 origin determined by 454 pyrosequencing. After processed by a standard bioinformatics pipeline, 87,229 full-length, light-chain variable-domain sequences from IGKV3-15 germline family are plotted as a function of sequence identity to the light-chain variable-domain of PGT135 (A), PGT136 (B), and PGT137 (C) and of sequence divergence from inferred IGKV3-15 germline allele. Color coding indicates the number of sequences.

Third, to identify light-chain somatic variants, we performed intra-donor phylogenetic analysis that combined an iterative NJ procedure for the high-throughput screening of sequencing data, and a ML calculation to confirm the NJ analysis and to provide the final dendrogram (see Materials and Methods). Two methods were usually in agreement, e.g., for donor 39 heavy chains, but differed here. The NJ-based analysis yielded 72 sequences within the PGT135–137 subtree, whereas the subsequent ML-based analysis retained 57 of the 72 sequences within the PGT135–137 subtree (Figure 7), providing an example for functional characterization of similar but somatically unrelated sequences.

FIGURE 7

Figure 7. Evolutionary similarity of PGT135–137 to donor 39 light-chain variable-domain sequences. Germline-rooted maximum-likelihood tree of PGT135–137 and 72 sequences identified by the iterative intra-donor phylogenetic analysis of donor 39 light-chain variable-domain sequences determined by 454 pyrosequencing. The iterative intra-donor phylogenetic analysis was based on an implementation of neighbor-joining (NJ) method. Collapsed branches are indicated by Collapse {N: M}, as in Figure 3. Sequences that are immediately outside the maximum-likelihood-defined PGT135–137 subtree are circled in blue dashed-line.

PGT135–137 Somatic Light-Chain Populations and Functional Characterization

By using the same 95% clustering procedure as for heavy chains, 14 light-chain clusters were identified from the phylogenetic tree. Representative sequences were selected, also as described for heavy chains, from the first 10 clusters for functional characterization (Figure 8A). We analyzed these 10 clusters for prevalence of mutations, insertions, and deletions (Table S3 in Supplementary Material). The largest cluster, lying within the population of PGT137-like sequences, contained 45 sequences or 63% of the subtree sequences (Figure 8B). All selected light-chain sequences possessed CDR L3s of the same length except for the sequences selected from the clusters 2 and 3 (Figure 8C). Out of 10 tested light-chain variable domain sequences, when reconstituted with the PGT137 heavy chain, six antibodies – representing six sequence clusters – showed neutralization of two HIV-1 strains from clade A and clade D. Notably, two of the light chains (gVRC-L1_d39 and gVRC-L2_d39) showed neutralization breadth slightly better than PGT135–137, and the light-chain variants neutralized clade A about 10-fold more effectively than the clade D (Table 4).

FIGURE 8

Figure 8. Sequence selection for functional characterization of light chains from donor 39. (A) Divergence/identity analysis of 10 light-chain variable-domain sequences obtained from the clustering analysis of 72 sequences identified by intra-donor phylogenetic analysis. Sequences of IGKV3-15*01 origin are plotted as a function of sequence identity to PGT137 light chain and sequence divergence from inferred germline allele, with 10 selected sequences shown as black triangles. (B) Percent population of 10 clusters obtained using a sequence identity cutoff of 95%. Each cluster is indicated by its representative sequence. (C) Protein sequences of 10 cluster representatives. Sequences are aligned to the inferred germline gene, IGKV3-15*01. Framework regions (FR) and complementarity-determining regions (CDRs) are based on Kabat nomenclature. Amino acids mutated from the germline gene are shown in red.

In contrast to the 454 pyrosequencing-identified heavy chains, the six neutralizing light-chain clusters were not localized to a single divergence/identity island (Figure 9). Indeed, neutralization was observed with clusters from at least three diverse locations on the divergence/identity plot. Nevertheless, the topology of the light-chain phylogenetic analysis indicates that these six clusters represent populations of somatically related antibodies (Figure 7).

FIGURE 9

Figure 9. Divergence/identity analysis of light-chain neutralization. (A) The expressed light-chain sequences color-coded based on the neutralization potency of reconstituted antibodies, with IC50 <1.0 for both viruses shown in red (effective neutralizers), IC50 >50.0 for both viruses in black (non-neutralizers), and other cases in gray (weak neutralizers). (B) The three largest clusters are displayed on the enlarged divergence/identity plot, with 45, 6, and 4 members, respectively.

Discussion

Recently, select antibodies with the ability to neutralize diverse strains of HIV-1 have been identified in HIV-1 infected donors (Walker et al., 2009, 2011; Corti et al., 2010; Wu et al., 2010, 2011; Scheid et al., 2011). Like PGT135–137, antibodies from these donors often appear to be clonally related, to possess similar neutralization characteristics, and to cluster in a localized island (or islands) on identity/diversity plots. These islands observed in 454 pyrosequencing-derived analyses are often nearby but rarely overlap the few antibodies experimentally isolated from the same individual (even if they start with samples of exactly the same time point, as we have done here with donor 39). The differences between antibodies identified from sorting of memory B cells or by 454 pyrosequencing of B cell transcripts suggest that the experimental approaches may capture or sample different B cell population. In addition to exploring differences in phenotype of antibody identified by the two methods, we also explored differences related to the quantity of identified antibody. In particular, we ask whether the less-sparse view of the antibody repertoire provided by next-generation sequencing and systems-level bioinformatics might provide insight into the diversity of the antibody response.

With the heavy chains of PGT135–137, select sequences representing 15 distinct populations, showed dramatically different neutralization characteristics toward clade A and D viruses when reconstituted with the same light chain from PGT137. With the light chains of PGT135–137, select sequences representing 10 distinct populations were not localized to a discrete sequence island, indicating substantial differences in identity and diversity (Figure 8). Thus, even though these antibodies are somatically related, both their neutralization and sequence characteristics can diverge substantially (Table 4). These results demonstrate the utility of next-generation sequencing, which provides a more comprehensive sampling of sequences, and of systems-level bioinformatics approaches, which enable these data to be mined effectively. Overall, data-intensive methods may be generally required to obtain true insight into questions of biological diversity such as the humoral immune response.

Prior next-generation sequencing and bioinformatics analyses have revealed the extraordinary genetic diversity of HIV-1 (Eriksson et al., 2008; Archer et al., 2009; Tsibris et al., 2009; Fischer et al., 2010). These same methods are now beginning to reveal the extraordinary diversity of antibodies generated in response to HIV-1 infection (Wu et al., 2011). Although this response appears to provide little benefit to the HIV-1-infected host (Poignard et al., 1999), if similar responses could be generated through vaccination, then in principle effective protection could be achieved in the setting of initial infection (Burton, 2002; Burton et al., 2004, 2005). The populations of antibodies we identify here may provide broader protection than a monoclonal member of the group. Furthermore, responses to infection or vaccination would be expected to generate diverse populations of antibodies, as we have shown here. Thus, population diversity, even within a single antibody clone or lineage, is likely to have a substantial impact on the effectiveness of the immune response.

Data Deposition

Next-generation sequencing data from donor 39 (heavy and light chains) and also for the 10 plasmid control have been deposited in the National Center for Biotechnology Information Short Reads Archives (SRA) under accession no. SRA055820. Information deposited with GenBank includes the heavy- and light-chain variable region sequences of genomically identified neutralizers: 10 heavy chains, gVRC-H1-10_d39 (JX313021-30), amino-acid consensus heavy-chain gVRC-H11_d39 (JX444560), and 6 light chains, gVRC-L1-6_d39 (JX313030-36).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank H. Coleman, M. Park, B. Schmidt, and A. Young for 454 pyrosequencing at the NIH Intramural Sequencing Center (NISC), J. Stuckey for assistance with figures. We also thank members of the Structural Biology Section and Structural Bioinformatics Core, Vaccine Research Center, for discussions or comments on the manuscript. We would like to thank all the study participants and research staff at each of the Protocol G clinical centers, and all of the Protocol G team members, the IAVI Human Immunology Laboratory, and all of the Protocol G clinical investigators, specifically, George Miiro, Anton Pozniak, Dale McPhee, Olivier Manigart, Etienne Karita, Andre Inwoley, Walter Jaoko, Jack DeHovitz, Linda-Gail Bekker, Punnee Pitisuttithum, Robert Paris, Jennifer Serwanga, and Susan Allen. Support for this work was provided by the Intramural Research Program of the Vaccine Research Center, National Institute of Allergy and Infectious Diseases and the National Human Genome Research Institute, National Institutes of Health, and by grants from the International AIDS Vaccine Initiative’s Neutralizing Antibody Consortium.

Supplementary Material

The Supplementary Material for this article can be found on line at http://www.frontiersin.org/Virology/10.3389/fmicb.2012.00315/abstract

Figure S1. Pipeline processing of heavy-chain sequences of 10 plasmid antibodies determined by 454 pyrosequencing.

Figure S2. Pyrosequencing-induced sequence variation for 10 plasmid antibodies after being processed by an error-correction procedure.

Figure S3. Pipeline processing of donor 39 heavy-chain sequences determined by 454 pyrosequencing.

Figure S4. Pipeline processing of donor 39 light-chain sequences determined by 454 pyrosequencing.

Table S1. PCR primers and DNA polymerase systems used to prepare samples for 454 pyrosequencing.

Table S2. Neutralization of reconstituted antibodies by pairing clustering-selected heavy-chain sequences from 454 pyrosequencing with PGT137 light chain.

Table S3. Neutralization of reconstituted antibodies by pairing clustering-selected light-chain sequences from 454 pyrosequencing with PGT137 heavy chain.

Footnotes

References

Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J. H., Zhang, Z., Miller, W., and Lipman, D. J. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Archer, J., Braverman, M. S., Taillon, B. E., Desany, B., James, I., Harrigan, P. R., Lewis, M., and Robertson, D. L. (2009). Detection of low-frequency pretherapy chemokine (CXC motif) receptor 4 (CXCR4)-using HIV-1 with ultra-deep pyrosequencing. AIDS 23, 1209–1218.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Balazs, A. B., Chen, J., Hong, C. M., Rao, D. S., Yang, L., and Baltimore, D. (2011). Antibody-based protection against HIV infection by vectored immunoprophylaxis. Nature 481, 81–84.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Barouch, D. H., and Nabel, G. J. (2005). Adenovirus vector-based vaccines for human immunodeficiency virus type 1. Hum. Gene Ther. 16, 149–156.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Boyd, S. D., Gaeta, B. A., Jackson, K. J., Fire, A. Z., Marshall, E. L., Merker, J. D., Maniar, J. M., Zhang, L. N., Sahaf, B., Jones, C. D., Simen, B. B., Hanczaruk, B., Nguyen, K. D., Nadeau, K. C., Egholm, M., Miklos, D. B., Zehnder, J. L., and Collins, A. M. (2010). Individual variation in the germline Ig gene repertoire inferred from variable region gene rearrangements. J. Immunol. 184, 6986–6992.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Brochet, X., Lefranc, M.-P., and Giudicelli, V. (2008). IMGT/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis. Nucleic Acids Res. 36, W503–W508.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Brockman, W., Alvarez, P., Young, S., Garber, M., Giannoukos, G., Lee, W. L., Russ, C., Lander, E. S., Nusbaum, C., and Jaffe, D. B. (2008). Quality scores and SNP detection in sequencing-by-synthesis systems. Genome Res. 18, 763–770.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Burton, D. R. (2002). Antibodies, viruses and vaccines. Nat. Rev. Immunol. 2, 706–713.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Burton, D. R., Desrosiers, R. C., Doms, R. W., Koff, W. C., Kwong, P. D., Moore, J. P., Nabel, G. J., Sodroski, J., Wilson, I. A., and Wyatt, R. T. (2004). HIV vaccine design and the neutralizing antibody problem. Nat. Immunol. 5, 233–236.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Burton, D. R., Stanfield, R. L., and Wilson, I. A. (2005). Antibody vs. HIV in a clash of evolutionary titans. Proc. Natl. Acad. Sci. U.S.A. 102, 14943–14948.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chen, R., Mias, G. I., Li-Pook-Than, J., Jiang, L., Lam, H. Y., Chen, R., Miriami, E., Karczewski, K. J., Hariharan, M., Dewey, F. E., Cheng, Y., Clark, M. J., Im, H., Habegger, L., Balasubramanian, S., O’Huallachain, M., Dudley, J. T., Hillenmeyer, S., Haraksingh, R., Sharon, D., Euskirchen, G., Lacroute, P., Bettinger, K., Boyle, A. P., Kasowski, M., Grubert, F., Seki, S., Garcia, M., Whirl-Carrillo, M., Gallardo, M., Blasco, M. A., Greenberg, P. L., Snyder, P., Klein, T. E., Altman, R. B., Butte, A. J., Ashley, E. A., Gerstein, M., Nadeau, K. C., Tang, H., and Snyder, M. (2012). Personal omics profiling reveals dynamic molecular and medical phenotypes. Cell 148, 1293–1307.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Corti, D., Langedijk, J. P., Hinz, A., Seaman, M. S., Vanzetta, F., Fernandez-Rodriguez, B. M., Silacci, C., Pinna, D., Jarrossay, D., Balla-Jhagjhoorsingh, S., Willems, B., Zekveld, M. J., Dreja, H., O’Sullivan, E., Pade, C., Orkin, C., Jeffs, S. A., Montefiori, D. C., Davis, D., Weissenhorn, W., McKnight, A., Heeney, J. L., Sallusto, F., Sattentau, Q. J., Weiss, R. A., and Lanzavecchia, A. (2010). Analysis of memory B cell responses and isolation of novel monoclonal antibodies with neutralizing breadth from HIV-1-infected individuals. PLoS ONE 5, e8805. doi:10.1371/journal.pone.0008805

CrossRef Full Text

Doria-Rose, N. A., Klein, R. M., Daniels, M. G., O’Dell, S., Nason, M., Lapedes, A., Bhattacharya, T., Migueles, S. A., Wyatt, R. T., Korber, B. T., Mascola, J. R., and Connors, M. (2010). Breadth of human immunodeficiency virus-specific neutralizing activity in sera: clustering analysis and association with clinical variables. J. Virol. 84, 1631–1636.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Eriksson, N., Pachter, L., Mitsuya, Y., Rhee, S.-Y., Wang, C., Gharizadeh, B., Ronaghi, M., Shafer, R. W., and Beerenwinkel, N. (2008). Viral population estimation using pyrosequencing. PLoS Comput. Biol. 4, e1000074. doi:10.1371/journal.pcbi.1000074

CrossRef Full Text

Fischer, W., Ganusov, V. V., Giorgi, E. E., Hraber, P. T., Keele, B. F., Leitner, T., Han, C. S., Gleasner, C. D., Green, L., Lo, C.-C., Nag, A., Wallstrom, T. C., Wang, S., McMichael, A. J., Haynes, B. F., Hahn, B. H., Perelson, A. S., Borrow, P., Shaw, G. M., Bhattacharya, T., and Korber, B. T. (2010). Transmission of single HIV-1 genomes and dynamics of early immune escape revealed by ultra-deep sequencing. PLoS ONE 5, e12303. doi:10.1371/journal.pone.0012303

CrossRef Full Text

Gnanakaran, S., Daniels, M. G., Bhattacharya, T., Lapedes, A. S., Sethi, A., Li, M., Tang, H., Greene, K., Gao, H., Haynes, B. F., Cohen, M. S., Shaw, G. M., Seaman, M. S., Kumar, A., Gao, F., Montefiori, D. C., and Korber, B. (2010). Genetic signatures in the envelope glycoproteins of HIV-1 that associate with broadly neutralizing antibodies. PLoS Comput. Biol. 6, e1000955. doi:10.1371/journal.pcbi.1000955

CrossRef Full Text

Gonda, M. A., Wongstaal, F., Gallo, R. C., Clements, J. E., Narayan, O., and Gilden, R. V. (1985). Sequence homology and morphologic similarity of HTLV-III and Visna virus, a pathogenic lenti-virus. Science 227, 173–177.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gray, E. S., Taylor, N., Wycuff, D., Moore, P. L., Tomaras, G. D., Wibmer, C. K., Puren, A., Decamp, A., Gilbert, P. B., Wood, B., Montefiori, D. C., Binley, J. M., Shaw, G. M., Haynes, B. F., Mascola, J. R., and Morris, L. (2009). Antibody specificities associated with neutralization breadth in plasma from human immunodeficiency virus type 1 subtype C-infected blood donors. J. Virol. 83, 8925–8937.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Hawkins, R. D., Hon, G. C., and Ren, B. (2010). Next-generation genomics: an integrative approach. Nat. Rev. Genet. 11, 476–486.

Pubmed Abstract | Pubmed Full Text

Hessell, A. J., Poignard, P., Hunter, M., Hangartner, L., Tehrani, D. M., Bleeker, W. K., Parren, P. W., Marx, P. A., and Burton, D. R. (2009a). Effective, low-titer antibody protection against low-dose repeated mucosal SHIV challenge in macaques. Nat. Med. 15, 951–954.

CrossRef Full Text

Hessell, A. J., Rakasz, E. G., Poignard, P., Hangartner, L., Landucci, G., Forthal, D. N., Koff, W. C., Watkins, D. I., and Burton, D. R. (2009b). Broadly neutralizing human anti-HIV antibody 2G12 is effective in protection against mucosal SHIV challenge even at low serum neutralizing titers. PLoS Pathog. 5, e1000433. doi:10.1371/journal.ppat.1000433

CrossRef Full Text

Huson, D. H., Richter, D. C., Rausch, C., Dezulian, T., Franz, M., and Rupp, R. (2007). Dendroscope: An interactive viewer for large phylogenetic trees. BMC Bioinformatics 8, 460. doi:10.1186/1471-2105-8-460

CrossRef Full Text

Kahn, S. D. (2011). On the future of genomic data. Science 331, 728–729.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kitano, H. (2002). Systems biology: a brief overview. Science 295, 1662–1664.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Korber, B., Gaschen, B., Yusim, K., Thakallapally, R., Kesmir, C., and Detours, V. (2001). Evolutionary and immunological implications of contemporary HIV-1 variation. Br. Med. Bull. 58, 19–42.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kuhner, M. K., and Felsenstein, J. (1994). A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Mol. Biol. Evol. 11, 459–468.

Pubmed Abstract | Pubmed Full Text

Kwong, P. D., Mascola, J. R., and Nabel, G. J. (2011). Rational Design of vaccines to elicit broadly neutralizing antibodies to HIV-1. Cold Spring Harb. Perspect. Med. 1, a007278.

Pubmed Abstract | Pubmed Full Text

Lander, E. S., Linton, L. M., Birren, B., Nusbaum, C., Zody, M. C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., Fitzhugh, W., Funke, R., Gage, D., Harris, K., Heaford, A., Howland, J., Kann, L., Lehoczky, J., Levine, R., McEwan, P., McKernan, K., Meldrim, J., Mesirov, J. P., Miranda, C., Morris, W., Naylor, J., Raymond, C., Rosetti, M., Santos, R., Sheridan, A., Sougnez, C., Stange-Thomann, N., Stojanovic, N., Subramanian, A., Wyman, D., Rogers, J., Sulston, J., Ainscough, R., Beck, S., Bentley, D., Burton, J., Clee, C., Carter, N., Coulson, A., Deadman, R., Deloukas, P., Dunham, A., Dunham, I., Durbin, R., French, L., Grafham, D., Gregory, S., Hubbard, T., Humphray, S., Hunt, A., Jones, M., Lloyd, C., McMurray, A., Matthews, L., Mercer, S., Milne, S., Mullikin, J. C., Mungall, A., Plumb, R., Ross, M., Shownkeen, R., Sims, S., Waterston, R. H., Wilson, R. K., Hillier, L. W., McPherson, J. D., Marra, M. A., Mardis, E. R., Fulton, L. A., Chinwalla, A. T., Pepin, K. H., Gish, W. R., Chissoe, S. L., Wendl, M. C., Delehaunty, K. D., Miner, T. L., Delehaunty, A., Kramer, J. B., Cook, L. L., Fulton, R. S., Johnson, D. L., Minx, P. J., Clifton, S. W., Hawkins, T., Branscomb, E., Predki, P., Richardson, P., Wenning, S., Slezak, T., Doggett, N., Cheng, J. F., Olsen, A., Lucas, S., Elkin, C., Uberbacher, E., Frazier, M., Gibbs, R. A., Muzny, D. M., Scherer, S. E., Bouck, J. B., Sodergren, E. J., Worley, K. C., Rives, C. M., Gorrell, J. H., Metzker, M. L., Naylor, S. L., Kucherlapati, R. S., Nelson, D. L., Weinstock, G. M., Sakaki, Y., Fujiyama, A., Hattori, M., Yada, T., Toyoda, A., Itoh, T., Kawagoe, C., Watanabe, H., Totoki, Y., Taylor, T., Weissenbach, J., Heilig, R., Saurin, W., Artiguenave, F., Brottier, P., Bruls, T., Pelletier, E., Robert, C., Wincker, P., Smith, D. R., Doucette-Stamm, L., Rubenfield, M., Weinstock, K., Lee, H. M., Dubois, J., Rosenthal, A., Platzer, M., Nyakatura, G., Taudien, S., Rump, A., Yang, H., Yu, J., Wang, J., Huang, G., Gu, J., Hood, L., Rowen, L., Madan, A., Qin, S., Davis, R. W., Federspiel, N. A., Abola, A. P., Proctor, M. J., Myers, R. M., Schmutz, J., Dickson, M., Grimwood, J., Cox, D. R., Olson, M. V., Kaul, R., Raymond, C., Shimizu, N., Kawasaki, K., Minoshima, S., Evans, G. A., Athanasiou, M., Schultz, R., Roe, B. A., Chen, F., Pan, H., Ramser, J., Lehrach, H., Reinhardt, R., McCombie, W. R., de la Bastide, M., Dedhia, N., Blöcker, H., Hornischer, K., Nordsiek, G., Agarwala, R., Aravind, L., Bailey, J. A., Bateman, A., Batzoglou, S., Birney, E., Bork, P., Brown, D. G., Burge, C. B., Cerutti, L., Chen, H. C., Church, D., Clamp, M., Copley, R. R., Doerks, T., Eddy, S. R., Eichler, E. E., Furey, T. S., Galagan, J., Gilbert, J. G., Harmon, C., Hayashizaki, Y., Haussler, D., Hermjakob, H., Hokamp, K., Jang, W., Johnson, L. S., Jones, T. A., Kasif, S., Kaspryzk, A., Kennedy, S., Kent, W. J., Kitts, P., Koonin, E. V., Korf, I., Kulp, D., Lancet, D., Lowe, T. M., McLysaght, A., Mikkelsen, T., Moran, J. V., Mulder, N., Pollara, V. J., Ponting, C. P., Schuler, G., Schultz, J., Slater, G., Smit, A. F., Stupka, E., Szustakowski, J., Thierry-Mieg, D., Thierry-Mieg, J., Wagner, L., Wallis, J., Wheeler, R., Williams, A., Wolf, Y. I., Wolfe, K. H., Yang, S. P., Yeh, R. F., Collins, F., Guyer, M. S., Peterson, J., Felsenfeld, A., Wetterstrand, K. A., Patrinos, A., Morgan, M. J., de Jong, P., Catanese, J. J., Osoegawa, K., Shizuya, H., Choi, S., Chen, Y. J., International Human Genome Sequencing Consortium. (2001). Initial sequencing and analysis of the human genome. Nature 409, 860–921.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Larkin, M. A., Blackshields, G., Brown, N. P., Chenna, R., McGettigan, P. A., McWilliam, H., Valentin, F., Wallace, I. M., Wilm, A., Lopez, R., Thompson, J. D., Gibson, T. J., and Higgins, D. G. (2007). Clustal W and clustal X version 2.0 . Bioinformatics 23, 2947–2948.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Li, M., Gao, F., Mascola, J. R., Stamatatos, L., Polonis, V. R., Koutsoukos, M., Voss, G., Goepfert, P., Gilbert, P., Greene, K. M., Bilska, M., Kothe, D. L., Salazar-Gonzalez, J. F., Wei, X., Decker, J. M., Hahn, B. H., and Montefiori, D. C. (2005). Human immunodeficiency virus type 1 env clones from acute and early subtype B infections for standardized assessments of vaccine-elicited neutralizing antibodies. J. Virol. 79, 10108–10125.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Li, Y., Migueles, S. A., Welcher, B., Svehla, K., Phogat, A., Louder, M. K., Wu, X., Shaw, G. M., Connors, M., Wyatt, R. T., and Mascola, J. R. (2007). Broad HIV-1 neutralization mediated by CD4-binding site antibodies. Nat. Med. 13, 1032–1034.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Mardis, E. R. (2008a). The impact of next-generation sequencing technology on genetics. Trends Genet. 24, 133–141.

CrossRef Full Text

Mardis, E. R. (2008b). Next-generation DNA sequencing methods. Annu. Rev. Genomics Hum. Genet. 9, 387–402.

CrossRef Full Text

Mascola, J. R. (2003). Defining the protective antibody response for HIV-1. Curr. Mol. Med. 3, 209–216.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Mascola, J. R., Lewis, M. G., Stiegler, G., Harris, D., Vancott, T. C., Hayes, D., Louder, M. K., Brown, C. R., Sapan, C. V., Frankel, S. S., Lu, Y., Robb, M. L., Katinger, H., and Birx, D. L. (1999). Protection of macaques against pathogenic simian/human immunodeficiency virus 89.6PD by passive transfer of neutralizing antibodies. J. Virol. 73, 4009–4018.

Pubmed Abstract | Pubmed Full Text

Mascola, J. R., Stiegler, G., Vancott, T. C., Katinger, H., Carpenter, C. B., Hanson, C. E., Beary, H., Hayes, D., Frankel, S. S., Birx, D. L., and Lewis, M. G. (2000). Protection of macaques against vaginal transmission of a pathogenic HIV- 1/SIV chimeric virus by passive infusion of neutralizing antibodies. Nat. Med. 6, 207–210.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Parren, P. W., Marx, P. A., Hessell, A. J., Luckay, A., Harouse, J., Cheng-Mayer, C., Moore, J. P., and Burton, D. R. (2001). Antibody protects macaques against vaginal challenge with a pathogenic R5 simian/human immunodeficiency virus at serum levels giving complete neutralization in vitro. J. Virol. 75, 8340–8347.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Parren, P. W., Moore, J. P., Burton, D. R., and Sattentau, Q. J. (1999). The neutralizing antibody response to HIV-1: viral evasion and escape from humoral immunity. AIDS 13, S137–S162.

Pubmed Abstract | Pubmed Full Text

Poignard, P., Sabbe, R., Picchio, G. R., Wang, M., Gulizia, R. J., Katinger, H., Parren, P. W., Mosier, D. E., and Burton, D. R. (1999). Neutralizing antibodies have limited effects on the control of established HIV-1 infection in vivo. Immunity 10, 431–438.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Prabakaran, P., Streaker, E., Chen, W., and Dimitrov, D. S. (2011). 454 antibody sequencing – error characterization and correction. BMC Res. Notes 4, 404. doi:10.1186/1756-0500-4-404

CrossRef Full Text

Preston, B. D., Poiesz, B. J., and Loeb, L. A. (1988). Fidelity of HIV-1 reverse transcriptase. Science 242, 1168–1171.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Reddy, S. T., Ge, X., Miklos, A. E., Hughes, R. A., Kang, S. H., Hoi, K. H., Chrysostomou, C., Hunicke-Smith, S. P., Iverson, B. L., Tucker, P. W., Ellington, A. D., and Georgiou, G. (2009). Monoclonal antibodies isolated without screening by analyzing the variable-gene repertoire of plasma cells. Nat. Biotechnol. 28, U965–U969.

CrossRef Full Text

Reddy, S. T., and Georgiou, G. (2011). Systems analysis of adaptive immunity by utilization of high-throughput technologies. Curr. Opin. Biotechnol. 22, 584–589.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Sather, D. N., Armann, J., Ching, L. K., Mavrantoni, A., Sellhorn, G., Caldwell, Z., Yu, X., Wood, B., Self, S., Kalams, S., and Stamatatos, L. (2009). Factors associated with the development of cross-reactive neutralizing antibodies during human immunodeficiency virus type 1 infection. J. Virol. 83, 757–769.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Scheid, J. F., Mouquet, H., Feldhahn, N., Seaman, M. S., Velinzon, K., Pietzsch, J., Ott, R. G., Anthony, R. M., Zebroski, H., Hurley, A., Phogat, A., Chakrabarti, B., Li, Y., Connors, M., Pereyra, F., Walker, B. D., Wardemann, H., Ho, D., Wyatt, R. T., Mascola, J. R., Ravetch, J. V., and Nussenzweig, M. C. (2009). Broad diversity of neutralizing antibodies isolated from memory B cells in HIV-infected individuals. Nature 458, 636–640.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Scheid, J. F., Mouquet, H., Ueberheide, B., Diskin, R., Klein, F., Oliveira, T. Y., Pietzsch, J., Fenyo, D., Abadir, A., Velinzon, K., Hurley, A., Myung, S., Boulad, F., Poignard, P., Burton, D. R., Pereyra, F., Ho, D. D., Walker, B. D., Seaman, M. S., Bjorkman, P. J., Chait, B. T., and Nussenzweig, M. C. (2011). Sequence and structural convergence of broad and potent HIV antibodies that mimic CD4 binding. Science 333, 1633–1637.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Seaman, M. S., Janes, H., Hawkins, N., Grandpre, L. E., Devoy, C., Giri, A., Coffey, R. T., Harris, L., Wood, B., Daniels, M. G., Bhattacharya, T., Lapedes, A., Polonis, V. R., McCutchan, F. E., Gilbert, P. B., Self, S. G., Korber, B. T., Montefiori, D. C., and Mascola, J. R. (2010). Tiered categorization of a diverse panel of HIV-1 Env pseudoviruses for neutralizing antibody assessment. J. Virol. 84, 1439–1452.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Simek, M. D., Rida, W., Priddy, F. H., Pung, P., Carrow, E., Laufer, D. S., Lehrman, J. K., Boaz, M., Tarragona-Fiol, T., Miiro, G., Birungi, J., Pozniak, A., McPhee, D. A., Manigart, O., Karita, E., Inwoley, A., Jaoko, W., Dehovitz, J., Bekker, L. G., Pitisuttithum, P., Paris, R., Walker, L. M., Poignard, P., Wrin, T., Fast, P. E., Burton, D. R., and Koff, W. C. (2009). Human immunodeficiency virus type 1 elite neutralizers: individuals with broad and potent neutralizing activity identified by using a high-throughput neutralization assay together with an analytical selection algorithm. J. Virol. 83, 7337–7348.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Sonigo, P., Alizon, M., Staskus, K., Klatzmann, D., Cole, S., Danos, O., Retzel, E., Tiollais, P., Haase, A., and Wainhobson, S. (1985). Nucleotide-sequence of the visna lentivirus – relationship to the AIDS virus. Cell 42, 369–382.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Stamatatos, L., Morris, L., Burton, D. R., and Mascola, J. R. (2009). Neutralizing antibodies generated during natural HIV-1 infection: good news for an HIV-1 vaccine? Nat. Med. 15, 866–870.

Pubmed Abstract | Pubmed Full Text

Starcich, B. R., Hahn, B. H., Shaw, G. M., McNeely, P. D., Modrow, S., Wolf, H., Parks, E. S., Parks, W. P., Josephs, S. F., Gallo, R. C., and Wongstaal, F. (1986). Identification and characterization of conserved and variable regions in the envelope gene of HTLV-III/LAV, the retrovirus of AIDS. Cell 45, 637–648.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Tsibris, A. M. N., Korber, B., Arnaout, R., Russ, C., Lo, C.-C., Leitner, T., Gaschen, B., Theiler, J., Paredes, R., Su, Z., Hughes, M. D., Gulick, R. M., Greaves, W., Coakley, E., Flexner, C., Nusbaum, C., and Kuritzkes, D. R. (2009). Quantitative deep sequencing reveals dynamic HIV-1 escape and large population shifts during CCR5 antagonist therapy in vivo. PLoS ONE 4, e5683. doi:10.1371/journal.pone.0005683

CrossRef Full Text

UNAIDS. (2010). Joint United Nations Programme on HIV/AIDS. UN Report on the Global AIDS Epidemic 2010. Available at: http://www.unaids.org/globalreport/Global_report.htm

Veazey, R. S., Shattock, R. J., Pope, M., Kirijan, J. C., Jones, J., Hu, Q., Ketas, T., Marx, P. A., Klasse, P. J., Burton, D. R., and Moore, J. P. (2003). Prevention of virus transmission to macaque monkeys by a vaginally applied monoclonal antibody to HIV-1 gp120. Nat. Med. 9, 343–346.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Venter, J. C., Adams, M. D., Myers, E. W., Li, P. W., Mural, R. J., Sutton, G. G., Smith, H. O., Yandell, M., Evans, C. A., Holt, R. A., Gocayne, J. D., Amanatides, P., Ballew, R. M., Huson, D. H., Wortman, J. R., Zhang, Q., Kodira, C. D., Zheng, X. Q. H., Chen, L., Skupski, M., Subramanian, G., Thomas, P. D., Zhang, J. H., Miklos, G. L. G., Nelson, C., Broder, S., Clark, A. G., Nadeau, C., McKusick, V. A., Zinder, N., Levine, A. J., Roberts, R. J., Simon, M., Slayman, C., Hunkapiller, M., Bolanos, R., Delcher, A., Dew, I., Fasulo, D., Flanigan, M., Florea, L., Halpern, A., Hannenhalli, S., Kravitz, S., Levy, S., Mobarry, C., Reinert, K., Remington, K., Abu-Threideh, J., Beasley, E., Biddick, K., Bonazzi, V., Brandon, R., Cargill, M., Chandramouliswaran, I., Charlab, R., Chaturvedi, K., Deng, Z. M., Di Francesco, V., Dunn, P., Eilbeck, K., Evangelista, C., Gabrielian, A. E., Gan, W., Ge, W. M., Gong, F. C., Gu, Z. P., Guan, P., Heiman, T. J., Higgins, M. E., Ji, R. R., Ke, Z. X., Ketchum, K. A., Lai, Z. W., Lei, Y. D., Li, Z. Y., Li, J. Y., Liang, Y., Lin, X. Y., Lu, F., Merkulov, G. V., Milshina, N., Moore, H. M., Naik, A. K., Narayan, V. A., Neelam, B., Nusskern, D., Rusch, D. B., Salzberg, S., Shao, W., Shue, B. X., Sun, J. T., Wang, Z. Y., Wang, A. H., Wang, X., Wang, J., Wei, M. H., Wides, R., Xiao, C. L., Yan, C. H., Yao, A., Ye, J., Zhan, M., Zhang, W., Zhang, H., Zhao, Q., Zheng, L., Zhong, F., Zhong, W., Zhu, S., Zhao, S., Gilbert, D., Baumhueter, S., Spier, G., Carter, C., Cravchik, A., Woodage, T., Ali, F., An, H., Awe, A., Baldwin, D., Baden, H., Barnstead, M., Barrow, I., Beeson, K., Busam, D., Carver, A., Center, A., Cheng, M. L., Curry, L., Danaher, S., Davenport, L., Desilets, R., Dietz, S., Dodson, K., Doup, L., Ferriera, S., Garg, N., Gluecksmann, A., Hart, B., Haynes, J., Haynes, C., Heiner, C., Hladun, S., Hostin, D., Houck, J., Howland, T., Ibegwam, C., Johnson, J., Kalush, F., Kline, L., Koduru, S., Love, A., Mann, F., May, D., McCawley, S., McIntosh, T., McMullen, I., Moy, M., Moy, L., Murphy, B., Nelson, K., Pfannkoch, C., Pratts, E., Puri, V., Qureshi, H., Reardon, M., Rodriguez, R., Rogers, Y. H., Romblad, D., Ruhfel, B., Scott, R., Sitter, C., Smallwood, M., Stewart, E., Strong, R., Suh, E., Thomas, R., Tint, N. N., Tse, S., Vech, C., Wang, G., Wetter, J., Williams, S., Williams, M., Windsor, S., Winn-Deen, E., Wolfe, K., Zaveri, J., Zaveri, K., Abril, J. F., Guigó, R., Campbell, M. J., Sjolander, K. V., Karlak, B., Kejariwal, A., Mi, H., Lazareva, B., Hatton, T., Narechania, A., Diemer, K., Muruganujan, A., Guo, N., Sato, S., Bafna, V., Istrail, S., Lippert, R., Schwartz, R., Walenz, B., Yooseph, S., Allen, D., Basu, A., Baxendale, J., Blick, L., Caminha, M., Carnes-Stine, J., Caulk, P., Chiang, Y. H., Coyne, M., Dahlke, C., Mays, A., Dombroski, M., Donnelly, M., Ely, D., Esparham, S., Fosler, C., Gire, H., Glanowski, S., Glasser, K., Glodek, A., Gorokhov, M., Graham, K., Gropman, B., Harris, M., Heil, J., Henderson, S., Hoover, J., Jennings, D., Jordan, C., Jordan, J., Kasha, J., Kagan, L., Kraft, C., Levitsky, A., Lewis, M., Liu, X., Lopez, J., Ma, D., Majoros, W., McDaniel, J., Murphy, S., Newman, M., Nguyen, T., Nguyen, N., Nodell, M., Pan, S., Peck, J., Peterson, M., Rowe, W., Sanders, R., Scott, J., Simpson, M., Smith, T., Sprague, A., Stockwell, T., Turner, R., Venter, E., Wang, M., Wen, M., Wu, D., Wu, M., Xia, A., Zandieh, A., and Zhu, X. (2001). The sequence of the human genome. Science 291, 1304–1351.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Walker, L. M., and Burton, D. R. (2010). Rational antibody-based HIV-1 vaccine design: current approaches and future directions. Curr. Opin. Immunol. 22, 358–366.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Walker, L. M., Huber, M., Doores, K. J., Falkowska, E., Pejchal, R., Julien, J. P., Wang, S. K., Ramos, A., Chan-Hui, P. Y., Moyle, M., Mitcham, J. L., Hammond, P. W., Olsen, O. A., Phung, P., Fling, S., Wong, C. H., Phogat, S., Wrin, T., Simek, M. D., Koff, W. C., Wilson, I. A., Burton, D. R., and Poignard, P. (2011). Broad neutralization coverage of HIV by multiple highly potent antibodies. Nature 477, 466–470.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Walker, L. M., Phogat, S. K., Chan-Hui, P. Y., Wagner, D., Phung, P., Goss, J. L., Wrin, T., Simek, M. D., Fling, S., Mitcham, J. L., Lehrman, J. K., Priddy, F. H., Olsen, O. A., Frey, S. M., Hammond, P. W., Kaminsky, S., Zamb, T., Moyle, M., Koff, W. C., Poignard, P., and Burton, D. R. (2009). Broad and potent neutralizing antibodies from an African donor reveal a new HIV-1 vaccine target. Science 326, 285–289.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wei, X., Decker, J. M., Wang, S., Hui, H., Kappes, J. C., Wu, X., Salazar-Gonzalez, J. F., Salazar, M. G., Kilby, J. M., Saag, M. S., Komarova, N. L., Nowak, M. A., Hahn, B. H., Kwong, P. D., and Shaw, G. M. (2003). Antibody neutralization and escape by HIV-1. Nature 422, 307–312.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wu, X., Yang, Z. Y., Li, Y., Hogerkorp, C. M., Schief, W. R., Seaman, M. S., Zhou, T., Schmidt, S. D., Wu, L., Xu, L., Longo, N. S., McKee, K., O’Dell, S., Louder, M. K., Wycuff, D. L., Feng, Y., Nason, M., Doria-Rose, N., Connors, M., Kwong, P. D., Roederer, M., Wyatt, R. T., Nabel, G. J., and Mascola, J. R. (2010). Rational design of envelope identifies broadly neutralizing human monoclonal antibodies to HIV-1. Science 329, 856–861.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wu, X., Zhou, T., O’Dell, S., Wyatt, R. T., Kwong, P. D., and Mascola, J. R. (2009). Mechanism of human immunodeficiency virus type 1 resistance to monoclonal antibody B12 that effectively targets the site of CD4 attachment. J. Virol. 83, 10892–10907.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wu, X., Zhou, T., Zhu, J., Zhang, B., Georgiev, I., Wang, C., Chen, X., Longo, N. S., Louder, M., McKee, K., O’Dell, S., Perfetto, S., Schmidt, S. D., Shi, W., Wu, L., Yang, Y., Yang, Z. Y., Yang, Z., Zhang, Z., Bonsignori, M., Crump, J. A., Kapiga, S. H., Sam, N. E., Haynes, B. F., Simek, M., Burton, D. R., Koff, W. C., Doria-Rose, N. A., Connors, M., Mullikin, J. C., Nabel, G. J., Roederer, M., Shapiro, L., Kwong, P. D., and Mascola, J. R. (2011). Focused evolution of HIV-1 neutralizing antibodies revealed by structures and deep sequencing. Science 333, 1593–1602.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Keywords: antibody bioinformatics, high-throughput sequencing, HIV-1, immunity, N-linked glycan

Citation: Zhu J, O’Dell S, Ofek G, Pancera M, Wu X, Zhang B, Zhang Z, NISC Comparative Sequencing Program, Mullikin JC, Simek M, Burton DR, Koff WC, Shapiro L, Mascola JR and Kwong PD (2012) Somatic populations of PGT135–137 HIV-1-neutralizing antibodies identified by 454 pyrosequencing and bioinformatics. Front. Microbio. 3:315. doi: 10.3389/fmicb.2012.00315

Received: 12 June 2012; Paper pending published: 04 July 2012;
Accepted: 13 August 2012; Published online: 11 September 2012.

Edited by:

Hironori Sato, National Institute of Infectious Diseases, Japan

Reviewed by:

Takamasa Takeuchi, National Institute of Infectious Diseases, Japan
Dimiter Dimitrov, National Institutes of Health, USA

Copyright: © 2012 Zhu, O’Dell, Ofek, Pancera, Wu, Zhang, Zhang, NISC Comparative Sequencing Program, Mullikin, Simek, Burton, Koff, Shapiro, Mascola and Kwong. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.

*Correspondence: Peter D. Kwong, Vaccine Research Center, NIAID/NIH, 40 Convent Drive, Building 40, Room 4508, Bethesda, MD 20892, USA. e-mail: pdkwong@nih.gov

^†Sijy O’Dell and Gilad Ofek and Marie Pancera and Xueling Wu and Baoshan Zhang and Zhenhai Zhang have contributed equally to this work.

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.