Complete genomic sequence of Epstein-Barr virus in nasopharyngeal carcinoma cell line C666-1
Infectious Agents and Cancer volume 8, Article number: 29 (2013)
Nasopharyngeal carcinoma (NPC) is a distinct type of head and neck cancer that is consistently associated with Epstein-Barr virus (EBV). The C666-1 cell line is the only in vitro native EBV-infected NPC cell model commonly used to study viral-host interaction. Nevertheless, the complete EBV genome sequence in this in vitro EBV-infected NPC model has not been characterized.
This study aimed to determine the complete EBV genome sequence in C666-1 cells.
The C666-1 genome was sequenced by 100-base paired-end massively parallel sequencing. Bioinformatics analysis was performed to extract the EBV sequences and construct an EBV consensus sequence map. PCR amplification and Sanger DNA sequencing were used for sequence validation and gap filling. A phylogenetic analysis of the EBV strain in C666-1 cells and other reported EBV strains was performed.
A 171,317 bp complete EBV genome of C666-1 was successfully constructed (GenBank accession number: KC617875). Phylogenetic analysis of the EBV genome in C666-1 revealed that the C666-1 EBV strain is closely related to the reported strains in NPC primary tumors.
C666-1 contains a representative NPC-associated EBV genome and might serve as an important model for studying the roles or functions of viral proteins in NPC tumorigenesis.
Nasopharyngeal carcinoma (NPC) is a distinct type of head and neck cancer that is consistently associated with Epstein-Barr virus (EBV). Detection of clonal EBV genomes in both precancerous lesions and invasive cancers indicates that latent EBV infection is an early event in NPC tumorigenesis. Since we established and reported the EBV-positive NPC cell line C666-1 about fifteen years ago, it has been widely used to investigate host-viral interaction, to elucidate the function and transcriptional regulation of EBV-encoded latent genes and miRNAs, and to develop EBV-targeting therapeutic strategies. The cell line was derived from an undifferentiated NPC biopsy of a Hong Kong patient. It contains a normal episomal EBV genome and shows a latency II EBV gene expression pattern. A number of studies have demonstrated distinct NF-κB, STAT3, AKT and NOTCH pathways in this cell line as well as in in vivo samples, including EBV-positive NPC xenografts (e.g., C15, C17, xeno-2117) and primary tumors. Recently, two novel EBV-encoded microRNAs, miR-BART21 and miR-BART22, were discovered in this EBV-positive epithelial cell line.
Although C666-1 is the only in vitro native EBV-infected NPC model worldwide, the EBV genome in this cell line has not been fully characterized until now. To facilitate EBV-related studies using this unique cell line, we constructed the EBV genome map through bioinformatic analysis and experimental validation of our recent whole-genome deep sequencing results (Additional file 1: Supplementary methodology). Using 100-base paired-end sequencing on an Illumina HiSeq 2000 sequencer, the C666-1 genome was sequenced at an average of >75-fold coverage, as described. A total of 2,511,210,660 reads (251 Gb) were collected from the sample. We combined the results of two alignment strategies: (i) aligning the reads to the human and EBV reference genomes (EBV-WT; GenBank accession number AJ507799) at the same time, and (ii) aligning the reads first to the human genome and then aligning the remaining reads to the EBV reference genome. With this approach, we extracted a total of 857,595 kb of EBV sequences from the collected C666-1 data, yielding a high coverage of 504-fold for the EBV genome. All uniquely mapped EBV sequences were assembled into a 143,734 bp consensus sequence with a read depth of at least 10 reads. We validated the poorly aligned and questionable regions and filled the gaps by PCR amplification and conventional Sanger DNA sequencing. Regions that could not be assembled (e.g., those with highly repetitive sequences) are represented by tracts of Ns, as described previously. A 171,317 bp complete EBV genome of C666-1 was constructed (Figure 1a). This newly assembled C666-1 EBV sequence was submitted to GenBank under accession number KC617875. The study was approved by the University Animal Experimentation Ethics Committee (AEEC) (13-036-MIS) of the Chinese University of Hong Kong.
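The depth-filtered consensus step can be illustrated with a minimal sketch. It is not the pipeline used in this study: the function name `consensus_from_pileup`, the toy pileup and the majority-vote calling rule are assumptions made for illustration. Only the minimum read depth of 10 and the use of N for unresolved positions come from the text.

```python
from collections import Counter

MIN_DEPTH = 10  # minimum read depth stated in the text


def consensus_from_pileup(pileup, min_depth=MIN_DEPTH):
    """Build a consensus string from per-position strings of read bases.

    `pileup` is a hypothetical structure: one string of observed bases per
    reference position. Positions covered by fewer than `min_depth` reads
    are written as 'N'; otherwise the most frequent base is taken
    (majority vote is an assumed rule, not the study's documented method).
    """
    consensus = []
    for bases in pileup:
        if len(bases) < min_depth:
            consensus.append("N")
            continue
        base, _count = Counter(bases).most_common(1)[0]
        consensus.append(base)
    return "".join(consensus)


# Toy data for illustration only, not C666-1 reads.
toy_pileup = [
    "A" * 12,           # well covered, unanimous
    "C" * 9 + "T" * 3,  # well covered, majority C
    "G" * 4,            # below the depth threshold, becomes N
    "T" * 10,           # exactly at the threshold
]
result = consensus_from_pileup(toy_pileup)
print(result)
```

In a real workflow the per-position bases would come from an alignment file rather than hand-written strings, and the N tracts would mark the regions later targeted by PCR and Sanger sequencing.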
In this study, we assembled the EBV genome in C666-1 using high-coverage genome sequencing data. Since no PCR amplification was involved, both homogeneous and heterogeneous genome variations are accurately determined. Comparison with the EBV-WT reference genomic sequence (AJ507799) revealed a total of 1,268 homogeneous and 87 heterogeneous sequence variations. These changes comprise 127 indels and 1,228 SNVs. Among the SNVs, 907 are located within coding regions and 41.3% (386/907) of them are nonsynonymous (Figure 1b). Selected SNVs were confirmed by Sanger DNA sequencing. Phylogenetic analysis of whole EBV genomes in C666-1 and the reported strains (EBV-WT, AG876, GD1, GD2, and HKNPC1) showed that C666-1 is closely related to the GD2 and HKNPC1 strains (Figure 1c) [5, 6]. It diverges greatly from the AG876 and reference EBV-WT genomes. Similar results were observed when we compared the protein sequences of various EBV lytic (BZLF1, BLLF1) and latent (EBNA1, LMP1, LMP2) genes (Figure 2). A number of studies have also shown that the BZLF1 and LMP1 sequences of isolates from Hong Kong NPC patients are distinct from those of EBV-infected lymphoid cells derived from Africa or Western countries. The findings imply that C666-1 might serve as an important model for studying the roles or functions of viral proteins in NPC tumorigenesis. Among the four EBV strains from South China, the isolate from an NPC patient’s saliva (GD1) shows the greatest divergence from those from the tumors (C666-1, GD2, HKNPC1). This finding suggests the presence of tumor-associated EBV strain(s) in NPC patients. Nevertheless, comprehensive sequencing of EBV isolates from saliva, peripheral blood and tumor specimens in a panel of NPC patients is needed to test this hypothesis. A summary of non-synonymous SNVs in the majority of EBV-encoded lytic and latent genes of the C666-1 strain versus those of GD2 and HKNPC1 is shown in Additional file 2: Table S1.
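The distinction between synonymous and nonsynonymous coding SNVs can be sketched as a codon-level comparison under the standard genetic code. This is an illustrative sketch rather than the annotation method used in the study: the function name `classify_snv` and the example codons are hypothetical, and stop-gain changes are reported as a separate "nonsense" label although they are a subset of nonsynonymous changes.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codons enumerated in TCAG order map onto AMINO_ACIDS.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_snv(ref_codon, position, alt_base):
    """Classify a single-base change within a codon.

    `position` is the 0-based offset of the changed base in the codon.
    Returns 'synonymous', 'nonsense' (new stop codon) or 'nonsynonymous'.
    """
    alt_codon = ref_codon[:position] + alt_base + ref_codon[position + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    if ref_aa == alt_aa:
        return "synonymous"
    if alt_aa == "*":
        return "nonsense"
    return "nonsynonymous"


# Hypothetical codon changes, not taken from the C666-1 sequence.
examples = [("CTT", 2, "C"), ("GAA", 0, "A"), ("CAG", 0, "T")]
for ref, pos, alt in examples:
    print(ref, pos, alt, classify_snv(ref, pos, alt))
```

Applying such a classifier genome-wide additionally requires the reading frame and strand of each gene, which this sketch leaves out.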
In the latent genes, including EBNA1, EBNA3B/3C, LMP1 and LMP2B, high frequencies of C666-1-specific non-synonymous SNVs were observed. The prevalence and function of these SNVs in NPC require further elucidation. Previously, we demonstrated that multiple EBV-encoded BART miRNAs (miR-BART1-5p, miR-BART16 and miR-BART17-5p) target the 3′UTR of the LMP1 gene. The predicted target sequences of these three EBV-encoded BART miRNAs in the 3′UTR of the LMP1 gene are highly conserved in the NPC-derived EBV strains. In this study, we also found no polymorphism in the predicted target sequences of miR-BART1-5p, miR-BART16 and miR-BART17-5p in the C666-1 EBV strain.
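A conservation check of this kind, asking whether a predicted target window is identical across strains, can be sketched as follows. The sequences, coordinates and the helper name `polymorphic_positions` are hypothetical placeholders, not the actual LMP1 3′UTR or miRNA target sites, and the sketch assumes the strain sequences have already been aligned to the same coordinates.

```python
def polymorphic_positions(aligned_seqs, start, end):
    """Return 0-based positions in [start, end) where aligned strains differ.

    `aligned_seqs` maps a strain name to a sequence; all sequences are
    assumed to be pre-aligned and of equal length.
    """
    lengths = {len(seq) for seq in aligned_seqs.values()}
    if len(lengths) != 1:
        raise ValueError("sequences must be aligned to equal length")
    sequences = list(aligned_seqs.values())
    return [
        pos
        for pos in range(start, end)
        if len({seq[pos] for seq in sequences}) > 1
    ]


# Made-up aligned fragments for illustration only.
toy_alignment = {
    "strain_A": "ACGTTGCAAGGCTA",
    "strain_B": "ACGTTGCAAGGCTA",
    "strain_C": "ATGTTGCAAGGCTA",
}
site_hits = polymorphic_positions(toy_alignment, 4, 11)  # window without variation
flank_hits = polymorphic_positions(toy_alignment, 0, 4)  # window containing a difference
print(site_hits, flank_hits)
```

An empty list for a target window corresponds to the "no polymorphism" outcome described in the text; any returned position would flag a site worth inspecting.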
Apart from the missense mutations, a homogeneous nonsense mutation was found in the lytic gene BNRF1, which encodes an EBV major tegument protein. We confirmed the mutation in C666-1 by PCR amplification and Sanger sequencing (Figure 1d). This finding indicates a deficiency of BNRF1 protein expression in this in vitro EBV-positive NPC model. Notably, EBV with a BNRF1 deletion was reported to retain efficient lytic replication and production of mature viral particles, and no major structural alterations were found in the BNRF1-deleted virus. Further elucidation of the virus production and lytic cycle of this BNRF1-deficient C666-1 strain is needed. On the other hand, a recent study reported that BNRF1 activates transcription of the viral early gene BZLF1 by disrupting cellular DAXX-ATRX in 293 cells. Thus, BNRF1 deficiency may help to maintain the latent EBV genome in NPC cells. In addition, loss of BNRF1 in the C666-1 strain may contribute to escape from host immune responses in NPC patients, since BNRF1 is a defined target of the EBV-specific T-helper-cell response.
In summary, we delineated the whole EBV genome sequence in C666-1, which might serve as an important resource for NPC studies. The phylogenetic analysis indicates that the C666-1 strain is representative of the EBV strains associated with NPC.
Ken Kai-Yuen Tso and Kevin Yuk-Lap Yip are co-first authors.
Cheung ST, Huang DP, Hui AB, Lo KW, Ko CW, Tsang YS, Wong N, Whitney BM, Lee JC: Nasopharyngeal carcinoma cell line (C666-1) consistently harbouring Epstein-Barr virus. Int J Cancer. 1999, 83 (1): 121-126. 10.1002/(SICI)1097-0215(19990924)83:1<121::AID-IJC21>3.0.CO;2-F.
Lo KW, Chung GT, To KF: Deciphering the molecular genetic basis of NPC through molecular, cytogenetic, and epigenetic approaches. Semin Cancer Biol. 2012, 22 (2): 79-86. 10.1016/j.semcancer.2011.12.011.
Lung RW, Tong JH, Sung YM, Leung PS, Ng DC, Chau SL, Chan AW, Ng EK, Lo KW, To KF: Modulation of LMP2A expression by a newly identified Epstein-Barr virus-encoded microRNA miR-BART22. Neoplasia. 2009, 11 (11): 1174-1184.
Ju YS, Lee WC, Shin JY, Lee S, Bleazard T, Won JK, Kim YT, Kim JI, Kang JH, Seo JS: A transforming KIF5B and RET gene fusion in lung adenocarcinoma revealed from whole-genome and transcriptome sequencing. Genome Res. 2012, 22 (3): 436-445. 10.1101/gr.133645.111.
Liu P, Fang X, Feng Z, Guo YM, Peng RJ, Liu T, Huang Z, Feng Y, Sun X, Xiong Z, Guo X, Pang SS, Wang B, Lv X, Feng FT, Li DJ, Chen LZ, Feng QS, Huang WL, Zeng MS, Bei JX, Zhang Y, Zeng YX: Direct sequencing and characterization of a clinical isolate of Epstein-Barr virus from nasopharyngeal carcinoma tissue by using next-generation sequencing technology. J Virol. 2011, 85 (21): 11291-11299. 10.1128/JVI.00823-11.
Kwok H, Tong AH, Lin CH, Lok S, Farrell PJ, Kwong DL, Chiang AK: Genomic sequencing and comparative analysis of Epstein-Barr virus genome isolated from primary nasopharyngeal carcinoma biopsy. PLoS One. 2012, 7 (5): e36939. 10.1371/journal.pone.0036939.
Cheung ST, Leung SF, Lo KW, Chiu KW, Johnson PJ, Lee JCK, Huang DP: Specific latent membrane protein 1 gene sequences in type 1 and type 2 Epstein-Barr virus from nasopharyngeal carcinoma in Hong Kong. Int J Cancer. 1998, 76 (3): 399-406. 10.1002/(SICI)1097-0215(19980504)76:3<399::AID-IJC18>3.0.CO;2-6.
Tong JH, Lo KW, Au FW, Huang DP, To KF: Re: Discrete alterations in the BZLF1 promoter in tumor and non-tumor-associated Epstein-Barr virus. J Natl Cancer Inst. 2003, 95 (13): 1008-1009. 10.1093/jnci/95.13.1008.
Edwards RH, Sitki-Green D, Moore DT, Raab-Traub N: Potential selection of LMP1 variants in nasopharyngeal carcinoma. J Virol. 2004, 78 (2): 868-881. 10.1128/JVI.78.2.868-881.2004.
Lo AK, To KF, Lo KW, Lung RW, Hui JW, Liao G, Hayward SD: Modulation of LMP1 protein expression by EBV-encoded microRNAs. Proc Natl Acad Sci U S A. 2007, 104 (41): 16164-16169. 10.1073/pnas.0702896104.
Feederle R, Neuhierl B, Baldwin G, Bannert H, Hub B, Mautner J, Behrends U, Delecluse HJ: Epstein-Barr virus BNRF1 protein allows efficient transfer from the endosomal compartment to the nucleus of primary B lymphocytes. J Virol. 2006, 80 (19): 9435-9443. 10.1128/JVI.00473-06.
Tsai K, Thikmyanova N, Wojcechowskyj JA, Delecluse HJ, Lieberman PM: EBV tegument protein BNRF1 disrupts DAXX-ATRX to activate viral early gene transcription. PLoS Pathog. 2011, 7 (11): e1002376. 10.1371/journal.ppat.1002376.
The research was supported by Focused Investments Scheme-A from the Chinese University of Hong Kong, and Hong Kong Research Grant Council – GRF (471610, 471211), CRF (CUHK8/CRF/11R), Theme-Based Research Scheme (T12-403/11 and T12-401/13-R) and AoE NPC (AoE/M-06/08).
The authors declare that they have no competing interests.
KWL and KYLY designed the study; KWL, KYLY, and KFT drafted the manuscript; KKYT, KYLY and SDL participated in the bioinformatics analysis and sequence alignment; CKYM, GTYC, STC carried out the molecular genetic studies. All authors read and approved the final manuscript.
Ken Kai-Yuen Tso, Kevin Yuk-Lap Yip contributed equally to this work.
Electronic supplementary material
Additional file 2: Table S1: Non-synonymous mutations and amino acid changes commonly found in NPC tumor samples (C666-1, HKNPC1 and GD2). (DOCX 40 KB)
Cite this article
Tso, K.KY., Yip, K.YL., Mak, C.KY. et al. Complete genomic sequence of Epstein-Barr virus in nasopharyngeal carcinoma cell line C666-1. Infect Agents Cancer 8, 29 (2013). https://doi.org/10.1186/1750-9378-8-29
- Epstein-Barr virus
- Nasopharyngeal carcinoma
- Whole-genome deep sequencing
- Single-nucleotide variations
- Phylogenetic analysis