Nature Research, Nature Genetics, 2(47), p. 158-163, 2015
DOI: 10.1038/ng.3178
Full text: Download
Human papillomavirus (HPV) integration is a key genetic event in cervical carcinogenesis. By conducting whole-genome sequencing and high-throughput viral integration detection, we identified 3,667 HPV integration breakpoints in 26 cervical intraepithelial neoplasias, 104 cervical carcinomas and five cell lines. Beyond recalculating frequencies for the previously reported frequent integration sites POU5F1B (9.7%), FHIT (8.7%), KLF12 (7.8%), KLF5 (6.8%), LRP1B (5.8%) and LEPREL1 (4.9%), we discovered new hot spots HMGA2 (7.8%), DLG2 (4.9%) and SEMA3D (4.9%). Protein expression from FHIT and LRP1B was downregulated when HPV integrated in their introns. Protein expression from MYC and HMGA2 was elevated when HPV integrated into flanking regions. Moreover, microhomologous sequence between the human and HPV genomes was significantly enriched near integration breakpoints, indicating that fusion between viral and human DNA may have occurred by microhomology-mediated DNA repair pathways. Our data provide insights into HPV integration-driven cervical carcinogenesis.