Processed unique sort BAM files of High Risk HPV types 16, 18, 45 and 68b transcripts

RNA-seq raw data from RNA extracted from five cervical cancer cell lines were mapped to HPV16 (NC_001526.2 or NC_001526.4), HPV18 (NC_001357.1), HPV68b (FR751039.1) and HPV45 (X74479.1). The processed BAM files were further analysed using the HISAT2 v2.1.0 aligner, Cufflinks v2.2.1, Cuffmerge and Cuffdiff. The processed, sorted BAM files containing the sequences of viral transcripts are provided as ‘sort BAM files’. The HPV16 transcript data and their expression levels, reported as FPKM, can be visualized using IGV software. The numbers of total reads and of viral reads after mapping to the viral references are given in the ‘HR-HPV genes and isoforms FPKM tracking file’; the spliced transcripts are named CUFF. The integration sites of the four high-risk HPV types were analysed and are given in the ‘fusion file’.

According to IGV visualization, the splicing junctions of all four HPV types were found within the E6 and E1 regions. In the E6 region, one splicing donor (SD) at the 5′ end and different splicing acceptor (SA) positions at the 3′ end were found, as follows. Three splicing junctions were found in the HPV16-positive cervical cancer cell lines CaSki and SiHa: SD226^SA409(E6I), SD226^SA526(E6II) and SD226^SA742(E6X). Two splicing junctions each were found in HPV18 (HeLa): SD233^SA416(E6I) and SD233^SA635; in HPV45 (MS751): SD230^SA412(E6I) and SD230^SA640; and in HPV68b (ME180): SD129^SA311(E6I) and SD129^SA406.

Splicing junctions within the E1 region found in CaSki and SiHa were SD880^SA3358, SD880^SA3361, SD880^SA3391, SD880^SA1726, SD880^SA2405, SD880^SA2582(E1C), SD880^SA2709(E2), SD880^SA3020, SD880^SA3078, SD880^SA3329, SD577^SA6810, SD898^SA1725, SD1302^SA2709, SD1302^SA3358(E2C), SD1760^SA3391, SD1263^SA3391 and SD2309^SA3461. The other forms were SD96^SA1063, SD226^SA2709 (E6IV), SD226^SA3329, SD226^SA3358(E6*III), SD226^SA3361, SD226^SA3391 and SD579^SA6809.
Splicing junctions within the E1 region in HPV18 (HeLa) were SD929^SA2779, SD977^SA1836, SD1342^SA1436 and SD1987^SA2047, together with one splicing event within the E7 region, SD599^SA619. Those in HPV68b (ME180) were SD839^SA2586, SD683^SA2586 and SD839^SA2586. No E1 splicing junctions were found in HPV45 (MS751).
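
The SD^SA labels used in this description follow a regular pattern, so they can be converted into donor/acceptor coordinate pairs for comparison across cell lines. The following is a minimal sketch and is not part of the dataset: `Junction`, `parse_junctions` and `unique_junctions` are hypothetical helper names, and reading the numbers after SD and SA as positions on the corresponding HPV reference sequence is an assumption.

```python
import re
from typing import List, NamedTuple, Optional

# Matches labels such as "SD226^SA409(E6I)" or "SD226^SA2709 (E6IV)".
# The parenthesised isoform name is optional.
JUNCTION_RE = re.compile(r"SD(\d+)\^SA(\d+)\s*(?:\(([^)]+)\))?")


class Junction(NamedTuple):
    donor: int             # number following SD (assumed reference position)
    acceptor: int          # number following SA (assumed reference position)
    name: Optional[str]    # isoform label in parentheses, or None


def parse_junctions(text: str) -> List[Junction]:
    """Return every SD^SA junction found in text, in order of appearance."""
    return [
        Junction(int(m.group(1)), int(m.group(2)), m.group(3))
        for m in JUNCTION_RE.finditer(text)
    ]


def unique_junctions(junctions: List[Junction]) -> List[Junction]:
    """Drop repeated (donor, acceptor) pairs, keeping the first occurrence."""
    seen = set()
    result = []
    for j in junctions:
        key = (j.donor, j.acceptor)
        if key not in seen:
            seen.add(key)
            result.append(j)
    return result


if __name__ == "__main__":
    hpv16_e6 = "SD226^SA409(E6I), SD226^SA526(E6II) and SD226^SA742(E6X)"
    for junction in parse_junctions(hpv16_e6):
        print(junction.donor, junction.acceptor, junction.name)
```

Passing a list through `unique_junctions` is useful when the same junction is listed more than once for a cell line.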

Identifier
DOI https://doi.org/10.17632/fh47g3dp6t.1
PID https://nbn-resolving.org/urn:nbn:nl:ui:13-ci-tw51
Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:181581
Provenance
Creator Chaiwongkot, A
Publisher Data Archiving and Networked Services (DANS)
Contributor Arkom Chaiwongkot
Publication Year 2020
Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0
OpenAccess true
Representation
Resource Type Dataset
Discipline Other