Meloidogyne enterolobii E1834 gene prediction

DOI

Results of EuGene annotation on the M. enterolobii E1834 nuclear genome. Gene models prediction was done with the fully automated pipeline EuGene-EP (v1.6.5, Sallet et al., 2019). EuGene has been configured to integrate similarities with known proteins of Caenorhabditis elegans (PRJNA13758) from WormBase Parasite (Howe et al., 2017) and “nematoda” section of UniProtKB/Swiss-Prot library (UniProt Consortium, 2018), with the prior exclusion of proteins that were similar to those present in RepBase (Bao et al., 2015). The dataset of Meloidogyne enterolobii transcribed sequences (Koutsovoulos et al., 2020) was aligned on the genome and used by EuGene as transcription evidence. Only the alignments of datasets on the genome spanning 30% of the transcript length with at least 97% identity were retained. The EuGene default configuration was edited to set the “preserve” parameter to 1 for all datasets, the “gmap_intron_filter” parameter to 1 and the minimum intron length to 35 bp. Finally, the Nematodes-specific Weight Array Method matrices were used to score the splice sites (available at this URL: http://eugene.toulouse.inra.fr/Downloads/WAM_nematodes_20171017.tar.gz). Using the automated Eugene-EP pipeline, a total of 49,870 genes were predicted, with 45,924 being protein-coding genes and 3,946 being non-protein-coding genes such as rRNA, tRNA, and splice leader genes.

Identifier
DOI https://doi.org/10.57745/Y0O2LP
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.57745/Y0O2LP
Provenance
Creator Poullet, Marine ORCID logo; Danchin, G.J. Etienne ORCID logo; Rancurel, Corinne ORCID logo
Publisher Recherche Data Gouv
Contributor Poullet, Marine; GAME
Publication Year 2024
Rights etalab 2.0; info:eu-repo/semantics/openAccess; https://spdx.org/licenses/etalab-2.0.html
OpenAccess true
Contact Poullet, Marine (INRAE)
Representation
Resource Type Dataset
Format application/octet-stream; application/vnd.ms-excel
Size 89923015; 14642; 47267145; 93025568; 1034; 52106884; 3499677; 19297530
Version 1.0
Discipline Agriculture, Forestry, Horticulture; Computer Science