Linked machine learning models improve species classification of fungi when using error-prone long-reads on extended metabarcodes

The increased usage of long read sequencing has not been matched with publicly available databases suited for error-prone long reads when applied to metabarcodes. We address this gap and present a new method for classifying species using linked machine learning models. We demonstrate its capability for classifying species with high accuracy, show the benefit of this approach over current alignment and k-mer methods for classifying very closely related species, and suggest a confidence score cutoff of 0.85 to improve the potential for accurately identifying a target species from a mixed sample of unknown composition. Finally, we suggest future applicable use of the approach in medicine, agriculture, and biosecurity.

Identifier
Source https://data.blue-cloud.org/search-details?step=~01274C643700606AFC14239183AA1DDD20E6C1B7D46
Metadata Access https://data.blue-cloud.org/api/collections/74C643700606AFC14239183AA1DDD20E6C1B7D46
Provenance
Instrument MinION; OXFORD_NANOPORE
Publisher Blue-Cloud Data Discovery & Access service; ELIXIR-ENA
Publication Year 2024
OpenAccess true
Contact blue-cloud-support(at)maris.nl
Representation
Discipline Marine Science
Spatial Coverage (147.190W, -35.020S, 150.990E, -33.480N)
Temporal Coverage Begin 2016-01-01T00:00:00Z
Temporal Coverage End 2018-03-01T00:00:00Z