Triticum aestivum trait Corpus

The TaeC corpus is a dataset of 540 PubMed abstracts about bread wheat, manually annotated with species, phenotypes, and traits in the BioNLP-ST format. It is intended for training and evaluating named entity recognition and entity linking methods in the plant trait domain.
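
As an illustration of the BioNLP-ST standoff format mentioned above, the following minimal sketch parses text-bound annotation lines of the usual form `ID<TAB>Type start end[;start end...]<TAB>text`. The sample lines, the type names `Species` and `Trait`, and the helper names are hypothetical and are not taken from the corpus files; the actual label set and file layout should be checked against the distributed archive.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class TextBound:
    ann_id: str                   # e.g. "T1"
    ann_type: str                 # entity type label
    spans: List[Tuple[int, int]]  # character offsets; several if discontinuous
    text: str                     # surface form covered by the spans


def parse_text_bound(line: str) -> TextBound:
    """Parse one text-bound ("T") line of a BioNLP-ST standoff file."""
    ann_id, type_and_offsets, text = line.rstrip("\n").split("\t", 2)
    ann_type, offsets = type_and_offsets.split(" ", 1)
    spans = []
    # Discontinuous entities separate their fragments with ";".
    for fragment in offsets.split(";"):
        start, end = fragment.split()
        spans.append((int(start), int(end)))
    return TextBound(ann_id, ann_type, spans, text)


def parse_annotations(content: str) -> List[TextBound]:
    """Keep only text-bound lines; other annotation kinds are skipped."""
    return [parse_text_bound(l) for l in content.splitlines() if l.startswith("T")]


# Hypothetical sample, for illustration only (not from the corpus).
SAMPLE = (
    "T1\tSpecies 0 11\tbread wheat\n"
    "T2\tTrait 25 30;36 41\tgrain yield\n"
)

if __name__ == "__main__":
    for ann in parse_annotations(SAMPLE):
        print(ann)
```

Lines that do not start with `T` (for example normalization or relation lines) are ignored here; handling them would require knowing the exact conventions used in the corpus.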

Identifier
DOI https://doi.org/10.57745/GCYG3Q
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.57745/GCYG3Q
Provenance
Creator Nédellec, Claire; Sauvion, Clara; Deléger, Louise; Bossy, Robert; Zweigenbaum, Léonard
Publisher Recherche Data Gouv
Contributor Nédellec, Claire; Entrepôt-Catalogue Recherche Data Gouv
Publication Year 2023
Funding Reference ANR ANR-18-CE23-0017
Rights etalab 2.0; info:eu-repo/semantics/openAccess; https://spdx.org/licenses/etalab-2.0.html
OpenAccess true
Contact Nédellec, Claire (INRAE)
Representation
Resource Type Dataset
Format application/zip
Size 1043580
Version 1.1
Discipline Computer Science; Life Sciences; Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Medicine; Plant Breeding
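
The Metadata Access link above is an OAI-PMH `GetRecord` request. As a minimal sketch, the same URL can be assembled from its parts with the Python standard library. The function name `build_get_record_url` is hypothetical, and `urlencode` percent-encodes the `:` and `/` in the identifier, which is an equivalent form of the URL shown in the record. No network request is made here.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Endpoint taken from the Metadata Access field of this record.
OAI_ENDPOINT = "https://entrepot.recherche.data.gouv.fr/oai"


def build_get_record_url(doi: str, metadata_prefix: str = "oai_datacite") -> str:
    """Assemble an OAI-PMH GetRecord URL for a DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    return f"{OAI_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    url = build_get_record_url("10.57745/GCYG3Q")
    print(url)
    # Decoding the query string recovers the original parameter values.
    print(parse_qs(urlsplit(url).query))
```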