Tibidabo Treebank and IULA Spanish LSP Treebank Train and Test Partitions

DOI

This package contains a partition of the Iula Spanish LSP Treebank into train and test sets to perform Machine Learning experiments. In that way the same partitions can be used by different researchers and their results can be directly compared. In this package we also deliver the Tibidabo Treebank (Marimon 2010) which contains a set of sentences extracted from Ancora corpus annotated in the same way than the Iula Treebank. Tibidabo Treebank is a very good test set for models trained with Iula Spanish LSP Treebank since the sentences that form it from a very different domain than those of the Iula Spanish LSP Treebank.

Identifier
DOI https://doi.org/10.34810/data314
Metadata Access https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data314
Provenance
Creator Institut Universitari de Lingüística Aplicada; Marimon, Montserrat ORCID logo
Publisher CORA.Repositori de Dades de Recerca
Publication Year 2023
Rights Custom Dataset Terms; info:eu-repo/semantics/openAccess; https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.2/customlicense?persistentId=doi:10.34810/data314
OpenAccess true
Representation
Resource Type Textual data; Dataset
Format text/plain; application/zip
Size 1731; 5519589
Version 1.2
Discipline Other