IULA Spanish-English Technical Corpus

DOI

The corpus consists of a number of specialized texts (Law, Economics, Medicine, Environment and Computer Science domains) available in both Spanish and English languages. This LSP corpus has been compiled with articles from specialized Publications, PhD theses, etc./nIt contains about a total of about 2,1 M words in 127 documents in each language.

Identifier
DOI https://doi.org/10.34810/data268
Metadata Access https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data268
Provenance
Creator Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
Publisher CORA.Repositori de Dades de Recerca
Publication Year 2023
Rights Custom Dataset Terms; info:eu-repo/semantics/openAccess; https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data268
OpenAccess true
Representation
Resource Type Textual data; Dataset
Format application/pdf; application/zip; application/vnd.openxmlformats-officedocument.wordprocessingml.document; text/plain; text/html; text/xml
Size 172184; 90970804; 35720; 254; 12919; 12954; 1273
Version 1.0
Discipline Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Humanities; Life Sciences; Social Sciences; Social and Behavioural Sciences; Soil Sciences