Lithuanian morphologically annotated corpus - MATAS

PID

MATAS v0.2 - Morphologically Annotated Lithuanian Corpus (manually checked)

Contains 4 parts: Documents (21%), Fiction (19%), Periodicals (36%), Scientific texts (24%)

Wordform count: 1,641,263 Files: 92 Encoding: UTF-8

Tagset: Human-readable (Lithuanian tags) e.g.

Date: 2014.08.06

Please use the following text to cite this item: Rimkutė E., Daudaravičius V., Utka A. 2007: Morphological Annotation of the Lithuanian Corpus. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics; Workshop Balto-Slavonic Natural Language Processing 2007, Prague, 94–99.

Licence: CLARIN-LT ACA

Identifier
PID http://hdl.handle.net/99999/9
Metadata Access https://clarin.vdu.lt/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:clarin.vdu.lt:20.500.11821/9
Provenance
Creator Rimkutė, Erika
Publisher Vytautas Magnus University
Publication Year 2016
Rights ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT; https://clarin.vdu.lt/licenses/eula/ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm; ACA
OpenAccess true
Contact info(at)clarin.vdu.lt
Representation
Language Lithuanian
Resource Type corpus
Format application/zip; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics