ALKSNIS v2.1
ALKSNIS v2.1 consists of 2,355 syntactically annotated sentences in the PML (Prague Markup Language) format. The format allows researchers to visualise and edit syntactic trees with the TrEd editor (https://ufal.mff.cuni.cz/tred/).
Each node of a tree corresponds to a word, a punctuation mark, or another text element (symbol, digit, etc.) within a sentence. The following information is given for each node: 1) the word form as used in the text; 2) the lemma; 3) the morphology tag; and 4) the syntactic function (subject, object, etc.). Dependencies are shown as links between words.
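The node structure described above can be pictured with a short Python sketch. It does not read PML files and is not part of the corpus tooling. The class name Node, the helper render, the English example sentence, and the tag and function labels (TAG-N, Pred, and so on) are all hypothetical placeholders chosen for illustration, not values taken from ALKSNIS.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """One tree node: the four pieces of information listed above."""
    form: str       # word form as used in the text
    lemma: str      # dictionary form
    tag: str        # morphology tag (placeholder values here)
    function: str   # syntactic function (placeholder labels here)
    children: List["Node"] = field(default_factory=list)

    def add(self, child: "Node") -> "Node":
        """Attach a dependent to this node and return the dependent."""
        self.children.append(child)
        return child


def render(node: Node, depth: int = 0) -> List[str]:
    """Return the tree as indented lines, one line per node."""
    line = "  " * depth + (
        f"{node.form} [lemma={node.lemma}, tag={node.tag}, "
        f"function={node.function}]"
    )
    lines = [line]
    for child in node.children:
        lines.extend(render(child, depth + 1))
    return lines


# Hypothetical three-node tree: the verb is the root, and the subject
# and the final punctuation mark depend on it.
root = Node("bark", "bark", "TAG-V", "Pred")
root.add(Node("Dogs", "dog", "TAG-N", "Sub"))
root.add(Node(".", ".", "TAG-PUNCT", "Punct"))

print("\n".join(render(root)))
```

Each indented line is a dependent of the nearest less-indented line above it, which mirrors the links drawn between nodes in a tree editor.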
The morphology tag set of the corpus is based on the MULTEXT-East format (http://nl.ijs.si/ME/V4/msd/html/index.html). The syntactic annotation is corrected according to guidelines created by researchers at VMU CCL, following the rules of the Prague Dependency Treebank. All sentences are manually checked and corrected by a group of linguists.
The TrEd editor and a style file (“antisDplus_schema”) are needed to view files with the .pml extension.
Reference: Bielinskienė A., Boizou L., Kovalevskaitė J., Rimkutė E. 2016: Lithuanian Dependency Treebank ALKSNIS. Proceedings of the Seventh International Conference Baltic HLT 2016. Amsterdam: IOS Press, 107–114.
http://ebooks.iospress.nl/volumearticle/45523