ILSP Feature-based multi-tiered POS Tagger

The ILSP FBT Tagger is an adaptation of the Brill tagger trained on Greek text. It uses a PAROLE-compatible tagset of 584 tags that capture the morphosyntactic particularities of the Greek language. Working on the output of a sentence detection and tokenisation tool, the tagger first assigns initial tags by looking each token up in a lexicon created from a manually annotated corpus during training. Unknown words receive their initial tags from a suffix lexicon. A set of 799 contextual rules is then applied to improve the output of this initial phase.
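
The two-phase scheme described above (lexicon lookup with a suffix fallback, followed by contextual rules) can be illustrated with a minimal Python sketch. This is not the ILSP implementation: the lexicon entries, the coarse tags (DET, NOUN, PRON, VERB), the default tag, the maximum suffix length and the single "next tag" rule are all hypothetical placeholders chosen for illustration, and the real tagger uses the 584-tag PAROLE-compatible tagset and 799 learned rules.

```python
from typing import Dict, List, Tuple

# Hypothetical toy resources (assumptions, not the real ILSP lexicons).
LEXICON: Dict[str, str] = {"ο": "DET", "το": "DET", "σκύλος": "NOUN"}
SUFFIX_LEXICON: Dict[str, str] = {"ος": "NOUN", "ει": "VERB"}
DEFAULT_TAG = "NOUN"   # assumed fallback when no suffix matches
MAX_SUFFIX = 3         # assumed longest suffix to try

# Hypothetical contextual rule template: (from_tag, to_tag, next_tag)
# meaning "change from_tag to to_tag if the following tag is next_tag".
Rule = Tuple[str, str, str]
RULES: List[Rule] = [("DET", "PRON", "VERB")]


def initial_tag(token: str) -> str:
    """Phase 1: full-form lexicon lookup, then longest-suffix fallback."""
    if token in LEXICON:
        return LEXICON[token]
    for length in range(min(MAX_SUFFIX, len(token) - 1), 0, -1):
        suffix = token[-length:]
        if suffix in SUFFIX_LEXICON:
            return SUFFIX_LEXICON[suffix]
    return DEFAULT_TAG


def apply_rules(tags: List[str], rules: List[Rule]) -> List[str]:
    """Phase 2: apply each rule in order, scanning left to right."""
    tags = list(tags)  # work on a copy
    for from_tag, to_tag, next_tag in rules:
        for i in range(len(tags) - 1):
            if tags[i] == from_tag and tags[i + 1] == next_tag:
                tags[i] = to_tag
    return tags


def tag(tokens: List[str]) -> List[Tuple[str, str]]:
    """Tag one tokenised sentence with both phases."""
    initial = [initial_tag(t) for t in tokens]
    return list(zip(tokens, apply_rules(initial, RULES)))


if __name__ == "__main__":
    # "βλέπει" is not in the toy lexicon, so it is tagged via its suffix.
    print(tag(["ο", "σκύλος", "το", "βλέπει"]))
```

In this toy sentence the second article-like token "το" is initially tagged DET from the lexicon and is then retagged PRON by the contextual rule because a verb follows, which is the kind of correction the contextual phase is meant to make.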

Identifier
PID http://hdl.handle.net/11372/LRT-1308
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11372/LRT-1308
Provenance
Creator Papageorgiou, Haris; Prokopidis, Prokopis
Publisher ILSP/R.C. "Athena"
Contributor Prokopidis, Prokopis
Publication Year 2014
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Resource Type toolService
Format downloadable_files_count: 0
Discipline Linguistics