Q-CAT Corpus Annotation Tool 1.5

Q-CAT (Querying-Supported Corpus Annotation Tool) is a tool for manual linguistic annotation of corpora that also supports advanced queries over those annotations. It has been used in several annotation campaigns for the ssj500k reference training corpus of Slovenian (http://hdl.handle.net/11356/1210), covering named entities, dependency syntax, semantic roles and multi-word expressions, but it can also be used to add new annotation layers of various types to this or other language corpora. Q-CAT is a .NET application that runs on the Windows operating system.

Version 1.1 enables the automatic attribution of token IDs and personalized font adjustments.
Version 1.2 supports the CoNLL-U format and working with UD POS tags.
Version 1.3 supports adding new layers of annotation on top of CoNLL-U (and then saving the corpus as XML TEI).
Version 1.4 introduces new features in command line mode (filtering by sentence ID, multiple link type visualizations).
Version 1.5 supports listening to audio recordings (provided in the # sound_url comment line in CoNLL-U).
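
To illustrate the kind of input that the audio feature relies on, the following is a minimal Python sketch that collects the audio link of each sentence from a CoNLL-U file. It is not part of Q-CAT. It assumes that the link is written as a standard CoNLL-U key-value comment of the form "# sound_url = <URL>" and that each sentence also carries a "# sent_id" comment. The function name extract_sound_urls and the example.org URL are hypothetical.

```python
def extract_sound_urls(conllu_text):
    """Map each sentence's sent_id to its sound_url (assumed comment keys)."""
    urls = {}
    sent_id = None
    sound_url = None
    # The extra empty string flushes the last sentence if the text
    # does not end with a blank line.
    for line in conllu_text.splitlines() + [""]:
        line = line.strip()
        if not line:
            # A blank line closes the current sentence block.
            if sent_id is not None and sound_url is not None:
                urls[sent_id] = sound_url
            sent_id = None
            sound_url = None
        elif line.startswith("#"):
            # Comment lines are assumed to look like "# key = value".
            # Splitting on the first "=" keeps any "=" inside the URL.
            key, sep, value = line[1:].partition("=")
            if not sep:
                continue
            key = key.strip()
            value = value.strip()
            if key == "sent_id":
                sent_id = value
            elif key == "sound_url":
                sound_url = value
        # Token lines (tab-separated columns) are ignored here.
    return urls


# Hypothetical two-sentence sample; only the first has an audio link.
SAMPLE = """# sent_id = s1
# sound_url = https://example.org/audio/s1.mp3
# text = Dober dan.
1\tDober\tdober\tADJ\t_\t_\t2\tamod\t_\t_
2\tdan\tdan\tNOUN\t_\t_\t0\troot\t_\tSpaceAfter=No
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_

# sent_id = s2
# text = Hvala.
1\tHvala\thvala\tNOUN\t_\t_\t0\troot\t_\tSpaceAfter=No
2\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_
"""

if __name__ == "__main__":
    for sid, url in extract_sound_urls(SAMPLE).items():
        print(sid, url)
```

Sentences without a sound_url comment are simply left out of the result, so a corpus that mixes sentences with and without recordings can be processed in the same way.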

Identifier
PID http://hdl.handle.net/11356/1844
Related Identifier http://slovnica.ijs.si/wp-content/uploads/2019/10/Q-CAT_prirocnik.pdf
Related Identifier https://nl.ijs.si/jtdh20/pdf/JT-DH_2020_Krek-et-al_The-ssj500k-Training-Corpus-for-Slovene-Language-Processing.pdf
Related Identifier http://hdl.handle.net/11356/1684
Related Identifier https://slovenscina.eu/
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1844
Provenance
Creator Brank, Janez
Publisher Jožef Stefan Institute
Publication Year 2023
Rights Apache License 2.0; PUB; https://opensource.org/licenses/Apache-2.0
OpenAccess true
Contact info(at)clarin.si
Representation
Resource Type toolService
Format text/plain; charset=utf-8; application/octet-stream; downloadable_files_count: 1
Discipline Linguistics