Wroclaw Corpus of Consumer Reviews Sentiment (WCCRS)

PID

Wroclaw Corpus of Consumer Reviews is a corpus of Polish reviews annotated with sentiment at the level of the whole text (text) and at the level of sentences (sentence) for the following domains: hotels, medicine, products and university (reviews). Sentences are annotated with sentiment only for hotels and medicine. Each sentence file contains a single sentence with a sentiment __label__z_X and each text file contains a single review with a sentiment __label__meta_X. Regardless a resource type, X can be: minus_m -- strong negative; minus_s -- weak negative, zero -- neutral, amb -- ambiguous, plus_s -- weak positive, plus_m -- strong positive. all* sets are groups of all domains within each text/sentence type. Train/dev/test divisions were used for the evaluation. Results are available in the following paper:

@InProceedings{Kocon2019, Title = {{Multi-level analysis and recognition of the text sentiment on the example of consumer opinions}}, Author = {Koco{\'n}, Jan and Zaśko-Zielińska, Monika and Miłkowski, Piotr}, Booktitle = {Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2019}, Year = {2019}, }

Please cite this paper if you use this resource.

Identifier
PID http://hdl.handle.net/11321/700
Metadata Access https://clarin-pl.eu/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:clarin-pl.eu:11321/700
Provenance
Creator Kocoń, Jan; Zaśko-Zielińska, Monika; Miłkowski, Piotr; Janz, Arkadiusz; Piasecki, Maciej
Publisher Wroclaw University of Science and Technology
Publication Year 2019
Rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); http://creativecommons.org/licenses/by-sa/4.0/; CC
OpenAccess true
Contact clarin-pl(at)pwr.edu.pl
Representation
Language Polish
Resource Type corpus
Format text/plain; charset=utf-8; application/zip; downloadable_files_count: 1
Discipline Linguistics