Abstract
Sentiment analysis and opinion mining are essential tasks with many prominent application areas, e.g., when researching popular opinions on products or brands. Sentiments expressed in social media can be used for brand name monitoring and for indicating fake news. In our survey of previous work, we note that there is no large-scale social media data set with sentiment polarity annotations for Finnish. This publication aims to remedy this shortcoming by introducing a 27,000-sentence data set annotated independently with sentiment polarity by three native annotators. Because all three annotators annotated the whole data set, it provides a unique opportunity for further studies of annotator behavior over the sample annotation order. We analyze their inter-annotator agreement and provide two baselines to validate the usefulness of the data set.
Original language | English |
---|---|
Journal | Language Resources and Evaluation |
Volume | 57 |
Issue number | 2 |
Pages (from-to) | 581-609 |
Number of pages | 29 |
ISSN | 1574-020X |
DOI | |
Status | Published - June 2023 |
MoE publication type | A1 Journal article-refereed |
Fields of Science
- 6121 Languages
- 113 Computer and information sciences