Quotations, coreference resolution, and sentiment annotations in Croatian news articles: an exploratory study

izvorni znanstveni rad

izvorni znanstveni rad

Quotations, coreference resolution, and sentiment annotations in Croatian news articles: an exploratory study

Vrsta prilog sa skupa (u zborniku)
Tip izvorni znanstveni rad
Godina 2021
Nadređena publikacija Proceedings of the Conference on Digital Curation Technologies (Qurator 2021) Berlin, Germany, February 8th to 12th, 2021.
Stranice 16, 16
EISSN 1613-0073
Status objavljeno

Sažetak

This paper presents a corpus annotated for the task of direct speech extraction in Croatian. The paper focuses on the annotation of the quotation, co-reference resolution, and sentiment annotation in SETimes news corpus in Croatian and on the analysis of its language- specific differences compared to English. From this, a list of the phenomena that require special attention when performing these annotations is derived. The generated corpus with quotation features annotations can be used for multiple tasks in the field of Natural Language Processing.

Ključne riječi

reported-speech ; linguistic-phenomenon ; resource-creation