Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset Creation

One of the fundamental components of understanding online discourse in social networks is the identification of narratives. For example, the analysis of disinformation campaigns requires some inference about their communication goals that, in turn, requires the identification of the narratives that...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jesus M. Fraile-Hernandez, Anselmo Penas, Pablo Moral
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Narrative identification natural language processing social media analysis evaluation methodology datasets
Online Access:	https://ieeexplore.ieee.org/document/10706846/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832590355196805120
author	Jesus M. Fraile-Hernandez Anselmo Penas Pablo Moral
author_facet	Jesus M. Fraile-Hernandez Anselmo Penas Pablo Moral
author_sort	Jesus M. Fraile-Hernandez
collection	DOAJ
description	One of the fundamental components of understanding online discourse in social networks is the identification of narratives. For example, the analysis of disinformation campaigns requires some inference about their communication goals that, in turn, requires the identification of the narratives that they promote. The research in this task involves a number of challenges such as the limited availability of labelled datasets, the subjectivity of the annotators and the time cost of annotation. This article present a definition of the Narrative Identification task, proposes an evaluation framework for Narrative Identification, and a methodology for the creation and annotation of Narrative Identification datasets taking into account the subjectivity of the task. Keeping in mind the goal of comparing systems performance, we explore how to reduce the annotation time while maintaining the reliability of the evaluation. Following this methodology, a set of eight tasks for narrative identification in the political domain has been developed in Spanish and English. Finally, we validated the evaluation framework by analysing its application to DIPROMATS 2024 shared task, together with the performance analysis of baseline and participant systems.
format	Article
id	doaj-art-bd033e6e732540508ca36dfa6a1ef7ae
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-bd033e6e732540508ca36dfa6a1ef7ae2025-01-24T00:01:50ZengIEEEIEEE Access2169-35362025-01-0113117341175310.1109/ACCESS.2024.347557910706846Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset CreationJesus M. Fraile-Hernandez0https://orcid.org/0009-0001-5474-4844Anselmo Penas1https://orcid.org/0000-0002-7867-0149Pablo Moral2https://orcid.org/0000-0003-3028-4369UNED NLP & IR Group, Universidad Nacional de Educación a Distancia (UNED), Madrid, SpainUNED NLP & IR Group, Universidad Nacional de Educación a Distancia (UNED), Madrid, SpainUNED NLP & IR Group, Universidad Nacional de Educación a Distancia (UNED), Madrid, SpainOne of the fundamental components of understanding online discourse in social networks is the identification of narratives. For example, the analysis of disinformation campaigns requires some inference about their communication goals that, in turn, requires the identification of the narratives that they promote. The research in this task involves a number of challenges such as the limited availability of labelled datasets, the subjectivity of the annotators and the time cost of annotation. This article present a definition of the Narrative Identification task, proposes an evaluation framework for Narrative Identification, and a methodology for the creation and annotation of Narrative Identification datasets taking into account the subjectivity of the task. Keeping in mind the goal of comparing systems performance, we explore how to reduce the annotation time while maintaining the reliability of the evaluation. Following this methodology, a set of eight tasks for narrative identification in the political domain has been developed in Spanish and English. Finally, we validated the evaluation framework by analysing its application to DIPROMATS 2024 shared task, together with the performance analysis of baseline and participant systems.https://ieeexplore.ieee.org/document/10706846/Narrative identificationnatural language processingsocial media analysisevaluation methodologydatasets
spellingShingle	Jesus M. Fraile-Hernandez Anselmo Penas Pablo Moral Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset Creation IEEE Access Narrative identification natural language processing social media analysis evaluation methodology datasets
title	Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset Creation
title_full	Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset Creation
title_fullStr	Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset Creation
title_full_unstemmed	Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset Creation
title_short	Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset Creation
title_sort	automatic identification of narratives evaluation framework annotation methodology and dataset creation
topic	Narrative identification natural language processing social media analysis evaluation methodology datasets
url	https://ieeexplore.ieee.org/document/10706846/
work_keys_str_mv	AT jesusmfrailehernandez automaticidentificationofnarrativesevaluationframeworkannotationmethodologyanddatasetcreation AT anselmopenas automaticidentificationofnarrativesevaluationframeworkannotationmethodologyanddatasetcreation AT pablomoral automaticidentificationofnarrativesevaluationframeworkannotationmethodologyanddatasetcreation

Automatic Identification of Narratives: Evaluation Framework, Annotation Methodology, and Dataset Creation

Similar Items