IAR 2.0: An Algorithm for Refining Inconsistent Annotations for Time-Series Data Using Discriminative Classifiers

The performance of discriminative machine-learning classifiers, such as neural networks, is limited by training label inconsistencies. Even expert-based annotations can suffer from label inconsistencies, especially in the case of ambiguous phenomena-to-annotate. To address this, we propose a novel a...

Full description

Saved in:
Bibliographic Details
Main Authors: Einari Vaaras, Manu Airaksinen, Okko Rasanen
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10854471/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832575610519552000
author Einari Vaaras
Manu Airaksinen
Okko Rasanen
author_facet Einari Vaaras
Manu Airaksinen
Okko Rasanen
author_sort Einari Vaaras
collection DOAJ
description The performance of discriminative machine-learning classifiers, such as neural networks, is limited by training label inconsistencies. Even expert-based annotations can suffer from label inconsistencies, especially in the case of ambiguous phenomena-to-annotate. To address this, we propose a novel algorithm, iterative annotation refinement (IAR) 2.0, for refining inconsistent annotations for time-series data. IAR 2.0 uses a procedure that utilizes discriminative classifiers to iteratively combine original annotations with increasingly accurate posterior estimates of classes present in the data. Unlike most existing label refinement approaches, IAR 2.0 offers a simpler yet effective solution for resolving ambiguities in training labels, working with real label noise on time-series data instead of synthetic label noise on image data. We demonstrate the effectiveness of our algorithm through five distinct classification tasks on two highly distinct data modalities. As a result, we show that the labels produced by IAR 2.0 systematically improve classifier performance compared to using the original labels or a previous state-of-the-art method for label refinement. We also conduct a set of controlled simulations to systematically investigate when IAR 2.0 fails to improve on the original training labels. The simulation results demonstrate that IAR 2.0 improves performance in nearly all tested conditions. We also find that the decrease in performance when IAR 2.0 fails is small compared to the average performance gain when IAR 2.0 succeeds, encouraging the use of IAR 2.0 even when the nature of data is unknown. The code is freely available at <uri>https://github.com/SPEECHCOG/IAR_2</uri>.
format Article
id doaj-art-b26fa0c3b0724744a3613e21ee4b3a21
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-b26fa0c3b0724744a3613e21ee4b3a212025-01-31T23:05:13ZengIEEEIEEE Access2169-35362025-01-0113199791999510.1109/ACCESS.2025.353463710854471IAR 2.0: An Algorithm for Refining Inconsistent Annotations for Time-Series Data Using Discriminative ClassifiersEinari Vaaras0https://orcid.org/0000-0002-8714-6090Manu Airaksinen1Okko Rasanen2https://orcid.org/0000-0002-0537-0946Signal Processing Research Centre, Tampere University, Tampere, FinlandBABA Center, University of Helsinki, Helsinki, FinlandSignal Processing Research Centre, Tampere University, Tampere, FinlandThe performance of discriminative machine-learning classifiers, such as neural networks, is limited by training label inconsistencies. Even expert-based annotations can suffer from label inconsistencies, especially in the case of ambiguous phenomena-to-annotate. To address this, we propose a novel algorithm, iterative annotation refinement (IAR) 2.0, for refining inconsistent annotations for time-series data. IAR 2.0 uses a procedure that utilizes discriminative classifiers to iteratively combine original annotations with increasingly accurate posterior estimates of classes present in the data. Unlike most existing label refinement approaches, IAR 2.0 offers a simpler yet effective solution for resolving ambiguities in training labels, working with real label noise on time-series data instead of synthetic label noise on image data. We demonstrate the effectiveness of our algorithm through five distinct classification tasks on two highly distinct data modalities. As a result, we show that the labels produced by IAR 2.0 systematically improve classifier performance compared to using the original labels or a previous state-of-the-art method for label refinement. We also conduct a set of controlled simulations to systematically investigate when IAR 2.0 fails to improve on the original training labels. The simulation results demonstrate that IAR 2.0 improves performance in nearly all tested conditions. We also find that the decrease in performance when IAR 2.0 fails is small compared to the average performance gain when IAR 2.0 succeeds, encouraging the use of IAR 2.0 even when the nature of data is unknown. The code is freely available at <uri>https://github.com/SPEECHCOG/IAR_2</uri>.https://ieeexplore.ieee.org/document/10854471/Annotation refinementdaylong audio recordingsdiscriminative classifiershuman activity recognitioninconsistent labelslabel refinement
spellingShingle Einari Vaaras
Manu Airaksinen
Okko Rasanen
IAR 2.0: An Algorithm for Refining Inconsistent Annotations for Time-Series Data Using Discriminative Classifiers
IEEE Access
Annotation refinement
daylong audio recordings
discriminative classifiers
human activity recognition
inconsistent labels
label refinement
title IAR 2.0: An Algorithm for Refining Inconsistent Annotations for Time-Series Data Using Discriminative Classifiers
title_full IAR 2.0: An Algorithm for Refining Inconsistent Annotations for Time-Series Data Using Discriminative Classifiers
title_fullStr IAR 2.0: An Algorithm for Refining Inconsistent Annotations for Time-Series Data Using Discriminative Classifiers
title_full_unstemmed IAR 2.0: An Algorithm for Refining Inconsistent Annotations for Time-Series Data Using Discriminative Classifiers
title_short IAR 2.0: An Algorithm for Refining Inconsistent Annotations for Time-Series Data Using Discriminative Classifiers
title_sort iar 2 0 an algorithm for refining inconsistent annotations for time series data using discriminative classifiers
topic Annotation refinement
daylong audio recordings
discriminative classifiers
human activity recognition
inconsistent labels
label refinement
url https://ieeexplore.ieee.org/document/10854471/
work_keys_str_mv AT einarivaaras iar20analgorithmforrefininginconsistentannotationsfortimeseriesdatausingdiscriminativeclassifiers
AT manuairaksinen iar20analgorithmforrefininginconsistentannotationsfortimeseriesdatausingdiscriminativeclassifiers
AT okkorasanen iar20analgorithmforrefininginconsistentannotationsfortimeseriesdatausingdiscriminativeclassifiers