Application of GUHA data mining method in cohort data to explore paths associated with premature death: a 29-year follow-up study

Abstract Background and Method This study set out to identify the factors and combinations of factors associated with the individual’s premature death, using data from the Finnish Longitudinal Study on Ageing Municipal Employees (FLAME) which involved 6,257 participants over a 29-year follow-up peri...

Full description

Saved in:
Bibliographic Details
Main Authors: Lily Nosraty, Esko Turunen, Saila Kyrönlahti, Clas-Håkan Nygård, Prakash KC, Subas Neupane
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Medical Research Methodology
Subjects:
Online Access:https://doi.org/10.1186/s12874-025-02477-6
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832571617804288000
author Lily Nosraty
Esko Turunen
Saila Kyrönlahti
Clas-Håkan Nygård
Prakash KC
Subas Neupane
author_facet Lily Nosraty
Esko Turunen
Saila Kyrönlahti
Clas-Håkan Nygård
Prakash KC
Subas Neupane
author_sort Lily Nosraty
collection DOAJ
description Abstract Background and Method This study set out to identify the factors and combinations of factors associated with the individual’s premature death, using data from the Finnish Longitudinal Study on Ageing Municipal Employees (FLAME) which involved 6,257 participants over a 29-year follow-up period. Exact dates of death were obtained from the Finnish population register. Premature death was defined as a death occurring earlier than the age- and sex-specific actuarial life expectancy indicated by life tables for 1981, as the baseline, with the threshold period of nine months. Explanatory variables encompassed sociodemographic characteristics, health and functioning, health behaviors, subjective experiences, working conditions, and work abilities. Data were mined using the General Unary Hypothesis Automaton (GUHA) method, implemented with LISp-Miner software. GUHA involves an active dialogue between the user and the LISp-Miner software, with parameters tailored to the data and user interests. The parameters used are not absolute but depend on the data to be mined and the user’s interests. Results Over the follow-up period, 2,196 deaths were recorded, of which 70.4% were premature. Seven single factors and 67 sets of criteria (paths) were statistically significantly associated with premature mortality, passing the one-sided Fisher test. Single predicates of premature death included smoking, consuming alcohol a few times a month or once a week, poor self-rated fitness, incompetence to work and poor assured workability in two years’ time, and diseases causing work disability. Notably, most of the factors selected as single predicates of premature mortality did not appear in the multi-predicate paths. Factors appearing in the paths were smoking more than 20 cigarettes a day, symptoms that impaired functioning, past smoking, absence of musculoskeletal diseases, poor self-rated health, having pain, male sex, being married, use of medication, more physical strain compared to others, and high life satisfaction, intention to retire due to reduced work ability caused by diseases and demanding work. Sex-specific analysis revealed similar findings. Conclusion The findings indicate that associations between single predictors and premature mortality should be interpreted with caution, even when adjusted for a limited number of other factors. This highlights the complexity of premature mortality and the need for comprehensive models considering multiple interacting factors.
format Article
id doaj-art-d8d7c43f0c1b44fe8563690eb3c5fec4
institution Kabale University
issn 1471-2288
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series BMC Medical Research Methodology
spelling doaj-art-d8d7c43f0c1b44fe8563690eb3c5fec42025-02-02T12:30:13ZengBMCBMC Medical Research Methodology1471-22882025-01-0125111110.1186/s12874-025-02477-6Application of GUHA data mining method in cohort data to explore paths associated with premature death: a 29-year follow-up studyLily Nosraty0Esko Turunen1Saila Kyrönlahti2Clas-Håkan Nygård3Prakash KC4Subas Neupane5Faculty of Social Sciences, Centre of Excellence in Research on Ageing and Care, University of HelsinkiFaculty of Information Technology and Communication Sciences, Tampere UniversitiesFaculty of Social Sciences (Health Sciences) and Gerontology Research Center (GEREC), Tampere UniversitiesFaculty of Social Sciences (Health Sciences), Tampere UniversitiesFaculty of Social Sciences (Health Sciences), Tampere UniversitiesFaculty of Social Sciences (Health Sciences) and Gerontology Research Center (GEREC), Tampere UniversitiesAbstract Background and Method This study set out to identify the factors and combinations of factors associated with the individual’s premature death, using data from the Finnish Longitudinal Study on Ageing Municipal Employees (FLAME) which involved 6,257 participants over a 29-year follow-up period. Exact dates of death were obtained from the Finnish population register. Premature death was defined as a death occurring earlier than the age- and sex-specific actuarial life expectancy indicated by life tables for 1981, as the baseline, with the threshold period of nine months. Explanatory variables encompassed sociodemographic characteristics, health and functioning, health behaviors, subjective experiences, working conditions, and work abilities. Data were mined using the General Unary Hypothesis Automaton (GUHA) method, implemented with LISp-Miner software. GUHA involves an active dialogue between the user and the LISp-Miner software, with parameters tailored to the data and user interests. The parameters used are not absolute but depend on the data to be mined and the user’s interests. Results Over the follow-up period, 2,196 deaths were recorded, of which 70.4% were premature. Seven single factors and 67 sets of criteria (paths) were statistically significantly associated with premature mortality, passing the one-sided Fisher test. Single predicates of premature death included smoking, consuming alcohol a few times a month or once a week, poor self-rated fitness, incompetence to work and poor assured workability in two years’ time, and diseases causing work disability. Notably, most of the factors selected as single predicates of premature mortality did not appear in the multi-predicate paths. Factors appearing in the paths were smoking more than 20 cigarettes a day, symptoms that impaired functioning, past smoking, absence of musculoskeletal diseases, poor self-rated health, having pain, male sex, being married, use of medication, more physical strain compared to others, and high life satisfaction, intention to retire due to reduced work ability caused by diseases and demanding work. Sex-specific analysis revealed similar findings. Conclusion The findings indicate that associations between single predictors and premature mortality should be interpreted with caution, even when adjusted for a limited number of other factors. This highlights the complexity of premature mortality and the need for comprehensive models considering multiple interacting factors.https://doi.org/10.1186/s12874-025-02477-6Midlife antecedentsGeneral Unary Hypothesis Automaton (GUHA) methodData miningMortality
spellingShingle Lily Nosraty
Esko Turunen
Saila Kyrönlahti
Clas-Håkan Nygård
Prakash KC
Subas Neupane
Application of GUHA data mining method in cohort data to explore paths associated with premature death: a 29-year follow-up study
BMC Medical Research Methodology
Midlife antecedents
General Unary Hypothesis Automaton (GUHA) method
Data mining
Mortality
title Application of GUHA data mining method in cohort data to explore paths associated with premature death: a 29-year follow-up study
title_full Application of GUHA data mining method in cohort data to explore paths associated with premature death: a 29-year follow-up study
title_fullStr Application of GUHA data mining method in cohort data to explore paths associated with premature death: a 29-year follow-up study
title_full_unstemmed Application of GUHA data mining method in cohort data to explore paths associated with premature death: a 29-year follow-up study
title_short Application of GUHA data mining method in cohort data to explore paths associated with premature death: a 29-year follow-up study
title_sort application of guha data mining method in cohort data to explore paths associated with premature death a 29 year follow up study
topic Midlife antecedents
General Unary Hypothesis Automaton (GUHA) method
Data mining
Mortality
url https://doi.org/10.1186/s12874-025-02477-6
work_keys_str_mv AT lilynosraty applicationofguhadataminingmethodincohortdatatoexplorepathsassociatedwithprematuredeatha29yearfollowupstudy
AT eskoturunen applicationofguhadataminingmethodincohortdatatoexplorepathsassociatedwithprematuredeatha29yearfollowupstudy
AT sailakyronlahti applicationofguhadataminingmethodincohortdatatoexplorepathsassociatedwithprematuredeatha29yearfollowupstudy
AT clashakannygard applicationofguhadataminingmethodincohortdatatoexplorepathsassociatedwithprematuredeatha29yearfollowupstudy
AT prakashkc applicationofguhadataminingmethodincohortdatatoexplorepathsassociatedwithprematuredeatha29yearfollowupstudy
AT subasneupane applicationofguhadataminingmethodincohortdatatoexplorepathsassociatedwithprematuredeatha29yearfollowupstudy