Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, China

Abstract Background Although China has implemented multiple policies to encourage childbirth, the results have been underwhelming. Migrant workers account for a considerable proportion of China’s population, most of whom are of childbearing age. However, few articles focus on their fertility intenti...

Full description

Saved in:
Bibliographic Details
Main Authors: Xinghong Guo, Mingze Ma, Yiyang Chen, Zhaoyang Fang, Jingru Liu, Shuming Yan, Yifei Feng, Xinya Cheng, Jian Wu, Beizhu Ye
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Public Health
Subjects:
Online Access:https://doi.org/10.1186/s12889-025-21412-4
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832594373128224768
author Xinghong Guo
Mingze Ma
Yiyang Chen
Zhaoyang Fang
Jingru Liu
Shuming Yan
Yifei Feng
Xinya Cheng
Jian Wu
Beizhu Ye
author_facet Xinghong Guo
Mingze Ma
Yiyang Chen
Zhaoyang Fang
Jingru Liu
Shuming Yan
Yifei Feng
Xinya Cheng
Jian Wu
Beizhu Ye
author_sort Xinghong Guo
collection DOAJ
description Abstract Background Although China has implemented multiple policies to encourage childbirth, the results have been underwhelming. Migrant workers account for a considerable proportion of China’s population, most of whom are of childbearing age. However, few articles focus on their fertility intentions. Method From August 3 to August 29, 2023, we conducted a cross-sectional survey in Henan Province, China, which included 18,806 participants. Machine learning was used to construct a predictive model for the fertility intention of migrant workers, and unsupervised clustering was used to explore subgroup classification. Result Out of 18,806 participants, only 1057 had fertility intention. We constructed a predictive model for fertility intention based on XGBoost, with an AUC of 0.83. Age, number of children, and marital status are the most important characteristics that affect the fertility intention of migrant workers. Subsequently, unsupervised clustering was conducted on participants without fertility intentions, and it was found that they could be divided into three categories of population. The first group of people is the youngest and mostly unmarried without pregnancy, the second group has the lowest monthly income and self-perceived economic level, and the third group is the oldest and has the highest proportion of women. Regardless of the group, economic and age factors are the main reasons for participants not having a family plan in the near future. Conclusion The fertility intention of the migrant workers is at an extremely low level. Improving childcare-related benefits and family support services to reduce the economic and time costs of childcare is an effective measure to reverse fertility intentions.
format Article
id doaj-art-d7459c730bcb4921bb9be19641eafc22
institution Kabale University
issn 1471-2458
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series BMC Public Health
spelling doaj-art-d7459c730bcb4921bb9be19641eafc222025-01-19T12:42:11ZengBMCBMC Public Health1471-24582025-01-012511910.1186/s12889-025-21412-4Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, ChinaXinghong Guo0Mingze Ma1Yiyang Chen2Zhaoyang Fang3Jingru Liu4Shuming Yan5Yifei Feng6Xinya Cheng7Jian Wu8Beizhu Ye9Department of Health Management of Public Health, College of Public Health, Zhengzhou UniversityDepartment of Health Management of Public Health, College of Public Health, Zhengzhou UniversityDepartment of Health Management of Public Health, College of Public Health, Zhengzhou UniversityDepartment of Health Management of Public Health, College of Public Health, Zhengzhou UniversityDepartment of Health Management of Public Health, College of Public Health, Zhengzhou UniversityDepartment of Health Management of Public Health, College of Public Health, Zhengzhou UniversityDepartment of Health Management of Public Health, College of Public Health, Zhengzhou UniversityFaculty of Arts and Social Sciences Hong Kong, Baptist UniversityDepartment of Health Management of Public Health, College of Public Health, Zhengzhou UniversityDepartment of Health Management of Public Health, College of Public Health, Zhengzhou UniversityAbstract Background Although China has implemented multiple policies to encourage childbirth, the results have been underwhelming. Migrant workers account for a considerable proportion of China’s population, most of whom are of childbearing age. However, few articles focus on their fertility intentions. Method From August 3 to August 29, 2023, we conducted a cross-sectional survey in Henan Province, China, which included 18,806 participants. Machine learning was used to construct a predictive model for the fertility intention of migrant workers, and unsupervised clustering was used to explore subgroup classification. Result Out of 18,806 participants, only 1057 had fertility intention. We constructed a predictive model for fertility intention based on XGBoost, with an AUC of 0.83. Age, number of children, and marital status are the most important characteristics that affect the fertility intention of migrant workers. Subsequently, unsupervised clustering was conducted on participants without fertility intentions, and it was found that they could be divided into three categories of population. The first group of people is the youngest and mostly unmarried without pregnancy, the second group has the lowest monthly income and self-perceived economic level, and the third group is the oldest and has the highest proportion of women. Regardless of the group, economic and age factors are the main reasons for participants not having a family plan in the near future. Conclusion The fertility intention of the migrant workers is at an extremely low level. Improving childcare-related benefits and family support services to reduce the economic and time costs of childcare is an effective measure to reverse fertility intentions.https://doi.org/10.1186/s12889-025-21412-4Fertility intentionMigrant workersMachine learning
spellingShingle Xinghong Guo
Mingze Ma
Yiyang Chen
Zhaoyang Fang
Jingru Liu
Shuming Yan
Yifei Feng
Xinya Cheng
Jian Wu
Beizhu Ye
Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, China
BMC Public Health
Fertility intention
Migrant workers
Machine learning
title Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, China
title_full Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, China
title_fullStr Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, China
title_full_unstemmed Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, China
title_short Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, China
title_sort prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning a cross sectional survey from henan china
topic Fertility intention
Migrant workers
Machine learning
url https://doi.org/10.1186/s12889-025-21412-4
work_keys_str_mv AT xinghongguo predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT mingzema predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT yiyangchen predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT zhaoyangfang predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT jingruliu predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT shumingyan predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT yifeifeng predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT xinyacheng predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT jianwu predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina
AT beizhuye predictionandunsupervisedclusteringoffertilityintentionamongmigrantworkersbasedonmachinelearningacrosssectionalsurveyfromhenanchina