Application of Big Data Technology to Assessments of Female Ovarian Reserve Dysfunction

This paper evaluated the potential application of big data technology to assessments of diminished ovarian reserve (DOR). The study enrolled 162 patients who underwent ovarian reserve function assessment for the first time in the Department of Ultrasound, Jiangsu Province Hospital of Chinese Medicin...

Full description

Saved in:
Bibliographic Details
Main Authors: Xia Ji'An, Ma YunFei, Wu YiYun, Zhao YouLin, Ni HaoRang, Liu XinYan
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10849517/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832575628024479744
author Xia Ji'An
Ma YunFei
Wu YiYun
Zhao YouLin
Ni HaoRang
Liu XinYan
author_facet Xia Ji'An
Ma YunFei
Wu YiYun
Zhao YouLin
Ni HaoRang
Liu XinYan
author_sort Xia Ji'An
collection DOAJ
description This paper evaluated the potential application of big data technology to assessments of diminished ovarian reserve (DOR). The study enrolled 162 patients who underwent ovarian reserve function assessment for the first time in the Department of Ultrasound, Jiangsu Province Hospital of Chinese Medicine from January 2023 to December 2023. Patients were divided into normal ovarian reserve function (n = 68), early-stage DOR (n = 66), mid-stage DOR (n = 12), and late-stage DOR (n = 16) groups. Hadoop and Spark frameworks were used to build a big data platform, and the MLlib parallel machine learning library was used to implement three multivariate classification models—multilayer perceptron, one-vs-rest, and random forest classifiers—to classify and analyse the ovarian reserve function dataset and evaluate the platform’s performance. In the big data platform, the random forest algorithm achieved the highest classification accuracy (89.47%), followed by the neural network (81.06%) and support vector machine (72.91%) methods. The random forest algorithm had the least time overhead for datasets smaller than 50 MB; for datasets exceeding 50 MB, the support vector machine algorithm had the least time overhead, followed by the random forest and neural network algorithms. The neural network algorithm’s speedup ratio was lower than that of the other two algorithms for small datasets, but with increasing dataset size, its speedup ratio significantly exceeded those of the other two algorithms. The random forest algorithm showed substantial growth for large datasets.
format Article
id doaj-art-f6f1302e097e43d6999eff8eafe21796
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-f6f1302e097e43d6999eff8eafe217962025-01-31T23:05:22ZengIEEEIEEE Access2169-35362025-01-0113194081941910.1109/ACCESS.2025.353279910849517Application of Big Data Technology to Assessments of Female Ovarian Reserve DysfunctionXia Ji'An0https://orcid.org/0000-0002-5207-6555Ma YunFei1Wu YiYun2Zhao YouLin3Ni HaoRang4Liu XinYan5School of Computer and Software, Nanjing Vocational University of Industry Technology, Nanjing, Jiangsu, ChinaDepartment of Ultrasound, Jiangsu Province Hospital of Chinese Medicine, Nanjing, Jiangsu, ChinaDepartment of Ultrasound, Jiangsu Province Hospital of Chinese Medicine, Nanjing, Jiangsu, ChinaSchool of Computer and Software, Nanjing Vocational University of Industry Technology, Nanjing, Jiangsu, ChinaSchool of Computer and Software, Nanjing Vocational University of Industry Technology, Nanjing, Jiangsu, ChinaSchool of Computer and Software, Nanjing Vocational University of Industry Technology, Nanjing, Jiangsu, ChinaThis paper evaluated the potential application of big data technology to assessments of diminished ovarian reserve (DOR). The study enrolled 162 patients who underwent ovarian reserve function assessment for the first time in the Department of Ultrasound, Jiangsu Province Hospital of Chinese Medicine from January 2023 to December 2023. Patients were divided into normal ovarian reserve function (n = 68), early-stage DOR (n = 66), mid-stage DOR (n = 12), and late-stage DOR (n = 16) groups. Hadoop and Spark frameworks were used to build a big data platform, and the MLlib parallel machine learning library was used to implement three multivariate classification models—multilayer perceptron, one-vs-rest, and random forest classifiers—to classify and analyse the ovarian reserve function dataset and evaluate the platform’s performance. In the big data platform, the random forest algorithm achieved the highest classification accuracy (89.47%), followed by the neural network (81.06%) and support vector machine (72.91%) methods. The random forest algorithm had the least time overhead for datasets smaller than 50 MB; for datasets exceeding 50 MB, the support vector machine algorithm had the least time overhead, followed by the random forest and neural network algorithms. The neural network algorithm’s speedup ratio was lower than that of the other two algorithms for small datasets, but with increasing dataset size, its speedup ratio significantly exceeded those of the other two algorithms. The random forest algorithm showed substantial growth for large datasets.https://ieeexplore.ieee.org/document/10849517/Diminished ovarian reservethree-dimensional power Doppler ultrasoundmedical big datamachine learningclassification algorithmperformance evaluation
spellingShingle Xia Ji'An
Ma YunFei
Wu YiYun
Zhao YouLin
Ni HaoRang
Liu XinYan
Application of Big Data Technology to Assessments of Female Ovarian Reserve Dysfunction
IEEE Access
Diminished ovarian reserve
three-dimensional power Doppler ultrasound
medical big data
machine learning
classification algorithm
performance evaluation
title Application of Big Data Technology to Assessments of Female Ovarian Reserve Dysfunction
title_full Application of Big Data Technology to Assessments of Female Ovarian Reserve Dysfunction
title_fullStr Application of Big Data Technology to Assessments of Female Ovarian Reserve Dysfunction
title_full_unstemmed Application of Big Data Technology to Assessments of Female Ovarian Reserve Dysfunction
title_short Application of Big Data Technology to Assessments of Female Ovarian Reserve Dysfunction
title_sort application of big data technology to assessments of female ovarian reserve dysfunction
topic Diminished ovarian reserve
three-dimensional power Doppler ultrasound
medical big data
machine learning
classification algorithm
performance evaluation
url https://ieeexplore.ieee.org/document/10849517/
work_keys_str_mv AT xiajian applicationofbigdatatechnologytoassessmentsoffemaleovarianreservedysfunction
AT mayunfei applicationofbigdatatechnologytoassessmentsoffemaleovarianreservedysfunction
AT wuyiyun applicationofbigdatatechnologytoassessmentsoffemaleovarianreservedysfunction
AT zhaoyoulin applicationofbigdatatechnologytoassessmentsoffemaleovarianreservedysfunction
AT nihaorang applicationofbigdatatechnologytoassessmentsoffemaleovarianreservedysfunction
AT liuxinyan applicationofbigdatatechnologytoassessmentsoffemaleovarianreservedysfunction