Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries

In the era of big data, libraries manage huge electronic text resources, of which English text resources are particularly critical for academic research, student learning, and professional knowledge acquisition. This paper aims to improve the K-nearest neighbor algorithm and design an intelligent cl...

Bibliographic Details
Main Author: Qinwen Xu
Format: Article
Language: English
Published: Elsevier 2025-12-01
Series: Systems and Soft Computing
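Subjects: K-means clustering; KNN algorithm; Category average distance; Intelligent classification of English texts; Big data analysis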
Online Access: http://www.sciencedirect.com/science/article/pii/S2772941925000043
author Qinwen Xu
collection DOAJ
description In the era of big data, libraries manage very large collections of electronic text, and English-language texts are particularly important for academic research, student learning, and professional knowledge acquisition. This paper improves the K-nearest neighbor (KNN) algorithm and designs an intelligent classification model to raise the efficiency and quality of library services. The improved method, based on in-class K-means clustering and class mean distance, represents texts and extracts their features with a vector space model. The improved KNN algorithm showed significant gains in precision, recall, and F1, reaching 90.50%, 89.95%, and 89.37%, respectively, and its classification time fell to 1034.57 s. Its classification accuracy was 94%, surpassing other popular text classification algorithms. The model thus classifies text efficiently: it improves the classification of library English text resources and helps readers find the information they need quickly, which gives it practical value and broad application prospects.
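The record gives only a one-sentence summary of the method, so the following is a minimal sketch of one plausible reading of it, not the paper's implementation. It assumes that "in-class K-means clustering" means condensing each class's training vectors into a few centroids, and that "class mean distance" means choosing the class whose retrieved neighbors have the smallest average distance to the query. All function names, parameters, and the toy documents are hypothetical.

```python
import math
import random
from collections import Counter, defaultdict

def vectorize(docs):
    """Build a TF-IDF embedder (a basic vector space model) from tokenized docs."""
    vocab = sorted({t for d in docs for t in d})
    df = Counter(t for d in docs for t in set(d))
    idf = {t: math.log((1 + len(docs)) / (1 + df[t])) + 1.0 for t in vocab}

    def embed(tokens):
        tf = Counter(tokens)  # tokens outside the vocabulary are ignored
        return [tf[t] * idf[t] for t in vocab]

    return embed

def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(points) for col in zip(*points)]

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's K-means; returns at most k centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, min(k, len(points)))
    for _ in range(iters):
        groups = defaultdict(list)
        for p in points:
            idx = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
            groups[idx].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [mean(groups[i]) if groups[i] else centroids[i]
                     for i in range(len(centroids))]
    return centroids

def condense(X, y, clusters_per_class=2):
    """Assumption: replace each class's samples by its in-class K-means centroids."""
    by_class = defaultdict(list)
    for x, label in zip(X, y):
        by_class[label].append(x)
    return [(c, label)
            for label, pts in by_class.items()
            for c in kmeans(pts, clusters_per_class)]

def classify(x, prototypes, k=3):
    """Assumption: among the k nearest prototypes, pick the class with the
    smallest mean distance to x (instead of a plain majority vote)."""
    nearest = sorted(prototypes, key=lambda p: dist(x, p[0]))[:k]
    per_class = defaultdict(list)
    for centroid, label in nearest:
        per_class[label].append(dist(x, centroid))
    return min(per_class, key=lambda lab: sum(per_class[lab]) / len(per_class[lab]))

# Toy usage with made-up documents and labels.
docs = [["library", "catalog", "book"], ["book", "loan", "library"],
        ["neural", "network", "training"], ["network", "gradient", "training"]]
labels = ["lib", "lib", "ml", "ml"]
embed = vectorize(docs)
prototypes = condense([embed(d) for d in docs], labels, clusters_per_class=1)
predicted = classify(embed(["library", "book"]), prototypes, k=2)
print(predicted)
```

Condensing each class to a handful of centroids shrinks the set that every query must be compared against, which is the usual motivation for this kind of KNN variant; whether that accounts for the reported classification time is not stated in the record.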
format Article
id doaj-art-d639db03796f4944bf81b44c8f825601
institution Kabale University
issn 2772-9419
language English
publishDate 2025-12-01
publisher Elsevier
record_format Article
series Systems and Soft Computing
spelling doaj-art-d639db03796f4944bf81b44c8f825601; 2025-01-25T04:11:29Z; eng; Elsevier; Systems and Soft Computing; 2772-9419; 2025-12-01; 7; 200186; Qinwen Xu (corresponding author), Public Foundation Department, Henan Medical College, Zhengzhou, 451191, China
title Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries
topic K-means clustering
KNN algorithm
Category average distance
Intelligent classification of English texts
Big data analysis
url http://www.sciencedirect.com/science/article/pii/S2772941925000043