Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries

In the era of big data, libraries manage huge electronic text resources, of which English text resources are particularly critical for academic research, student learning, and professional knowledge acquisition. This paper aims to improve the K-nearest neighbor algorithm and design an intelligent cl...

Full description

Saved in:

Bibliographic Details
Main Author:	Qinwen Xu
Format:	Article
Language:	English
Published:	Elsevier 2025-12-01
Series:	Systems and Soft Computing
Subjects:	K-means clustering KNN algorithm Category average distance Intelligent classification of English texts Big data analysis
Online Access:	http://www.sciencedirect.com/science/article/pii/S2772941925000043
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832586824224079872
author	Qinwen Xu
author_facet	Qinwen Xu
author_sort	Qinwen Xu
collection	DOAJ
description	In the era of big data, libraries manage huge electronic text resources, of which English text resources are particularly critical for academic research, student learning, and professional knowledge acquisition. This paper aims to improve the K-nearest neighbor algorithm and design an intelligent classification model to improve the efficiency and quality of library services. An improved method based on in-class K-means clustering and class mean distance is used to characterize and extract text information with a vector space model. The results showed that the improved K-nearest neighbor algorithm achieved significant improvement in the precision, recall, and F1 values, reaching 90.50 %, 89.95 %, and 89.37 %, respectively. The classification time was significantly reduced to 1034.57 s. In addition, the improved algorithm had a classification accuracy of 94 %, surpassing other popular text classification algorithms. The research successfully realizes the efficient classification of text. The research results not only improve the classification efficiency of library English text resources but also provide strong support for readers to quickly obtain the required information, which has important application value and wide application prospects.
format	Article
id	doaj-art-d639db03796f4944bf81b44c8f825601
institution	Kabale University
issn	2772-9419
language	English
publishDate	2025-12-01
publisher	Elsevier
record_format	Article
series	Systems and Soft Computing
spelling	doaj-art-d639db03796f4944bf81b44c8f8256012025-01-25T04:11:29ZengElsevierSystems and Soft Computing2772-94192025-12-017200186Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in librariesQinwen Xu0Corresponding author.; Public Foundation Department, Henan Medical College, Zhengzhou, 451191, ChinaIn the era of big data, libraries manage huge electronic text resources, of which English text resources are particularly critical for academic research, student learning, and professional knowledge acquisition. This paper aims to improve the K-nearest neighbor algorithm and design an intelligent classification model to improve the efficiency and quality of library services. An improved method based on in-class K-means clustering and class mean distance is used to characterize and extract text information with a vector space model. The results showed that the improved K-nearest neighbor algorithm achieved significant improvement in the precision, recall, and F1 values, reaching 90.50 %, 89.95 %, and 89.37 %, respectively. The classification time was significantly reduced to 1034.57 s. In addition, the improved algorithm had a classification accuracy of 94 %, surpassing other popular text classification algorithms. The research successfully realizes the efficient classification of text. The research results not only improve the classification efficiency of library English text resources but also provide strong support for readers to quickly obtain the required information, which has important application value and wide application prospects.http://www.sciencedirect.com/science/article/pii/S2772941925000043K-means clusteringKNN algorithmCategory average distanceIntelligent classification of English textsBig data analysis
spellingShingle	Qinwen Xu Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries Systems and Soft Computing K-means clustering KNN algorithm Category average distance Intelligent classification of English texts Big data analysis
title	Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries
title_full	Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries
title_fullStr	Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries
title_full_unstemmed	Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries
title_short	Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries
title_sort	application of an intelligent english text classification model with improved knn algorithm in the context of big data in libraries
topic	K-means clustering KNN algorithm Category average distance Intelligent classification of English texts Big data analysis
url	http://www.sciencedirect.com/science/article/pii/S2772941925000043
work_keys_str_mv	AT qinwenxu applicationofanintelligentenglishtextclassificationmodelwithimprovedknnalgorithminthecontextofbigdatainlibraries

Application of an intelligent English text classification model with improved KNN algorithm in the context of big data in libraries

Similar Items