Distributed K-Means algorithm based on a Spark optimization sample.

To address the instability and performance issues of the classical K-Means algorithm when dealing with massive datasets, we propose SOSK-Means, an improved K-Means algorithm based on Spark optimization. SOSK-Means incorporates several key modifications to enhance the clustering process.Firstly, a we...

Full description

Saved in:
Bibliographic Details
Main Authors: Yongan Feng, Jiapeng Zou, Wanjun Liu, Fu Lv
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0308993
Tags: Add Tag
No Tags, Be the first to tag this record!