The analysis of sculpture image classification in utilization of 3D reconstruction under K-means++
Abstract This study aims to address the issues of accuracy and efficiency in sculpture image classification. Due to the diversity and complexity of sculpture images, traditional image processing algorithms perform poorly in capturing the sculptures’ intricate shapes and structural features, resultin...
Saved in:
| Main Author: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-05-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-01949-5 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Abstract This study aims to address the issues of accuracy and efficiency in sculpture image classification. Due to the diversity and complexity of sculpture images, traditional image processing algorithms perform poorly in capturing the sculptures’ intricate shapes and structural features, resulting in suboptimal classification and recognition performance. To overcome this challenge, this study proposes an innovative image classification method that combines the ResNet50 model from the Deep Convolutional Neural Network (DCNN) with the K-means++ clustering algorithm. ResNet50 is chosen for its powerful feature extraction capabilities and outstanding performance in image classification tasks. At the same time, K-means++ is selected for its optimized initial centroid selection strategy, which enhances the stability and reliability of clustering. After the final convolutional layer of ResNet50, a self-attention module is added. This module learns and generates an attention map, which guides the model on which areas of the image to focus on in subsequent processing. ResNet50 includes residual blocks, each containing multiple convolutional layers and a skip connection, enabling the network to learn differences between inputs and outputs rather than directly learning outputs, thus improving performance. Initially, ResNet50 extracts feature vectors from original images, which are inputted into the K-means + + algorithm for clustering. K-means + + automatically partitions these feature vectors into different categories, achieving unsupervised image classification. The CMU-MINE architectural sculpture dataset is utilized in the experimental section, with ViT-Base, EfficientNet-B4, and ConvNeXt-Tiny as benchmarks to evaluate the proposed ResNet50 + K-means + + image classification approach. The final model achieves a loss value of 0.155 and a recall of 98.9%, significantly outperforming the other three models. In conclusion, performing feature point matching during three-dimensional reconstruction is crucial. This study employs a combined image classification method using the ResNet50 and K-means + + algorithm, optimizing the accuracy issues of traditional classification methods and achieving promising classification results. |
|---|---|
| ISSN: | 2045-2322 |