Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study
The utilization of regression models for the prediction of construction material properties is well-established, yet their performance when applied to small datasets is still unclear. This study investigates the performance of different regression models combined with various data preprocessing tech...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Sustainable Development Press Limited
2024-12-01
|
Series: | Sustainable Structures |
Subjects: | |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832575360664862720 |
---|---|
author | Ahed Habib Samer Barakat Samir Dirar Salah Al-Toubat Zaid A. Al-Sadoon |
author_facet | Ahed Habib Samer Barakat Samir Dirar Salah Al-Toubat Zaid A. Al-Sadoon |
author_sort | Ahed Habib |
collection | DOAJ |
description | The utilization of regression models for the prediction of construction material properties is well-established, yet their performance when applied to small datasets is still unclear. This study investigates the performance of different regression models combined with various data preprocessing techniques in contexts where data is limited. Specifically, the research focuses on evaluating the suitability of five regression models across nine different data processing scenarios using concrete with recycled copper tailing as a case study. This study aims to determine which combinations of regression models and preprocessing methods yield the most accurate predictions in small data regimes. This research is motivated by the necessity to enhance prediction reliability in the field of construction materials, where experimental data can often be scarce or costly to obtain. Within the study context, a dataset comprising 21 experimental specimens is used to evaluate the performance of the models on various concrete properties, including fresh density, compressive strength, flexural strength, pull-off strength, abrasion resistance, water penetration, rapid chloride ion permeability, and air permeability. Through rigorous evaluation involving a 10-fold cross-validation process to verify accuracy, the research demonstrates that selecting the optimal regression model and data preprocessing technique selection substantially improves prediction outcomes, even with limited data. The findings highlight the importance of this research, suggesting that even small datasets, when handled correctly, can provide robust insights. |
format | Article |
id | doaj-art-36a81b212206411994f378d395ec83f9 |
institution | Kabale University |
issn | 2789-3111 2789-312X |
language | English |
publishDate | 2024-12-01 |
publisher | Sustainable Development Press Limited |
record_format | Article |
series | Sustainable Structures |
spelling | doaj-art-36a81b212206411994f378d395ec83f92025-02-01T09:05:05ZengSustainable Development Press LimitedSustainable Structures2789-31112789-312X2024-12-013410.54113/j.sust.2024.000056Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case studyAhed Habib0Samer Barakat1Samir Dirar2Salah Al-Toubat3Zaid A. Al-Sadoon4 Research Institute of Sciences and Engineering, University of Sharjah, Sharjah, United Arab Emirates.Department of Civil and Environmental Engineering, University of Sharjah, Sharjah, United Arab Emirates.Department of Architectural Engineering, University of Sharjah, Sharjah, United Arab Emirates.Department of Civil and Environmental Engineering, University of Sharjah, Sharjah, United Arab Emirates.Department of Civil and Environmental Engineering, University of Sharjah, Sharjah, United Arab Emirates.The utilization of regression models for the prediction of construction material properties is well-established, yet their performance when applied to small datasets is still unclear. This study investigates the performance of different regression models combined with various data preprocessing techniques in contexts where data is limited. Specifically, the research focuses on evaluating the suitability of five regression models across nine different data processing scenarios using concrete with recycled copper tailing as a case study. This study aims to determine which combinations of regression models and preprocessing methods yield the most accurate predictions in small data regimes. This research is motivated by the necessity to enhance prediction reliability in the field of construction materials, where experimental data can often be scarce or costly to obtain. Within the study context, a dataset comprising 21 experimental specimens is used to evaluate the performance of the models on various concrete properties, including fresh density, compressive strength, flexural strength, pull-off strength, abrasion resistance, water penetration, rapid chloride ion permeability, and air permeability. Through rigorous evaluation involving a 10-fold cross-validation process to verify accuracy, the research demonstrates that selecting the optimal regression model and data preprocessing technique selection substantially improves prediction outcomes, even with limited data. The findings highlight the importance of this research, suggesting that even small datasets, when handled correctly, can provide robust insights.regression modelssmall data regimescopper tailing concretemultivariable regressiondata preprocessing |
spellingShingle | Ahed Habib Samer Barakat Samir Dirar Salah Al-Toubat Zaid A. Al-Sadoon Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study Sustainable Structures regression models small data regimes copper tailing concrete multivariable regression data preprocessing |
title | Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study |
title_full | Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study |
title_fullStr | Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study |
title_full_unstemmed | Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study |
title_short | Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study |
title_sort | evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study |
topic | regression models small data regimes copper tailing concrete multivariable regression data preprocessing |
work_keys_str_mv | AT ahedhabib evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy AT samerbarakat evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy AT samirdirar evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy AT salahaltoubat evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy AT zaidaalsadoon evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy |