Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study

The utilization of regression models for the prediction of construction material properties is well-established, yet their performance when applied to small datasets is still unclear. This study investigates the performance of different regression models combined with various data preprocessing tech...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahed Habib, Samer Barakat, Samir Dirar, Salah Al-Toubat, Zaid A. Al-Sadoon
Format: Article
Language:English
Published: Sustainable Development Press Limited 2024-12-01
Series:Sustainable Structures
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832575360664862720
author Ahed Habib
Samer Barakat
Samir Dirar
Salah Al-Toubat
Zaid A. Al-Sadoon
author_facet Ahed Habib
Samer Barakat
Samir Dirar
Salah Al-Toubat
Zaid A. Al-Sadoon
author_sort Ahed Habib
collection DOAJ
description The utilization of regression models for the prediction of construction material properties is well-established, yet their performance when applied to small datasets is still unclear. This study investigates the performance of different regression models combined with various data preprocessing techniques in contexts where data is limited. Specifically, the research focuses on evaluating the suitability of five regression models across nine different data processing scenarios using concrete with recycled copper tailing as a case study. This study aims to determine which combinations of regression models and preprocessing methods yield the most accurate predictions in small data regimes. This research is motivated by the necessity to enhance prediction reliability in the field of construction materials, where experimental data can often be scarce or costly to obtain. Within the study context, a dataset comprising 21 experimental specimens is used to evaluate the performance of the models on various concrete properties, including fresh density, compressive strength, flexural strength, pull-off strength, abrasion resistance, water penetration, rapid chloride ion permeability, and air permeability. Through rigorous evaluation involving a 10-fold cross-validation process to verify accuracy, the research demonstrates that selecting the optimal regression model and data preprocessing technique selection substantially improves prediction outcomes, even with limited data. The findings highlight the importance of this research, suggesting that even small datasets, when handled correctly, can provide robust insights.
format Article
id doaj-art-36a81b212206411994f378d395ec83f9
institution Kabale University
issn 2789-3111
2789-312X
language English
publishDate 2024-12-01
publisher Sustainable Development Press Limited
record_format Article
series Sustainable Structures
spelling doaj-art-36a81b212206411994f378d395ec83f92025-02-01T09:05:05ZengSustainable Development Press LimitedSustainable Structures2789-31112789-312X2024-12-013410.54113/j.sust.2024.000056Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case studyAhed Habib0Samer Barakat1Samir Dirar2Salah Al-Toubat3Zaid A. Al-Sadoon4 Research Institute of Sciences and Engineering, University of Sharjah, Sharjah, United Arab Emirates.Department of Civil and Environmental Engineering, University of Sharjah, Sharjah, United Arab Emirates.Department of Architectural Engineering, University of Sharjah, Sharjah, United Arab Emirates.Department of Civil and Environmental Engineering, University of Sharjah, Sharjah, United Arab Emirates.Department of Civil and Environmental Engineering, University of Sharjah, Sharjah, United Arab Emirates.The utilization of regression models for the prediction of construction material properties is well-established, yet their performance when applied to small datasets is still unclear. This study investigates the performance of different regression models combined with various data preprocessing techniques in contexts where data is limited. Specifically, the research focuses on evaluating the suitability of five regression models across nine different data processing scenarios using concrete with recycled copper tailing as a case study. This study aims to determine which combinations of regression models and preprocessing methods yield the most accurate predictions in small data regimes. This research is motivated by the necessity to enhance prediction reliability in the field of construction materials, where experimental data can often be scarce or costly to obtain. Within the study context, a dataset comprising 21 experimental specimens is used to evaluate the performance of the models on various concrete properties, including fresh density, compressive strength, flexural strength, pull-off strength, abrasion resistance, water penetration, rapid chloride ion permeability, and air permeability. Through rigorous evaluation involving a 10-fold cross-validation process to verify accuracy, the research demonstrates that selecting the optimal regression model and data preprocessing technique selection substantially improves prediction outcomes, even with limited data. The findings highlight the importance of this research, suggesting that even small datasets, when handled correctly, can provide robust insights.regression modelssmall data regimescopper tailing concretemultivariable regressiondata preprocessing
spellingShingle Ahed Habib
Samer Barakat
Samir Dirar
Salah Al-Toubat
Zaid A. Al-Sadoon
Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study
Sustainable Structures
regression models
small data regimes
copper tailing concrete
multivariable regression
data preprocessing
title Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study
title_full Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study
title_fullStr Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study
title_full_unstemmed Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study
title_short Evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study
title_sort evaluating suitability of regression models in small data regimes using concrete with recycled copper tailings as a case study
topic regression models
small data regimes
copper tailing concrete
multivariable regression
data preprocessing
work_keys_str_mv AT ahedhabib evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy
AT samerbarakat evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy
AT samirdirar evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy
AT salahaltoubat evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy
AT zaidaalsadoon evaluatingsuitabilityofregressionmodelsinsmalldataregimesusingconcretewithrecycledcoppertailingsasacasestudy