Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock Prices

This article presents a comprehensive methodology for processing financial datasets of Apple Inc., encompassing quarterly income and daily stock prices, spanning from March 31, 2009, to December 31, 2023. Leveraging 60 observations for quarterly income and 3774 observations for daily stock prices, s...

Full description

Saved in:
Bibliographic Details
Main Authors: Ungar Kevin, Oprean-Stan Camelia
Format: Article
Language:English
Published: Sciendo 2025-03-01
Series:Studia Universitatis Vasile Goldis Arad, Seria Stiinte Economice
Subjects:
Online Access:https://doi.org/10.2478/sues-2025-0004
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832593653432844288
author Ungar Kevin
Oprean-Stan Camelia
author_facet Ungar Kevin
Oprean-Stan Camelia
author_sort Ungar Kevin
collection DOAJ
description This article presents a comprehensive methodology for processing financial datasets of Apple Inc., encompassing quarterly income and daily stock prices, spanning from March 31, 2009, to December 31, 2023. Leveraging 60 observations for quarterly income and 3774 observations for daily stock prices, sourced from Macrotrends and Yahoo Finance respectively, the study outlines five distinct datasets crafted through varied preprocessing techniques. Through detailed explanations of aggregation, interpolation (linear, polynomial, and cubic spline) and lagged variables methods, the study elucidates the steps taken to transform raw data into analytically rich datasets. Subsequently, the article delves into regression analysis, aiming to decipher which of the five data processing methods best suits capital market analysis, by employing both linear and polynomial regression models on each preprocessed dataset and evaluating their performance using a range of metrics, including cross-validation score, MSE, MAE, RMSE, R-squared, and Adjusted R-squared. The research findings reveal that linear interpolation with polynomial regression emerges as the top-performing method, boasting the lowest validation MSE and MAE values, alongside the highest R-squared and Adjusted R-squared values.
format Article
id doaj-art-ee328e534044455fb8dad19e4592635c
institution Kabale University
issn 2285-3065
language English
publishDate 2025-03-01
publisher Sciendo
record_format Article
series Studia Universitatis Vasile Goldis Arad, Seria Stiinte Economice
spelling doaj-art-ee328e534044455fb8dad19e4592635c2025-01-20T11:10:13ZengSciendoStudia Universitatis Vasile Goldis Arad, Seria Stiinte Economice2285-30652025-03-01351498210.2478/sues-2025-0004Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock PricesUngar Kevin0Oprean-Stan Camelia11Lucian Blaga University of Sibiu, Faculty of Economic Sciences, Sibiu, Romania2Lucian Blaga University of Sibiu, Faculty of Economic Sciences, Sibiu, RomaniaThis article presents a comprehensive methodology for processing financial datasets of Apple Inc., encompassing quarterly income and daily stock prices, spanning from March 31, 2009, to December 31, 2023. Leveraging 60 observations for quarterly income and 3774 observations for daily stock prices, sourced from Macrotrends and Yahoo Finance respectively, the study outlines five distinct datasets crafted through varied preprocessing techniques. Through detailed explanations of aggregation, interpolation (linear, polynomial, and cubic spline) and lagged variables methods, the study elucidates the steps taken to transform raw data into analytically rich datasets. Subsequently, the article delves into regression analysis, aiming to decipher which of the five data processing methods best suits capital market analysis, by employing both linear and polynomial regression models on each preprocessed dataset and evaluating their performance using a range of metrics, including cross-validation score, MSE, MAE, RMSE, R-squared, and Adjusted R-squared. The research findings reveal that linear interpolation with polynomial regression emerges as the top-performing method, boasting the lowest validation MSE and MAE values, alongside the highest R-squared and Adjusted R-squared values.https://doi.org/10.2478/sues-2025-0004linear regression analysispolynomial regressionstock pricesfinancial data processingpython programmingg14g21c45g32
spellingShingle Ungar Kevin
Oprean-Stan Camelia
Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock Prices
Studia Universitatis Vasile Goldis Arad, Seria Stiinte Economice
linear regression analysis
polynomial regression
stock prices
financial data processing
python programming
g14
g21
c45
g32
title Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock Prices
title_full Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock Prices
title_fullStr Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock Prices
title_full_unstemmed Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock Prices
title_short Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock Prices
title_sort optimizing financial data analysis a comparative study of preprocessing techniques for regression modeling of apple inc s net income and stock prices
topic linear regression analysis
polynomial regression
stock prices
financial data processing
python programming
g14
g21
c45
g32
url https://doi.org/10.2478/sues-2025-0004
work_keys_str_mv AT ungarkevin optimizingfinancialdataanalysisacomparativestudyofpreprocessingtechniquesforregressionmodelingofappleincsnetincomeandstockprices
AT opreanstancamelia optimizingfinancialdataanalysisacomparativestudyofpreprocessingtechniquesforregressionmodelingofappleincsnetincomeandstockprices