Anomaly Detection Using Explainable Random Forest for the Prediction of Undesirable Events in Oil Wells

The worldwide demand for oil has been rising rapidly for many decades, being the first indicator of economic development. Oil is extracted from underneath reservoirs found below land or ocean using oil wells. An offshore oil well is an oil well type where a wellbore is drilled underneath the ocean b...

Full description

Saved in:
Bibliographic Details
Main Authors: Nida Aslam, Irfan Ullah Khan, Aisha Alansari, Marah Alrammah, Atheer Alghwairy, Rahaf Alqahtani, Razan Alqahtani, Maryam Almushikes, Mohammed AL Hashim
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:Applied Computational Intelligence and Soft Computing
Online Access:http://dx.doi.org/10.1155/2022/1558381
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832565632352124928
author Nida Aslam
Irfan Ullah Khan
Aisha Alansari
Marah Alrammah
Atheer Alghwairy
Rahaf Alqahtani
Razan Alqahtani
Maryam Almushikes
Mohammed AL Hashim
author_facet Nida Aslam
Irfan Ullah Khan
Aisha Alansari
Marah Alrammah
Atheer Alghwairy
Rahaf Alqahtani
Razan Alqahtani
Maryam Almushikes
Mohammed AL Hashim
author_sort Nida Aslam
collection DOAJ
description The worldwide demand for oil has been rising rapidly for many decades, being the first indicator of economic development. Oil is extracted from underneath reservoirs found below land or ocean using oil wells. An offshore oil well is an oil well type where a wellbore is drilled underneath the ocean bed to obtain oil to the surface that demands more stability than other oil wells. The sensors of oil wells generate massive amounts of multivariate time-series data for surveillance engineers to analyze manually and have continuous insight into drilling operations. The manual analysis of data is challenging and time-consuming. Additionally, it can lead to several faulty events that could increase costs and production losses since the engineers tend to focus on the analysis rather than detecting the faulty events. Recently, machine learning (ML) techniques have significantly solved enormous real-time data anomaly problems by decreasing the data engineers’ interaction processes. Accordingly, this study aimed to utilize ML techniques to reduce the time spent manually to establish rules that detect abnormalities in oil wells, leading to rapid and more precise detection. Four ML algorithms were utilized, including random forest (RF), logistic regression (LR), k-nearest neighbor (K-NN), and decision tree (DT). The dataset used in this study suffers from the class imbalance issue; therefore, experiments were conducted using the original and sampled datasets. The empirical results demonstrated promising outcomes, where RF achieved the highest accuracy, recall, precision, F1-score, and AUC of 99.60%, 99.64%, 99.91%, 99.77%, and 1.00, respectively, using the sampled data, and 99.84%, 99.91%, 99.91%, 99.91%, and 1.00, respectively, using the original data. Besides, the study employed Explainable Artificial Intelligence (XAI) to enable surveillance engineers to interpret black box models to understand the causes of abnormalities. The proposed models can be used to successfully identify anomalous events in the oil wells.
format Article
id doaj-art-25b498faa6074be5a1c196b2110445c5
institution Kabale University
issn 1687-9732
language English
publishDate 2022-01-01
publisher Wiley
record_format Article
series Applied Computational Intelligence and Soft Computing
spelling doaj-art-25b498faa6074be5a1c196b2110445c52025-02-03T01:06:58ZengWileyApplied Computational Intelligence and Soft Computing1687-97322022-01-01202210.1155/2022/1558381Anomaly Detection Using Explainable Random Forest for the Prediction of Undesirable Events in Oil WellsNida Aslam0Irfan Ullah Khan1Aisha Alansari2Marah Alrammah3Atheer Alghwairy4Rahaf Alqahtani5Razan Alqahtani6Maryam Almushikes7Mohammed AL Hashim8Department of Computer ScienceDepartment of Computer ScienceComputer Engineering DepartmentDepartment of Computer ScienceDepartment of Computer ScienceDepartment of Computer ScienceDepartment of Computer ScienceDepartment of Computer ScienceComputer Engineering DepartmentThe worldwide demand for oil has been rising rapidly for many decades, being the first indicator of economic development. Oil is extracted from underneath reservoirs found below land or ocean using oil wells. An offshore oil well is an oil well type where a wellbore is drilled underneath the ocean bed to obtain oil to the surface that demands more stability than other oil wells. The sensors of oil wells generate massive amounts of multivariate time-series data for surveillance engineers to analyze manually and have continuous insight into drilling operations. The manual analysis of data is challenging and time-consuming. Additionally, it can lead to several faulty events that could increase costs and production losses since the engineers tend to focus on the analysis rather than detecting the faulty events. Recently, machine learning (ML) techniques have significantly solved enormous real-time data anomaly problems by decreasing the data engineers’ interaction processes. Accordingly, this study aimed to utilize ML techniques to reduce the time spent manually to establish rules that detect abnormalities in oil wells, leading to rapid and more precise detection. Four ML algorithms were utilized, including random forest (RF), logistic regression (LR), k-nearest neighbor (K-NN), and decision tree (DT). The dataset used in this study suffers from the class imbalance issue; therefore, experiments were conducted using the original and sampled datasets. The empirical results demonstrated promising outcomes, where RF achieved the highest accuracy, recall, precision, F1-score, and AUC of 99.60%, 99.64%, 99.91%, 99.77%, and 1.00, respectively, using the sampled data, and 99.84%, 99.91%, 99.91%, 99.91%, and 1.00, respectively, using the original data. Besides, the study employed Explainable Artificial Intelligence (XAI) to enable surveillance engineers to interpret black box models to understand the causes of abnormalities. The proposed models can be used to successfully identify anomalous events in the oil wells.http://dx.doi.org/10.1155/2022/1558381
spellingShingle Nida Aslam
Irfan Ullah Khan
Aisha Alansari
Marah Alrammah
Atheer Alghwairy
Rahaf Alqahtani
Razan Alqahtani
Maryam Almushikes
Mohammed AL Hashim
Anomaly Detection Using Explainable Random Forest for the Prediction of Undesirable Events in Oil Wells
Applied Computational Intelligence and Soft Computing
title Anomaly Detection Using Explainable Random Forest for the Prediction of Undesirable Events in Oil Wells
title_full Anomaly Detection Using Explainable Random Forest for the Prediction of Undesirable Events in Oil Wells
title_fullStr Anomaly Detection Using Explainable Random Forest for the Prediction of Undesirable Events in Oil Wells
title_full_unstemmed Anomaly Detection Using Explainable Random Forest for the Prediction of Undesirable Events in Oil Wells
title_short Anomaly Detection Using Explainable Random Forest for the Prediction of Undesirable Events in Oil Wells
title_sort anomaly detection using explainable random forest for the prediction of undesirable events in oil wells
url http://dx.doi.org/10.1155/2022/1558381
work_keys_str_mv AT nidaaslam anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells
AT irfanullahkhan anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells
AT aishaalansari anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells
AT marahalrammah anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells
AT atheeralghwairy anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells
AT rahafalqahtani anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells
AT razanalqahtani anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells
AT maryamalmushikes anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells
AT mohammedalhashim anomalydetectionusingexplainablerandomforestforthepredictionofundesirableeventsinoilwells