Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach

Pulsar stars, usually neutron stars, are spherical and compact objects containing a large quantity of mass. Each pulsar star possesses a magnetic field and emits a slightly different pattern of electromagnetic radiation which is used to identify the potential candidates for a real pulsar star. Pulsa...

Full description

Saved in:
Bibliographic Details
Main Authors: Ernesto Lee, Furqan Rustam, Wajdi Aljedaani, Abid Ishaq, Vaibhav Rupapara, Imran Ashraf
Format: Article
Language:English
Published: Wiley 2021-01-01
Series:Advances in Astronomy
Online Access:http://dx.doi.org/10.1155/2021/4916494
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832565685461450752
author Ernesto Lee
Furqan Rustam
Wajdi Aljedaani
Abid Ishaq
Vaibhav Rupapara
Imran Ashraf
author_facet Ernesto Lee
Furqan Rustam
Wajdi Aljedaani
Abid Ishaq
Vaibhav Rupapara
Imran Ashraf
author_sort Ernesto Lee
collection DOAJ
description Pulsar stars, usually neutron stars, are spherical and compact objects containing a large quantity of mass. Each pulsar star possesses a magnetic field and emits a slightly different pattern of electromagnetic radiation which is used to identify the potential candidates for a real pulsar star. Pulsar stars are considered an important cosmic phenomenon, and scientists use them to study nuclear physics, gravitational waves, and collisions between black holes. Defining the process of automatic detection of pulsar stars can accelerate the study of pulsar stars by scientists. This study contrives an accurate and efficient approach for true pulsar detection using supervised machine learning. For experiments, the high time-resolution (HTRU2) dataset is used in this study. To resolve the data imbalance problem and overcome model overfitting, a hybrid resampling approach is presented in this study. Experiments are performed with imbalanced and balanced datasets using well-known machine learning algorithms. Results demonstrate that the proposed hybrid resampling approach proves highly influential to avoid model overfitting and increase the prediction accuracy. With the proposed hybrid resampling approach, the extra tree classifier achieves a 0.993 accuracy score for true pulsar star prediction.
format Article
id doaj-art-e7fb045ba5954333aecf42cbf43129bb
institution Kabale University
issn 1687-7977
language English
publishDate 2021-01-01
publisher Wiley
record_format Article
series Advances in Astronomy
spelling doaj-art-e7fb045ba5954333aecf42cbf43129bb2025-02-03T01:07:07ZengWileyAdvances in Astronomy1687-79772021-01-01202110.1155/2021/4916494Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling ApproachErnesto Lee0Furqan Rustam1Wajdi Aljedaani2Abid Ishaq3Vaibhav Rupapara4Imran Ashraf5Department of Computer ScienceDepartment of Computer ScienceUniversity of North TexasDepartment of Computer ScienceSchool of Computing and Information SciencesDepartment of Information and Communication EngineeringPulsar stars, usually neutron stars, are spherical and compact objects containing a large quantity of mass. Each pulsar star possesses a magnetic field and emits a slightly different pattern of electromagnetic radiation which is used to identify the potential candidates for a real pulsar star. Pulsar stars are considered an important cosmic phenomenon, and scientists use them to study nuclear physics, gravitational waves, and collisions between black holes. Defining the process of automatic detection of pulsar stars can accelerate the study of pulsar stars by scientists. This study contrives an accurate and efficient approach for true pulsar detection using supervised machine learning. For experiments, the high time-resolution (HTRU2) dataset is used in this study. To resolve the data imbalance problem and overcome model overfitting, a hybrid resampling approach is presented in this study. Experiments are performed with imbalanced and balanced datasets using well-known machine learning algorithms. Results demonstrate that the proposed hybrid resampling approach proves highly influential to avoid model overfitting and increase the prediction accuracy. With the proposed hybrid resampling approach, the extra tree classifier achieves a 0.993 accuracy score for true pulsar star prediction.http://dx.doi.org/10.1155/2021/4916494
spellingShingle Ernesto Lee
Furqan Rustam
Wajdi Aljedaani
Abid Ishaq
Vaibhav Rupapara
Imran Ashraf
Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach
Advances in Astronomy
title Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach
title_full Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach
title_fullStr Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach
title_full_unstemmed Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach
title_short Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach
title_sort predicting pulsars from imbalanced dataset with hybrid resampling approach
url http://dx.doi.org/10.1155/2021/4916494
work_keys_str_mv AT ernestolee predictingpulsarsfromimbalanceddatasetwithhybridresamplingapproach
AT furqanrustam predictingpulsarsfromimbalanceddatasetwithhybridresamplingapproach
AT wajdialjedaani predictingpulsarsfromimbalanceddatasetwithhybridresamplingapproach
AT abidishaq predictingpulsarsfromimbalanceddatasetwithhybridresamplingapproach
AT vaibhavrupapara predictingpulsarsfromimbalanceddatasetwithhybridresamplingapproach
AT imranashraf predictingpulsarsfromimbalanceddatasetwithhybridresamplingapproach