IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)

Sembung is a medicinal plant native to Indonesia that grows optimally in tropical climates. The secondary metabolite compounds found in the leaves of sembung are biopharmaceutical active ingredients. Fourier Transform Infrared (FTIR) spectroscopy can identify the functional compounds in sembung leav...

Full description

Saved in:
Bibliographic Details
Main Authors: Kusnaeni Kusnaeni, Nurul Fuady Adhalia, Abdul Khaliq Zulfattah
Format: Article
Language:English
Published: Universitas Pattimura 2025-01-01
Series:Barekeng
Subjects:
Online Access:https://ojs3.unpatti.ac.id/index.php/barekeng/article/view/15175
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849389551344156672
author Kusnaeni Kusnaeni
Nurul Fuady Adhalia
Abdul Khaliq Zulfattah
author_facet Kusnaeni Kusnaeni
Nurul Fuady Adhalia
Abdul Khaliq Zulfattah
author_sort Kusnaeni Kusnaeni
collection DOAJ
description Sembung is a medicinal plant native to Indonesia that grows optimally in tropical climates. The secondary metabolite compounds found in the leaves of sembung are biopharmaceutical active ingredients. Fourier Transform Infrared (FTIR) spectroscopy can identify the functional compounds in sembung leaves by analyzing unique peaks in the spectrum, which correspond to specific functional groups of the compounds. In this research, 35 observations were made with 1,866 explanatory variables (wavelengths). Data in which the number of explanatory variables surpasses the number of observations is known as high-dimensional data. One method that can handle high-dimensional problems is to select important variables that affect the objective variable. The XGBoost algorithm can calculate the feature importance score that affects the goal variable so that it does not have to include all variables in the modeling, this can overcome problems in high-dimensional data. The results of the calculation of feature importance found Lignin Skeletal Band, CH, and CH2 aliphatic Stretching Group, C=C, C=N, C–H in ring structure, DNA and RNA backbones, NH2 Aminoacidic Group, C=O Ester Fatty Acid that the active compounds contained in the leaves of sembung.
format Article
id doaj-art-e774f3bfb5f24abda3342e78e260ee15
institution Kabale University
issn 1978-7227
2615-3017
language English
publishDate 2025-01-01
publisher Universitas Pattimura
record_format Article
series Barekeng
spelling doaj-art-e774f3bfb5f24abda3342e78e260ee152025-08-20T03:41:56ZengUniversitas PattimuraBarekeng1978-72272615-30172025-01-0119167568610.30598/barekengvol19iss1pp675-68615175IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)Kusnaeni Kusnaeni0Nurul Fuady Adhalia1Abdul Khaliq Zulfattah2Department of Mathematics, Institut Teknologi Bacharuddin Jusuf Habibie, IndonesiaDepartment of Mathematics, Institut Teknologi Bacharuddin Jusuf Habibie, IndonesiaDepartment of Information System, Institut Teknologi Bacharuddin Jusuf Habibie, IndonesiaSembung is a medicinal plant native to Indonesia that grows optimally in tropical climates. The secondary metabolite compounds found in the leaves of sembung are biopharmaceutical active ingredients. Fourier Transform Infrared (FTIR) spectroscopy can identify the functional compounds in sembung leaves by analyzing unique peaks in the spectrum, which correspond to specific functional groups of the compounds. In this research, 35 observations were made with 1,866 explanatory variables (wavelengths). Data in which the number of explanatory variables surpasses the number of observations is known as high-dimensional data. One method that can handle high-dimensional problems is to select important variables that affect the objective variable. The XGBoost algorithm can calculate the feature importance score that affects the goal variable so that it does not have to include all variables in the modeling, this can overcome problems in high-dimensional data. The results of the calculation of feature importance found Lignin Skeletal Band, CH, and CH2 aliphatic Stretching Group, C=C, C=N, C–H in ring structure, DNA and RNA backbones, NH2 Aminoacidic Group, C=O Ester Fatty Acid that the active compounds contained in the leaves of sembung.https://ojs3.unpatti.ac.id/index.php/barekeng/article/view/15175sembung leavesfeature importancexgboostactive compound
spellingShingle Kusnaeni Kusnaeni
Nurul Fuady Adhalia
Abdul Khaliq Zulfattah
IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)
Barekeng
sembung leaves
feature importance
xgboost
active compound
title IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)
title_full IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)
title_fullStr IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)
title_full_unstemmed IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)
title_short IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)
title_sort implementation of feature importance xgboost algorithm to determine the active compounds of sembung leaves blumea balsamifera
topic sembung leaves
feature importance
xgboost
active compound
url https://ojs3.unpatti.ac.id/index.php/barekeng/article/view/15175
work_keys_str_mv AT kusnaenikusnaeni implementationoffeatureimportancexgboostalgorithmtodeterminetheactivecompoundsofsembungleavesblumeabalsamifera
AT nurulfuadyadhalia implementationoffeatureimportancexgboostalgorithmtodeterminetheactivecompoundsofsembungleavesblumeabalsamifera
AT abdulkhaliqzulfattah implementationoffeatureimportancexgboostalgorithmtodeterminetheactivecompoundsofsembungleavesblumeabalsamifera