IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)
Sembung is a medicinal plant native to Indonesia that grows optimally in tropical climates. The secondary metabolite compounds found in the leaves of sembung are biopharmaceutical active ingredients. Fourier Transform Infrared (FTIR) spectroscopy can identify the functional compounds in sembung leav...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Universitas Pattimura
2025-01-01
|
| Series: | Barekeng |
| Subjects: | |
| Online Access: | https://ojs3.unpatti.ac.id/index.php/barekeng/article/view/15175 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849389551344156672 |
|---|---|
| author | Kusnaeni Kusnaeni Nurul Fuady Adhalia Abdul Khaliq Zulfattah |
| author_facet | Kusnaeni Kusnaeni Nurul Fuady Adhalia Abdul Khaliq Zulfattah |
| author_sort | Kusnaeni Kusnaeni |
| collection | DOAJ |
| description | Sembung is a medicinal plant native to Indonesia that grows optimally in tropical climates. The secondary metabolite compounds found in the leaves of sembung are biopharmaceutical active ingredients. Fourier Transform Infrared (FTIR) spectroscopy can identify the functional compounds in sembung leaves by analyzing unique peaks in the spectrum, which correspond to specific functional groups of the compounds. In this research, 35 observations were made with 1,866 explanatory variables (wavelengths). Data in which the number of explanatory variables surpasses the number of observations is known as high-dimensional data. One method that can handle high-dimensional problems is to select important variables that affect the objective variable. The XGBoost algorithm can calculate the feature importance score that affects the goal variable so that it does not have to include all variables in the modeling, this can overcome problems in high-dimensional data. The results of the calculation of feature importance found Lignin Skeletal Band, CH, and CH2 aliphatic Stretching Group, C=C, C=N, C–H in ring structure, DNA and RNA backbones, NH2 Aminoacidic Group, C=O Ester Fatty Acid that the active compounds contained in the leaves of sembung. |
| format | Article |
| id | doaj-art-e774f3bfb5f24abda3342e78e260ee15 |
| institution | Kabale University |
| issn | 1978-7227 2615-3017 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | Universitas Pattimura |
| record_format | Article |
| series | Barekeng |
| spelling | doaj-art-e774f3bfb5f24abda3342e78e260ee152025-08-20T03:41:56ZengUniversitas PattimuraBarekeng1978-72272615-30172025-01-0119167568610.30598/barekengvol19iss1pp675-68615175IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA)Kusnaeni Kusnaeni0Nurul Fuady Adhalia1Abdul Khaliq Zulfattah2Department of Mathematics, Institut Teknologi Bacharuddin Jusuf Habibie, IndonesiaDepartment of Mathematics, Institut Teknologi Bacharuddin Jusuf Habibie, IndonesiaDepartment of Information System, Institut Teknologi Bacharuddin Jusuf Habibie, IndonesiaSembung is a medicinal plant native to Indonesia that grows optimally in tropical climates. The secondary metabolite compounds found in the leaves of sembung are biopharmaceutical active ingredients. Fourier Transform Infrared (FTIR) spectroscopy can identify the functional compounds in sembung leaves by analyzing unique peaks in the spectrum, which correspond to specific functional groups of the compounds. In this research, 35 observations were made with 1,866 explanatory variables (wavelengths). Data in which the number of explanatory variables surpasses the number of observations is known as high-dimensional data. One method that can handle high-dimensional problems is to select important variables that affect the objective variable. The XGBoost algorithm can calculate the feature importance score that affects the goal variable so that it does not have to include all variables in the modeling, this can overcome problems in high-dimensional data. The results of the calculation of feature importance found Lignin Skeletal Band, CH, and CH2 aliphatic Stretching Group, C=C, C=N, C–H in ring structure, DNA and RNA backbones, NH2 Aminoacidic Group, C=O Ester Fatty Acid that the active compounds contained in the leaves of sembung.https://ojs3.unpatti.ac.id/index.php/barekeng/article/view/15175sembung leavesfeature importancexgboostactive compound |
| spellingShingle | Kusnaeni Kusnaeni Nurul Fuady Adhalia Abdul Khaliq Zulfattah IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA) Barekeng sembung leaves feature importance xgboost active compound |
| title | IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA) |
| title_full | IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA) |
| title_fullStr | IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA) |
| title_full_unstemmed | IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA) |
| title_short | IMPLEMENTATION OF FEATURE IMPORTANCE XGBOOST ALGORITHM TO DETERMINE THE ACTIVE COMPOUNDS OF SEMBUNG LEAVES (BLUMEA BALSAMIFERA) |
| title_sort | implementation of feature importance xgboost algorithm to determine the active compounds of sembung leaves blumea balsamifera |
| topic | sembung leaves feature importance xgboost active compound |
| url | https://ojs3.unpatti.ac.id/index.php/barekeng/article/view/15175 |
| work_keys_str_mv | AT kusnaenikusnaeni implementationoffeatureimportancexgboostalgorithmtodeterminetheactivecompoundsofsembungleavesblumeabalsamifera AT nurulfuadyadhalia implementationoffeatureimportancexgboostalgorithmtodeterminetheactivecompoundsofsembungleavesblumeabalsamifera AT abdulkhaliqzulfattah implementationoffeatureimportancexgboostalgorithmtodeterminetheactivecompoundsofsembungleavesblumeabalsamifera |