Machine Learning for Lung Cancer Subtype Classification: Combining Clinical, Histopathological, and Biophysical Features

<b>Background/Objectives:</b> Despite advances in diagnostic techniques, accurate classification of lung cancer subtypes remains crucial for treatment planning. Traditional methods like genomic studies face limitations such as high cost and complexity. This study investigates whether int...

Full description

Saved in:
Bibliographic Details
Main Authors: Aiga Andrijanova, Lasma Bugovecka, Sergejs Isajevs, Donats Erts, Uldis Malinovskis, Andis Liepins
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Diagnostics
Subjects:
Online Access:https://www.mdpi.com/2075-4418/15/2/127
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:<b>Background/Objectives:</b> Despite advances in diagnostic techniques, accurate classification of lung cancer subtypes remains crucial for treatment planning. Traditional methods like genomic studies face limitations such as high cost and complexity. This study investigates whether integrating atomic force microscopy (AFM) measurements with conventional clinical and histopathological data can improve lung cancer subtype classification. <b>Methods:</b> We developed and analyzed a novel dataset combining clinical, histopathological, and AFM-derived biophysical characteristics from 37 lung cancer patients. Various machine learning techniques were evaluated, with a focus on Bayesian Networks due to their ability to handle complex data with missing values. Leave-One-Out Cross-Validation was employed to assess model performance. <b>Results:</b> The integration of biophysical features improved classification accuracy from 86.49% to 89.19% using a data-driven Bayesian Network model, though this improvement was not statistically significant (<i>p</i> = 1.0). Four key features were identified as highly predictive: sex, vascular invasion, perineural invasion, and ALK mutation. A simplified model using only these features achieved identical performance with significantly reduced complexity (BIC 51.931 vs. 268.586). <b>Conclusions:</b> While AFM-derived measurements showed promise for enhancing lung cancer subtype classification, larger datasets are needed to fully validate their impact. Our findings demonstrate the feasibility of incorporating biophysical measurements into cancer classification frameworks and identify the most predictive features for accurate diagnosis. Further research with expanded datasets is needed to validate these findings.
ISSN:2075-4418