Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosis

Lung cancer is one of the most prevalent and lethal malignant tumors worldwide. Currently, clinical diagnosis primarily relies on chest X-ray examinations, histopathological analysis, and the detection of tumor markers in blood. However, each of these methods has inherent limitations. The current st...

Full description

Saved in:
Bibliographic Details
Main Authors: Ao Song, Wanli Yang, Jun Wang, Yisa Cai, Lizheng Cai, Nan Pang, Ruihua Yu, Zhikun Liu, Chao Yang, Feng Jiang
Format: Article
Language:English
Published: Elsevier 2025-04-01
Series:SLAS Technology
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2472630325000111
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832087589241225216
author Ao Song
Wanli Yang
Jun Wang
Yisa Cai
Lizheng Cai
Nan Pang
Ruihua Yu
Zhikun Liu
Chao Yang
Feng Jiang
author_facet Ao Song
Wanli Yang
Jun Wang
Yisa Cai
Lizheng Cai
Nan Pang
Ruihua Yu
Zhikun Liu
Chao Yang
Feng Jiang
author_sort Ao Song
collection DOAJ
description Lung cancer is one of the most prevalent and lethal malignant tumors worldwide. Currently, clinical diagnosis primarily relies on chest X-ray examinations, histopathological analysis, and the detection of tumor markers in blood. However, each of these methods has inherent limitations. The current study aims to explore novel diagnostic approaches for lung cancer by employing attenuated total reflection-Fourier transform infrared (ATR-FTIR) spectroscopy in conjunction with multiple machine learning models. Fourier transform infrared spectroscopy can detect subtle differences in the material structures that reflect the carcinogenic process between lung cancer tissues and normal tissues. By applying principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) to analyze infrared spectral data, these subtle differences can be amplified. The study revealed that the combination of spectral bands within the 3500–3000 cm-1 and 1600–1500 cm-1 ranges is particularly significant for differentiating between the two groups. Three classification models—Support Vector Machine (SVM), k-Nearest Neighbor (kNN), and Linear Discriminant Analysis (LDA)—were constructed for spectral analysis of various band combinations. The results indicated that in detecting lung cancer samples, the combination of the 3500–3000 cm-1 and 1600–1500 cm-1 bands offers significant advantages. The analysis of the receiver operating characteristic (ROC) curve demonstrated that the area under the curve (AUC) exceeded 0.95 for all models, with the LDA model achieving an accuracy rate of 99.4% in identifying lung cancer patients compared to healthy individuals. The findings suggest that the integration of ATR-FTIR spectroscopy with multiple machine learning models represents a promising auxiliary diagnostic method for clinical lung cancer diagnosis, enabling detection at the molecular level.
format Article
id doaj-art-c0367250458541c386aea7a08860f42b
institution Kabale University
issn 2472-6303
language English
publishDate 2025-04-01
publisher Elsevier
record_format Article
series SLAS Technology
spelling doaj-art-c0367250458541c386aea7a08860f42b2025-02-06T05:12:38ZengElsevierSLAS Technology2472-63032025-04-0131100253Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosisAo Song0Wanli Yang1Jun Wang2Yisa Cai3Lizheng Cai4Nan Pang5Ruihua Yu6Zhikun Liu7Chao Yang8Feng Jiang9Jiangsu Key Laboratory of Regional Resource Exploitation and Medicinal Research, Huaiyin Institute of Technology, Huai'an 223003, China; Chongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, ChinaChongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, ChinaChongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, ChinaJiangsu Key Laboratory of Regional Resource Exploitation and Medicinal Research, Huaiyin Institute of Technology, Huai'an 223003, China; Chongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, ChinaChongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, China; Shanghai University of Traditional Chinese Medicine, Shanghai 201203, ChinaChongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, ChinaChongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, ChinaJiangsu Key Laboratory of Regional Resource Exploitation and Medicinal Research, Huaiyin Institute of Technology, Huai'an 223003, China; Corresponding authors.Chongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, China; Corresponding authors.Chongming Hospital Affiliated to Shanghai University of Medicine and Health Sciences, Shanghai 202150, China; Corresponding authors.Lung cancer is one of the most prevalent and lethal malignant tumors worldwide. Currently, clinical diagnosis primarily relies on chest X-ray examinations, histopathological analysis, and the detection of tumor markers in blood. However, each of these methods has inherent limitations. The current study aims to explore novel diagnostic approaches for lung cancer by employing attenuated total reflection-Fourier transform infrared (ATR-FTIR) spectroscopy in conjunction with multiple machine learning models. Fourier transform infrared spectroscopy can detect subtle differences in the material structures that reflect the carcinogenic process between lung cancer tissues and normal tissues. By applying principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) to analyze infrared spectral data, these subtle differences can be amplified. The study revealed that the combination of spectral bands within the 3500–3000 cm-1 and 1600–1500 cm-1 ranges is particularly significant for differentiating between the two groups. Three classification models—Support Vector Machine (SVM), k-Nearest Neighbor (kNN), and Linear Discriminant Analysis (LDA)—were constructed for spectral analysis of various band combinations. The results indicated that in detecting lung cancer samples, the combination of the 3500–3000 cm-1 and 1600–1500 cm-1 bands offers significant advantages. The analysis of the receiver operating characteristic (ROC) curve demonstrated that the area under the curve (AUC) exceeded 0.95 for all models, with the LDA model achieving an accuracy rate of 99.4% in identifying lung cancer patients compared to healthy individuals. The findings suggest that the integration of ATR-FTIR spectroscopy with multiple machine learning models represents a promising auxiliary diagnostic method for clinical lung cancer diagnosis, enabling detection at the molecular level.http://www.sciencedirect.com/science/article/pii/S2472630325000111Lung cancerATR-FTIR spectroscopyLinear discriminant analysisSerum
spellingShingle Ao Song
Wanli Yang
Jun Wang
Yisa Cai
Lizheng Cai
Nan Pang
Ruihua Yu
Zhikun Liu
Chao Yang
Feng Jiang
Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosis
SLAS Technology
Lung cancer
ATR-FTIR spectroscopy
Linear discriminant analysis
Serum
title Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosis
title_full Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosis
title_fullStr Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosis
title_full_unstemmed Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosis
title_short Application of ATR-FTIR spectroscopy and multivariate statistical analysis in cancer diagnosis
title_sort application of atr ftir spectroscopy and multivariate statistical analysis in cancer diagnosis
topic Lung cancer
ATR-FTIR spectroscopy
Linear discriminant analysis
Serum
url http://www.sciencedirect.com/science/article/pii/S2472630325000111
work_keys_str_mv AT aosong applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT wanliyang applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT junwang applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT yisacai applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT lizhengcai applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT nanpang applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT ruihuayu applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT zhikunliu applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT chaoyang applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis
AT fengjiang applicationofatrftirspectroscopyandmultivariatestatisticalanalysisincancerdiagnosis