Exploration of linear and interpretable models for quantification of cell parameters via contactless short-wave infrared hyperspectral sensing

Abstract The development of optical sensors for label-free quantification of cell parameters has numerous uses in the biomedical arena. However, using current optical probes requires the laborious collection of sufficiently large datasets that can be used to calibrate optical probe signals to true m...

Full description

Saved in:
Bibliographic Details
Main Authors: Anjana Hevaganinge, Eva Lowenstein, Anna Filatova, Mihir Modak, Nandi Thales Mogo, Bryana Rowley, Jenny Yarmowsky, Joshua Ehizibolo, Ravidu Hevaganinge, Amy Musser, Abbey Kim, Anthony Neri, Jessica Conway, Yiding Yuan, Maurizio Cattaneo, Sui Seng Tee, Yang Tao
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-85930-2
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract The development of optical sensors for label-free quantification of cell parameters has numerous uses in the biomedical arena. However, using current optical probes requires the laborious collection of sufficiently large datasets that can be used to calibrate optical probe signals to true metabolite concentrations. Further, most practitioners find it difficult to confidently adapt black box chemometric models that are difficult to troubleshoot in high-stakes applications such as biopharmaceutical manufacturing. Replacing optical probes with contactless short-wave infrared (SWIR) hyperspectral cameras allows efficient collection of thousands of absorption signals in a handful of images. This high repetition allows for effective denoising of each spectrum, so interpretable linear models can quantify metabolites. To illustrate, an interpretable linear model called L-SLR is trained using small datasets obtained with a SWIR HSI camera to quantify fructose, viable cell density (VCD), glucose, and lactate. The performance of this model is also compared to other existing linear models, namely Partial Least Squares (PLS) and Non-negative Matrix Factorization (NMF). Using only 50% of the dataset for training, reasonable test performance of mean absolute error (MAE) and correlations (r2) are achieved for glucose (r2 = 0.88, MAE = 37 mg/dL), lactate (r2 = 0.93, MAE = 15.08 mg/dL), and VCD (r2 = 0.81, MAE = 8.6 × 105 cells/mL). Further, these models are also able to handle quantification of a metabolite like fructose in the presence of high background concentration of similar metabolite with almost identical chemical interactions in water like glucose. The model achieves reasonable quantification performance for large fructose level (100–1000 mg/dL) quantification (r2 = 0.92, MAE = 25.1 mg/dL) and small fructose level (< 60 mg/dL) concentrations (r2 = 0.85, MAE = 4.97 mg/dL) in complex media like Fetal Bovine Serum (FBS). Finally, the model provides sparse interpretable weight matrices that hint at the underlying solution changes that correlate to each cell parameter prediction.
ISSN:2045-2322