Integrating representation learning, permutation, and optimization to detect lineage-related gene expression patterns

Bibliographic Details
Main Authors: Hannah M. Schlüter, Caroline Uhler
Format: Article
Language: English
Published: Nature Portfolio 2025-01-01
Series: Nature Communications
Online Access: https://doi.org/10.1038/s41467-025-56388-7
Description
Summary: Recent barcoding technologies allow reconstructing lineage trees while capturing paired single-cell RNA-sequencing (scRNA-seq) data. Such datasets provide opportunities to compare gene expression memory maintenance through lineage branching and pinpoint critical genes in these processes. Here we develop Permutation, Optimization, and Representation learning based single Cell gene Expression and Lineage ANalysis (PORCELAN) to identify lineage-informative genes or subtrees where lineage and expression are tightly coupled. We validate our method using synthetic data and apply it to recent paired lineage and scRNA-seq data of lung cancer in a mouse model and embryogenesis of mouse and C. elegans. Our method pinpoints subtrees giving rise to metastases or new cell states, and genes identified as most informative about lineage overlap with known pathways involved in lung cancer progression. Furthermore, our method highlights differences in how gene expression memory is maintained through divisions in cancer and embryogenesis, thereby providing a tool for studying cell state memory through divisions across biological systems.
ISSN: 2041-1723