CCI: A Consensus Clustering-Based Imputation Method for Addressing Dropout Events in scRNA-Seq Data

Single-cell RNA sequencing (scRNA-seq) is a cutting-edge technique in molecular biology and genomics, revealing the cellular heterogeneity. However, scRNA-seq data often suffer from dropout events, meaning that certain genes exhibit very low or even zero expression levels due to technical limitation...

Full description

Saved in:
Bibliographic Details
Main Authors: Wanlin Juan, Kwang Woo Ahn, Yi-Guang Chen, Chien-Wei Lin
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Bioengineering
Subjects:
Online Access:https://www.mdpi.com/2306-5354/12/1/31
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589075783090176
author Wanlin Juan
Kwang Woo Ahn
Yi-Guang Chen
Chien-Wei Lin
author_facet Wanlin Juan
Kwang Woo Ahn
Yi-Guang Chen
Chien-Wei Lin
author_sort Wanlin Juan
collection DOAJ
description Single-cell RNA sequencing (scRNA-seq) is a cutting-edge technique in molecular biology and genomics, revealing the cellular heterogeneity. However, scRNA-seq data often suffer from dropout events, meaning that certain genes exhibit very low or even zero expression levels due to technical limitations. Existing imputation methods for dropout events lack comprehensive evaluations in downstream analyses and do not demonstrate robustness across various scenarios. In response to this challenge, we propose a consensus clustering-based imputation (CCI) method. CCI performs clustering on each subset of data sampling across genes and summarizes clustering outcomes to define cellular similarities. CCI leverages the information from similar cells and employs the similarities to impute gene expression levels. Our comprehensive evaluations demonstrate that CCI not only reconstructs the original data pattern, but also improves the performance of downstream analyses. CCI outperforms existing methods for data imputation under different scenarios, exhibiting accuracy, robustness, and generalization.
format Article
id doaj-art-c97fb958bbe54045a31118d6d93f6ea0
institution Kabale University
issn 2306-5354
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Bioengineering
spelling doaj-art-c97fb958bbe54045a31118d6d93f6ea02025-01-24T13:23:01ZengMDPI AGBioengineering2306-53542025-01-011213110.3390/bioengineering12010031CCI: A Consensus Clustering-Based Imputation Method for Addressing Dropout Events in scRNA-Seq DataWanlin Juan0Kwang Woo Ahn1Yi-Guang Chen2Chien-Wei Lin3Division of Biostatistics, Data Science Institute, Medical College of Wisconsin (MCW), Milwaukee, WI 53226, USADivision of Biostatistics, Data Science Institute, Medical College of Wisconsin (MCW), Milwaukee, WI 53226, USADepartment of Pediatrics, Medical College of Wisconsin (MCW), Milwaukee, WI 53226, USADivision of Biostatistics, Data Science Institute, Medical College of Wisconsin (MCW), Milwaukee, WI 53226, USASingle-cell RNA sequencing (scRNA-seq) is a cutting-edge technique in molecular biology and genomics, revealing the cellular heterogeneity. However, scRNA-seq data often suffer from dropout events, meaning that certain genes exhibit very low or even zero expression levels due to technical limitations. Existing imputation methods for dropout events lack comprehensive evaluations in downstream analyses and do not demonstrate robustness across various scenarios. In response to this challenge, we propose a consensus clustering-based imputation (CCI) method. CCI performs clustering on each subset of data sampling across genes and summarizes clustering outcomes to define cellular similarities. CCI leverages the information from similar cells and employs the similarities to impute gene expression levels. Our comprehensive evaluations demonstrate that CCI not only reconstructs the original data pattern, but also improves the performance of downstream analyses. CCI outperforms existing methods for data imputation under different scenarios, exhibiting accuracy, robustness, and generalization.https://www.mdpi.com/2306-5354/12/1/31dropoutimputationconsensus clusteringscRNA-seq
spellingShingle Wanlin Juan
Kwang Woo Ahn
Yi-Guang Chen
Chien-Wei Lin
CCI: A Consensus Clustering-Based Imputation Method for Addressing Dropout Events in scRNA-Seq Data
Bioengineering
dropout
imputation
consensus clustering
scRNA-seq
title CCI: A Consensus Clustering-Based Imputation Method for Addressing Dropout Events in scRNA-Seq Data
title_full CCI: A Consensus Clustering-Based Imputation Method for Addressing Dropout Events in scRNA-Seq Data
title_fullStr CCI: A Consensus Clustering-Based Imputation Method for Addressing Dropout Events in scRNA-Seq Data
title_full_unstemmed CCI: A Consensus Clustering-Based Imputation Method for Addressing Dropout Events in scRNA-Seq Data
title_short CCI: A Consensus Clustering-Based Imputation Method for Addressing Dropout Events in scRNA-Seq Data
title_sort cci a consensus clustering based imputation method for addressing dropout events in scrna seq data
topic dropout
imputation
consensus clustering
scRNA-seq
url https://www.mdpi.com/2306-5354/12/1/31
work_keys_str_mv AT wanlinjuan cciaconsensusclusteringbasedimputationmethodforaddressingdropouteventsinscrnaseqdata
AT kwangwooahn cciaconsensusclusteringbasedimputationmethodforaddressingdropouteventsinscrnaseqdata
AT yiguangchen cciaconsensusclusteringbasedimputationmethodforaddressingdropouteventsinscrnaseqdata
AT chienweilin cciaconsensusclusteringbasedimputationmethodforaddressingdropouteventsinscrnaseqdata