Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis

Determining the number of clusters in high-dimensional real-life datasets and interpreting the final outcome are among the challenging problems in data science. Discovering the number of classes in cancer and microarray data plays a vital role in the treatment and diagnosis of cancers and other rela...

Full description

Saved in:
Bibliographic Details
Main Author: Melisew Tefera Belachew
Format: Article
Language:English
Published: Wiley 2021-01-01
Series:Journal of Applied Mathematics
Online Access:http://dx.doi.org/10.1155/2021/6675829
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832565783497015296
author Melisew Tefera Belachew
author_facet Melisew Tefera Belachew
author_sort Melisew Tefera Belachew
collection DOAJ
description Determining the number of clusters in high-dimensional real-life datasets and interpreting the final outcome are among the challenging problems in data science. Discovering the number of classes in cancer and microarray data plays a vital role in the treatment and diagnosis of cancers and other related diseases. Nonnegative matrix factorization (NMF) plays a paramount role as an efficient data exploratory tool for extracting basis features inherent in massive data. Some algorithms which are based on incorporating sparsity constraints in the nonconvex NMF optimization problem are applied in the past for analyzing microarray datasets. However, to the best of our knowledge, none of these algorithms use block coordinate descent method which is known for providing closed form solutions. In this paper, we apply an algorithm developed based on columnwise partitioning and rank-one matrix approximation. We test this algorithm on two well-known cancer datasets: leukemia and multiple myeloma. The numerical results indicate that the proposed algorithm performs significantly better than related state-of-the-art methods. In particular, it is shown that this method is capable of robust clustering and discovering larger cancer classes in which the cluster splits are stable.
format Article
id doaj-art-62f6f6e5f3de45cd9b6d725e02fff1e9
institution Kabale University
issn 1687-0042
language English
publishDate 2021-01-01
publisher Wiley
record_format Article
series Journal of Applied Mathematics
spelling doaj-art-62f6f6e5f3de45cd9b6d725e02fff1e92025-02-03T01:06:36ZengWileyJournal of Applied Mathematics1687-00422021-01-01202110.1155/2021/66758296675829Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data AnalysisMelisew Tefera Belachew0Department of MathematicsDetermining the number of clusters in high-dimensional real-life datasets and interpreting the final outcome are among the challenging problems in data science. Discovering the number of classes in cancer and microarray data plays a vital role in the treatment and diagnosis of cancers and other related diseases. Nonnegative matrix factorization (NMF) plays a paramount role as an efficient data exploratory tool for extracting basis features inherent in massive data. Some algorithms which are based on incorporating sparsity constraints in the nonconvex NMF optimization problem are applied in the past for analyzing microarray datasets. However, to the best of our knowledge, none of these algorithms use block coordinate descent method which is known for providing closed form solutions. In this paper, we apply an algorithm developed based on columnwise partitioning and rank-one matrix approximation. We test this algorithm on two well-known cancer datasets: leukemia and multiple myeloma. The numerical results indicate that the proposed algorithm performs significantly better than related state-of-the-art methods. In particular, it is shown that this method is capable of robust clustering and discovering larger cancer classes in which the cluster splits are stable.http://dx.doi.org/10.1155/2021/6675829
spellingShingle Melisew Tefera Belachew
Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis
Journal of Applied Mathematics
title Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis
title_full Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis
title_fullStr Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis
title_full_unstemmed Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis
title_short Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis
title_sort coordinate descent based sparse nonnegative matrix factorization for robust cancer class discovery and microarray data analysis
url http://dx.doi.org/10.1155/2021/6675829
work_keys_str_mv AT melisewteferabelachew coordinatedescentbasedsparsenonnegativematrixfactorizationforrobustcancerclassdiscoveryandmicroarraydataanalysis