On the sensitivity of feature ranked lists for large-scale biological data

The problem of feature selection for large-scale genomic data, for example from DNA microarray experiments, is one of the fundamental and well-investigated problems in modern computational biology.From the computational point of view, a selected gene list should be characterized by good predictive p...

Full description

Saved in:
Bibliographic Details
Main Authors: Danuta Gaweł, Krzysztof Fujarewicz
Format: Article
Language:English
Published: AIMS Press 2013-03-01
Series:Mathematical Biosciences and Engineering
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/mbe.2013.10.667
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590111462653952
author Danuta Gaweł
Krzysztof Fujarewicz
author_facet Danuta Gaweł
Krzysztof Fujarewicz
author_sort Danuta Gaweł
collection DOAJ
description The problem of feature selection for large-scale genomic data, for example from DNA microarray experiments, is one of the fundamental and well-investigated problems in modern computational biology.From the computational point of view, a selected gene list should be characterized by good predictive power and should be understood and well explained from the biological point of view.Recently, another feature of selected gene lists is increasingly investigated, namely their stability which measures how the content and/or the gene order change when the data are perturbed.In this paper we propose a new approach to analysis of gene list stability, termed the sensitivity index, that does not require any data perturbationand allows the gene list that is most reliable in a biological sense to be chosen.
format Article
id doaj-art-43aa432f33d4478b9560aec445643e94
institution Kabale University
issn 1551-0018
language English
publishDate 2013-03-01
publisher AIMS Press
record_format Article
series Mathematical Biosciences and Engineering
spelling doaj-art-43aa432f33d4478b9560aec445643e942025-01-24T02:26:12ZengAIMS PressMathematical Biosciences and Engineering1551-00182013-03-0110366769010.3934/mbe.2013.10.667On the sensitivity of feature ranked lists for large-scale biological dataDanuta Gaweł0Krzysztof Fujarewicz1Silesian University of Technology, Institute of Automatic Control, Akademicka 16, 44-100 GliwiceSilesian University of Technology, Institute of Automatic Control, Akademicka 16, 44-100 GliwiceThe problem of feature selection for large-scale genomic data, for example from DNA microarray experiments, is one of the fundamental and well-investigated problems in modern computational biology.From the computational point of view, a selected gene list should be characterized by good predictive power and should be understood and well explained from the biological point of view.Recently, another feature of selected gene lists is increasingly investigated, namely their stability which measures how the content and/or the gene order change when the data are perturbed.In this paper we propose a new approach to analysis of gene list stability, termed the sensitivity index, that does not require any data perturbationand allows the gene list that is most reliable in a biological sense to be chosen.https://www.aimspress.com/article/doi/10.3934/mbe.2013.10.667feature rankingsensitivitylarge-scale datadna microarrays.feature selectiongene list stability
spellingShingle Danuta Gaweł
Krzysztof Fujarewicz
On the sensitivity of feature ranked lists for large-scale biological data
Mathematical Biosciences and Engineering
feature ranking
sensitivity
large-scale data
dna microarrays.
feature selection
gene list stability
title On the sensitivity of feature ranked lists for large-scale biological data
title_full On the sensitivity of feature ranked lists for large-scale biological data
title_fullStr On the sensitivity of feature ranked lists for large-scale biological data
title_full_unstemmed On the sensitivity of feature ranked lists for large-scale biological data
title_short On the sensitivity of feature ranked lists for large-scale biological data
title_sort on the sensitivity of feature ranked lists for large scale biological data
topic feature ranking
sensitivity
large-scale data
dna microarrays.
feature selection
gene list stability
url https://www.aimspress.com/article/doi/10.3934/mbe.2013.10.667
work_keys_str_mv AT danutagaweł onthesensitivityoffeaturerankedlistsforlargescalebiologicaldata
AT krzysztoffujarewicz onthesensitivityoffeaturerankedlistsforlargescalebiologicaldata