On the sensitivity of feature ranked lists for large-scale biological data

The problem of feature selection for large-scale genomic data, for example from DNA microarray experiments, is one of the fundamental and well-investigated problems in modern computational biology.From the computational point of view, a selected gene list should be characterized by good predictive p...

Full description

Saved in:
Bibliographic Details
Main Authors: Danuta Gaweł, Krzysztof Fujarewicz
Format: Article
Language:English
Published: AIMS Press 2013-03-01
Series:Mathematical Biosciences and Engineering
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/mbe.2013.10.667
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The problem of feature selection for large-scale genomic data, for example from DNA microarray experiments, is one of the fundamental and well-investigated problems in modern computational biology.From the computational point of view, a selected gene list should be characterized by good predictive power and should be understood and well explained from the biological point of view.Recently, another feature of selected gene lists is increasingly investigated, namely their stability which measures how the content and/or the gene order change when the data are perturbed.In this paper we propose a new approach to analysis of gene list stability, termed the sensitivity index, that does not require any data perturbationand allows the gene list that is most reliable in a biological sense to be chosen.
ISSN:1551-0018