Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means

In high-dimensional gene expression experiments such as microarray and RNA-seq experiments, the number of measured variables is huge while the number of replicates is small. As a consequence, hypothesis testing is challenging because the power of tests can be very low after controlling multiple test...

Full description

Saved in:
Bibliographic Details
Main Authors: Peng Liu, Chong Wang
Format: Article
Language:English
Published: Wiley 2012-01-01
Series:Journal of Probability and Statistics
Online Access:http://dx.doi.org/10.1155/2012/913560
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832549998968963072
author Peng Liu
Chong Wang
author_facet Peng Liu
Chong Wang
author_sort Peng Liu
collection DOAJ
description In high-dimensional gene expression experiments such as microarray and RNA-seq experiments, the number of measured variables is huge while the number of replicates is small. As a consequence, hypothesis testing is challenging because the power of tests can be very low after controlling multiple testing error. Optimal testing procedures with high average power while controlling false discovery rate are preferred. Many methods were constructed to achieve high power through borrowing information across genes. Some of these methods can be shown to achieve the optimal average power across genes, but only under a normal assumption of alternative means. However, the assumption of a normal distribution is likely violated in practice. In this paper, we propose a novel semiparametric optimal testing (SPOT) procedure for high-dimensional data with small sample size. Our procedure is more robust because it does not depend on any parametric assumption for the alternative means. We show that the proposed test achieves the maximum average power asymptotically as the number of tests goes to infinity. Both simulation study and the analysis of a real microarray data with spike-in probes show that the proposed SPOT procedure performs better when compared to other popularly applied procedures.
format Article
id doaj-art-5a37a1e58c3f4e2ab4bc55531a5a7f34
institution Kabale University
issn 1687-952X
1687-9538
language English
publishDate 2012-01-01
publisher Wiley
record_format Article
series Journal of Probability and Statistics
spelling doaj-art-5a37a1e58c3f4e2ab4bc55531a5a7f342025-02-03T06:07:57ZengWileyJournal of Probability and Statistics1687-952X1687-95382012-01-01201210.1155/2012/913560913560Robust Semiparametric Optimal Testing Procedure for Multiple Normal MeansPeng Liu0Chong Wang1Department of Statistics, Iowa State University, Ames, IA 50011, USADepartment of Statistics, Iowa State University, Ames, IA 50011, USAIn high-dimensional gene expression experiments such as microarray and RNA-seq experiments, the number of measured variables is huge while the number of replicates is small. As a consequence, hypothesis testing is challenging because the power of tests can be very low after controlling multiple testing error. Optimal testing procedures with high average power while controlling false discovery rate are preferred. Many methods were constructed to achieve high power through borrowing information across genes. Some of these methods can be shown to achieve the optimal average power across genes, but only under a normal assumption of alternative means. However, the assumption of a normal distribution is likely violated in practice. In this paper, we propose a novel semiparametric optimal testing (SPOT) procedure for high-dimensional data with small sample size. Our procedure is more robust because it does not depend on any parametric assumption for the alternative means. We show that the proposed test achieves the maximum average power asymptotically as the number of tests goes to infinity. Both simulation study and the analysis of a real microarray data with spike-in probes show that the proposed SPOT procedure performs better when compared to other popularly applied procedures.http://dx.doi.org/10.1155/2012/913560
spellingShingle Peng Liu
Chong Wang
Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means
Journal of Probability and Statistics
title Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means
title_full Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means
title_fullStr Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means
title_full_unstemmed Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means
title_short Robust Semiparametric Optimal Testing Procedure for Multiple Normal Means
title_sort robust semiparametric optimal testing procedure for multiple normal means
url http://dx.doi.org/10.1155/2012/913560
work_keys_str_mv AT pengliu robustsemiparametricoptimaltestingprocedureformultiplenormalmeans
AT chongwang robustsemiparametricoptimaltestingprocedureformultiplenormalmeans