Variance of the number of false discoveries
In high throughput genomic work, a very large number "d" of hypotheses are tested based on "n">"d" data samples. The large number of tests necessitates an adjustment for false discoveries in which a true null hypothesis was rejected. The expected number of false discoveries is easy to obtain. Dependences between the hypothesis tests greatly affect the variance of the number of false discoveries. Assuming that the tests are independent gives an inadequate variance formula. The paper presents a variance formula that takes account of the correlations between test statistics. That formula involves "O"("d"-super-2) correlations, and so a naïve implementation has cost "O"("nd"-super-2). A method based on sampling pairs of tests allows the variance to be approximated at a cost that is independent of "d". Copyright 2005 Royal Statistical Society.
Year of publication: |
2005
|
---|---|
Authors: | Owen, Art B. |
Published in: |
Journal of the Royal Statistical Society Series B. - Royal Statistical Society - RSS, ISSN 1369-7412. - Vol. 67.2005, 3, p. 411-426
|
Publisher: |
Royal Statistical Society - RSS |
Saved in:
freely available
Saved in favorites
Similar items by person
-
Deterministic parallel analysis : an improved method for selecting factors and principal components
Dobriban, Edgar, (2018)
-
Designing experiments informed by observational studies
Rosenman, Evan T. R., (2021)
-
Estimating mean dimensionality of analysis of variance decompositions
Liu, Ruixue, (2006)
- More ...