Access

You are not currently logged in.

Access your personal account or get JSTOR access through your library or other institution:

login

Log in to your personal account or through your institution.

If you need an accessible version of this item please contact JSTOR User Support

Statistical Significance for Genomewide Studies

John D. Storey and Robert Tibshirani
Proceedings of the National Academy of Sciences of the United States of America
Vol. 100, No. 16 (Aug. 5, 2003), pp. 9440-9445
Stable URL: http://www.jstor.org/stable/3144228
Page Count: 6
  • Read Online (Free)
  • Cite this Item
If you need an accessible version of this item please contact JSTOR User Support
Statistical Significance for Genomewide Studies
Preview not available

Abstract

With the increase in genomewide experiments and the sequencing of multiple genomes, the analysis of large data sets has become commonplace in biology. It is often the case that thousands of features in a genomewide data set are tested against some null hypothesis, where a number of features are expected to be significant. Here we propose an approach to measuring statistical significance in these genomewide studies based on the concept of the false discovery rate. This approach offers a sensible balance between the number of true and false positives that is automatically calibrated and easily interpreted. In doing so, a measure of statistical significance called the q value is associated with each tested feature. The q value is similar to the well known p value, except it is a measure of significance in terms of the false discovery rate rather than the false positive rate. Our approach avoids a flood of false positive results, while offering a more liberal criterion than what has been used in genome scans for linkage.

Page Thumbnails

  • Thumbnail: Page 
[9440]
    [9440]
  • Thumbnail: Page 
9441
    9441
  • Thumbnail: Page 
9442
    9442
  • Thumbnail: Page 
9443
    9443
  • Thumbnail: Page 
9444
    9444
  • Thumbnail: Page 
9445
    9445