Publikationsansicht

Control of the False Discovery Rate Applied to the Detection of Positively Selected Amino Acid Sites (2006)

Abstract
In this article, we consider the probabilistic identification of amino acid positions that evolve under positive selection as a multiple hypothesis testing problem. The null hypothesis "H0,s: site s evolves under a negative selection or under a neutral process of evolution" is tested at each codon site of the alignment of homologous coding sequences. Standard hypothesis testing is based on the control of the expected proportion of falsely rejected null hypotheses or type-I error rate. As the number of tests increases, however, the power of an individual test may become unacceptably low. Recent advances in statistics have shown that the false discovery rate--in this case, the expected proportion of sites that do not evolve under positive selection among those that are estimated to evolve under this selection regime--is a quantity that can be controlled. Keeping the proportion of false positives low among the significant results generally leads to an increase in power. In this article, we show that controlling the false detection rate is relevant when searching for positively selected sites. We also compare this new approach to traditional methods using extensive simulations.

Details der Publikation
Download http://hal-lirmm.ccsd.cnrs.fr/lirmm-00135171/en/
Herausgeber HAL - CCSD
Archiv CCSd/HAL : e-articles server (based on gBUS) (France)
Keywords Computer Science/Bioinformatics
Typ peer-reviewed article
Sprache Englisch
Verknüpfungen http://hal-lirmm.ccsd.cnrs.fr/docs/00/13/51/71/PDF/497.R2_everything.pdf

Literaturangaben in der Publikation (4)
The control of the false discovery rate in multiple testing under dependency (2001)
Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene.
Accuracy and Power of Statistical Methods for Detecting Adaptive Evolution in Protein Coding Sequences and for Identifying Positively Selected Sites
Codon-substitution models for heterogeneous selection pressure at amino acid sites.