Evaluation of three methods for calculating statistical significance when incorporating a systematic uncertainty into a test of the backgroundonly hypothesis for a Poisson process
Abstract
Hypothesis tests for the presence of new sources of Poisson counts amidst background processes are frequently performed in high energy physics (HEP), gamma ray astronomy (GRA), and other branches of science. While there are conceptual issues already when the mean rate of background is precisely known, the issues are even more difficult when the mean background rate has nonnegligible uncertainty. After describing a variety of methods to be found in the HEP and GRA literature, we consider in detail three classes of algorithms and evaluate them over a wide range of parameter space, by the criterion of how close the ensembleaverage Type I error rate (rejection of the backgroundonly hypothesis when it is true) compares with the nominal significance level given by the algorithm. We recommend wider use of an algorithm firmly grounded in frequentist tests of the ratio of Poisson means, although for very low counts the overcoverage can be severe due to the effect of discreteness. We extend the studies of Cranmer, who found that a popular Bayesianfrequentist hybrid can undercover severely when taken to high Zvalues. We also examine the profile likelihood method, which has long been used in GRA and HEP; it provides an excellent approximation in much of the parameter space, as previously studied by Rolke and collaborators.
 Publication:

Nuclear Instruments and Methods in Physics Research A
 Pub Date:
 October 2008
 DOI:
 10.1016/j.nima.2008.07.086
 arXiv:
 arXiv:physics/0702156
 Bibcode:
 2008NIMPA.595..480C
 Keywords:

 Physics  Data Analysis;
 Statistics and Probability
 EPrint:
 In v4, added line to Table 1 so that Z_PL is given separately for L_G and L_P