Should you test for statistical significance?
Subscribe to Decision Science News by Email (one email per week, easy unsubscribe)
ARGUMENTS AGAINST ALL SIGNIFICANCE TESTS
This week, the always-provocative J. Scott Armstrong submits this comment to Decision Science News:
“About two years ago, I was a reasonable person who argued that tests of statistical significance were useful in some limited situations. After completing research for “Significance tests harm progress in forecasting” in the International Journal of Forecasting, 23 (2007), 321-327, I have concluded that tests of statistical significance should never be used. Here is the abstract:
I briefly summarize prior research showing that tests of statistical significance are improperly used even in leading scholarly journals. Attempts to educate researchers to avoid pitfalls have had little success. Even when done properly, however, statistical significance tests are of no value. Other researchers have discussed reasons for these failures. I was unable to find empirical evidence to support the use of significance tests under any conditions. I then show that tests of statistical significance are harmful to the development of scientific knowledge because they distract the researcher from the use of proper methods. I illustrate the dangers of significance tests by examining a re-analysis of the M3-Competition. Although the authors of the re-analysis conducted a proper series of statistical tests, they suggested that the original M3-Competition was not justified in concluding that combined forecasts reduce errors, and that the selection of the best method is dependent on the selection of a proper error measure; however, I show that the original conclusions were correct. Authors should avoid tests of statistical significance; instead, they should report on effect sizes, confidence intervals, replications/extensions, and meta-analyses. Practitioners should ignore significance tests and journals should discourage them. http://dx.doi.org/10.1016/j.ijforecast.2007.03.004
The paper is followed by commentaries by Keith Ord, Herman Stekler, and Paul Goodwin, and by my reply “Statistical significance tests are unnecessary even when properly done and properly interpreted: Reply to commentaries” , which can be found online at http://dx.doi.org/10.1016/j.ijforecast.2007.01.010
This is happy news for practitioners, researchers, and students. On the other hand, it might create anguish among faculty who teach people about statistical significance.”