Please use this identifier to cite or link to this item:
http://hdl.handle.net/2445/149084
Title: | Manipulating the alpha level cannot cure significance testing |
Author: | Trafimow, David Amrhein, Valentin Areshenkoff, Corson N. Barrera-Causil, Carlos J. Beh, Eric J. Bilgiç, Yusuf K. Bono Cabré, Roser Bradley, Michael T. Briggs, William M. Cepeda-Freyre, Héctor A. Chaigneau, Sergio E. Ciocca, Daniel R. Correa, Juan Carlos de Boer, Michiel R. Dhar, Subhra Sankar Dolgov, Igor Gómez Benito, Juana Grendar, Marian Grice, James W. Guerrero Giménez, Martín E. Gutiérrez, Andrés Huedo-Medina, Tania B. Jaffe, Klaus Janyan, Armina Karimnezhad, Ali Korner-Nievergelt, Fränzi Kosugi, Koji Lachmair, Martin Ledesma, Rubén D. Limongi, Roberto Liuzza, Marco Tullio Lombardo, Rosaria Marks, Michael Meinlschmidt, Gunther Nalborczyk, Ladislas Nguyen, Hung T. Ospina, Raydonal Pérez-González, José D. Pfister, Roland Rahona, Juan José Cousineau, Denis |
Keywords: | Tests d'hipòtesi (Estadística) Statistical hypothesis testing |
Issue Date: | 2017 |
Publisher: | PeerJ |
Abstract: | We argue that depending on p-values to reject null hypotheses, including a recent call for changing the canonical alpha level for statistical significance from .05 to .005, is deleterious for the finding of new discoveries and the progress of science. Given that blanket and variable criterion levels both are problematic, it is sensible to dispense with significance testing altogether. There are alternatives that address study design and determining sample sizes much more directly than significance testing does; but none of the statistical tools should replace significance testing as the new magic method giving clear-cut mechanical answers. Inference should not be based on single studies at all, but on cumulative evidence from multiple independent studies. When evaluating the strength of the evidence, we should consider, for example, auxiliary assumptions, the strength of the experimental design, or implications for applications. To boil all this down to a binary decision based on a p-value threshold of .05, .01, .005, or anything else, is not acceptable. |
Note: | Reproducció del document publicat a: https://doi.org/10.7287/peerj.preprints.3411v1 |
It is part of: | PeerJ, 2017, vol. 5, p. e3411v3 |
URI: | http://hdl.handle.net/2445/149084 |
Related resource: | https://doi.org/10.7287/peerj.preprints.3411v1 |
ISSN: | 2167-8359 |
Appears in Collections: | Articles publicats en revistes (Psicologia Social i Psicologia Quantitativa) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
677902.pdf | 283.62 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License