Weak Statistical Standards Implicated in Scientific Irreproducibility
By Erika Check Hayden,
Nature
| 11. 11. 2013
The
plague of non-reproducibility in science may be mostly due to scientists’ use of weak statistical tests, as shown by an innovative method developed by statistician Valen Johnson, at Texas A&M University in College Station.
Johnson compared the strength of two types of tests: frequentist tests, which measure how unlikely a finding is to occur by chance, and Bayesian tests, which measure the likelihood that a particular hypothesis is correct given data collected in the study. The strength of the results given by these two types of tests had not been compared before, because they ask slightly different types of questions.
So Johnson developed a method that makes the results given by the tests — the P value in the frequentist paradigm, and the Bayes factor in the Bayesian paradigm — directly comparable. Unlike frequentist tests, which use objective calculations to reject a null hypothesis, Bayesian tests require the tester to define an alternative hypothesis to be tested — a subjective process. But Johnson developed a 'uniformly most powerful' Bayesian test that defines the alternative hypothesis in a standard way...
Related Articles
By Evelina Johansson Wilén, Jacobin | 01.18.2026
In her book The Argonauts, Maggie Nelson describes pregnancy as an experience marked by a peculiar duality. On the one hand, it is deeply transformative, bodily alien, sometimes almost incomprehensible to the person undergoing it. On the other hand...
By Daphne O. Martschenko and Julia E. H. Brown, Hastings Bioethics Forum | 01.14.2026
There is growing concern that falling fertility rates will lead to economic and demographic catastrophe. The social and political movement known as pronatalism looks to combat depopulation by encouraging people to have as many children as possible. But not just...
By Paula Siverino Bavio, BioNews | 01.12.2026
For more than ten years, gestational surrogacy in Uruguay existed in a state of legal latency: provided for by law, carefully regulated as an exception, yet without a single birth to make it real.
That situation changed with the arrival...
By Andrew Gregory, The Guardian | 01.11.2026
Google has removed some of its artificial intelligence health summaries after a Guardian investigation found people were being put at risk of harm by false and misleading information.
The company has said its AI Overviews, which use generative AI to...