4.6 Article

The earth is flat (p > 0.05): significance thresholds and the crisis of unreplicable research

Related references

Note: Only part of the references are listed.
Editorial Material Obstetrics & Gynecology

Multiple comparisons: a tutorial. Part 1. Understanding hypothesis testing

Michael T. Lawson et al.

BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY (2021)

Article Multidisciplinary Sciences

Meta-assessment of bias in science

Daniele Fanelli et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2017)

Article Mathematics, Interdisciplinary Applications

Is Most Published Research Really False?

Jeffrey T. Leek et al.

ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 4 (2017)

Letter Anesthesiology

Most of the time, P is an unreliable marker, so we need no exact cut-off

G. B. Drummond

BRITISH JOURNAL OF ANAESTHESIA (2016)

Article Public, Environmental & Occupational Health

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Sander Greenland et al.

EUROPEAN JOURNAL OF EPIDEMIOLOGY (2016)

Article Medicine, General & Internal

Evolution of Reporting P Values in the Biomedical Literature, 1990-2015

David Chavalarias et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2016)

Article Mathematics, Interdisciplinary Applications

Rejection odds and rejection ratios: A proposal for statistical practice in testing hypotheses

M. J. Bayarri et al.

JOURNAL OF MATHEMATICAL PSYCHOLOGY (2016)

Editorial Material Multidisciplinary Sciences

IS THERE A REPRODUCIBILITY CRISIS?

Monya Baker

NATURE (2016)

Letter Biochemical Research Methods

Confidence intervals are no salvation from the alleged fickleness of the P value

Jacques van Helden

NATURE METHODS (2016)

Article Cell Biology

What does research reproducibility mean?

Steven N. Goodman et al.

SCIENCE TRANSLATIONAL MEDICINE (2016)

Review Ecology

Transparency in Ecology and Evolution: Real Problems, Real Solutions

Timothy H. Parker et al.

TRENDS IN ECOLOGY & EVOLUTION (2016)

Article Biochemistry & Molecular Biology

Current Incentives for Scientists Lead to Underpowered Studies with Erroneous Conclusions

Andrew D. Higginson et al.

PLOS BIOLOGY (2016)

Article Multidisciplinary Sciences

The natural selection of bad science

Paul E. Smaldino et al.

ROYAL SOCIETY OPEN SCIENCE (2016)

Article Health Care Sciences & Services

Obtaining evidence by a single well-powered trial or several modestly powered trials

Joanna IntHout et al.

STATISTICAL METHODS IN MEDICAL RESEARCH (2016)

Article Psychology, Multidisciplinary

Marginally Significant Effects as Evidence for Hypotheses: Changing Attitudes Over Four Decades

Laura Pritschet et al.

PSYCHOLOGICAL SCIENCE (2016)

Article Psychology, Multidisciplinary

Misconceptions of the p-value among Chilean and Italian Academic Psychologists

Laura Badenes-Ribera et al.

FRONTIERS IN PSYCHOLOGY (2016)

Article Psychology, Social

Conceptualizing and evaluating the replication of research results

Leandre R. Fabrigar et al.

JOURNAL OF EXPERIMENTAL SOCIAL PSYCHOLOGY (2016)

Article Psychology, Multidisciplinary

What Should Researchers Expect When They Replicate Studies? A Statistical View of Replicability in Psychological Science

Prasad Patil et al.

PERSPECTIVES ON PSYCHOLOGICAL SCIENCE (2016)

News Item Multidisciplinary Sciences

How scientists fool themselves - and how they can stop

Regina Nuzzo

NATURE (2015)

Article Biochemical Research Methods

The fickle P value generates irreproducible results

Lewis G. Halsey et al.

NATURE METHODS (2015)

Article Plant Sciences

Does the P Value Have a Future in Plant Pathology?

L. V. Madden et al.

PHYTOPATHOLOGY (2015)

Article Multidisciplinary Sciences

Estimating the reproducibility of psychological science

Alexander A. Aarts et al.

SCIENCE (2015)

Article Psychology, Multidisciplinary

Is Psychology Suffering From a Replication Crisis? What Does Failure to Replicate Really Mean?

Scott E. Maxwell et al.

AMERICAN PSYCHOLOGIST (2015)

Article Psychology, Multidisciplinary

Small Telescopes: Detectability and the Evaluation of Replication Results

Uri Simonsohn

PSYCHOLOGICAL SCIENCE (2015)

Article Multidisciplinary Sciences

The Statistical Crisis in Science

Andrew Gelman et al.

AMERICAN SCIENTIST (2014)

Editorial Material Ecology

Rejoinder

Paul A. Murtaugh

ECOLOGY (2014)

Editorial Material Ecology

Comment on Murtaugh

Michael Lavine

ECOLOGY (2014)

Article Ecology

To P or not to P?

Jarrett J. Barber et al.

ECOLOGY (2014)

Article Ecology

In defense of P values

Paul A. Murtaugh

ECOLOGY (2014)

Review Health Care Sciences & Services

Six Persistent Research Misconceptions

Kenneth J. Rothman

JOURNAL OF GENERAL INTERNAL MEDICINE (2014)

Article Medicine, General & Internal

Increasing value and reducing waste in research design, conduct, and analysis

John P. A. Ioannidis et al.

LANCET (2014)

Article Biochemistry & Molecular Biology

P-values in genomics: Apparent precision masks high uncertainty

L. C. Lazzeroni et al.

MOLECULAR PSYCHIATRY (2014)

Article Multidisciplinary Sciences

Why Publishing Everything Is More Effective than Selective Publishing of Statistically Significant Results

Marcel A. L. M. van Assen et al.

PLOS ONE (2014)

Letter Multidisciplinary Sciences

Adaptive revised standards for statistical evidence

Luis Pericchi et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2014)

Letter Multidisciplinary Sciences

Revised evidence for statistical standards

Andrew Gelman et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2014)

Letter Multidisciplinary Sciences

Reproducibility issues in science, is P value really the only answer?

Jean Gaudart et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2014)

Letter Multidisciplinary Sciences

Reply to Gelman, Gaudart, Pericchi: More reasons to revise standards for statistical evidence

Valen E. Johnson

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2014)

Editorial Material Psychology, Biological

On the persistence of low power in psychological science

Ivan Vankov et al.

QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY (2014)

Editorial Material Medicine, General & Internal

How to Make More Published Research True

John P. A. Ioannidis

PLOS MEDICINE (2014)

Article Multidisciplinary Sciences

An investigation of the false discovery rate and the misinterpretation of p-values

David Colquhoun

ROYAL SOCIETY OPEN SCIENCE (2014)

Article Psychology, Multidisciplinary

Beyond Power Calculations: Assessing Type S (Sign) and Type M (Magnitude) Errors

Andrew Gelman et al.

PERSPECTIVES ON PSYCHOLOGICAL SCIENCE (2014)

Article Psychology, Multidisciplinary

Using Bayes to get the most out of non-significant results

Zoltan Dienes

FRONTIERS IN PSYCHOLOGY (2014)

Article Psychology, Multidisciplinary

Expectations for Replications Are Yours Realistic?

David J. Stanley et al.

PERSPECTIVES ON PSYCHOLOGICAL SCIENCE (2014)

Article Psychology, Mathematical

When decision heuristics and science collide

Erica C. Yu et al.

PSYCHONOMIC BULLETIN & REVIEW (2014)

Article Psychology, Multidisciplinary

Malignant side effects of null-hypothesis significance testing

Marc Branch

THEORY & PSYCHOLOGY (2014)

Article Psychology, Multidisciplinary

The New Statistics: Why and How

Geoff Cumming

PSYCHOLOGICAL SCIENCE (2014)

Editorial Material Multidisciplinary Sciences

Do We Really Need the S-word?

Megan D. Higgs

AMERICAN SCIENTIST (2013)

Article Health Care Sciences & Services

How confidence intervals become confusion intervals

James McCormack et al.

BMC MEDICAL RESEARCH METHODOLOGY (2013)

Editorial Material Public, Environmental & Occupational Health

Living with Statistics in Observational Research

Sander Greenland et al.

EPIDEMIOLOGY (2013)

Editorial Material Public, Environmental & Occupational Health

Reconciling Theory and Practice What Is to Be Done with P Values?

David A. Savitz

EPIDEMIOLOGY (2013)

Review Neurosciences

Deep impact: unintended consequences of journal rank

Bjoern Brembs et al.

FRONTIERS IN HUMAN NEUROSCIENCE (2013)

Article Mathematics, Interdisciplinary Applications

Replication, statistical consistency, and publication bias

Gregory Francis

JOURNAL OF MATHEMATICAL PSYCHOLOGY (2013)

Editorial Material Mathematics, Interdisciplinary Applications

Interrogating p-values

Andrew Gelman

JOURNAL OF MATHEMATICAL PSYCHOLOGY (2013)

Letter Neurosciences

Confidence and precision increase with high statistical power

Katherine S. Button et al.

NATURE REVIEWS NEUROSCIENCE (2013)

Review Neurosciences

Power failure: why small sample size undermines the reliability of neuroscience

Katherine S. Button et al.

NATURE REVIEWS NEUROSCIENCE (2013)

Article Multidisciplinary Sciences

Revised standards for statistical evidence

Valen E. Johnson

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2013)

Article Psychology, Multidisciplinary

Why the Resistance to Statistical Innovations? Bridging the Communication Gap

Donald Sharpe

PSYCHOLOGICAL METHODS (2013)

Article Public, Environmental & Occupational Health

Nonsignificance Plus High Power Does Not Imply Support for the Null Over the Alternative

Sander Greenland

ANNALS OF EPIDEMIOLOGY (2012)

Article Psychology, Educational

Confidence Intervals Make a Difference: Effects of Showing Confidence Intervals on Inferential Reasoning

Rink Hoekstra et al.

EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT (2012)

Article Computer Science, Interdisciplinary Applications

Negative results are disappearing from most disciplines and countries

Daniele Fanelli

SCIENTOMETRICS (2012)

Article Psychology, Multidisciplinary

A Vast Graveyard of Undead Theories: Publication Bias and Psychological Science's Aversion to the Null

Christopher J. Ferguson et al.

PERSPECTIVES ON PSYCHOLOGICAL SCIENCE (2012)

Article Social Sciences, Mathematical Methods

Subjective p Intervals Researchers Underestimate the Variability of p Values Over Replication

Jerry Lai et al.

METHODOLOGY-EUROPEAN JOURNAL OF RESEARCH METHODS FOR THE BEHAVIORAL AND SOCIAL SCIENCES (2012)

Article Psychology, Multidisciplinary

Measuring the Prevalence of Questionable Research Practices With Incentives for Truth Telling

Leslie K. John et al.

PSYCHOLOGICAL SCIENCE (2012)

Article Psychology, Applied

How Can Significance Tests Be Deinstitutionalized?

Marc Orlitzky

ORGANIZATIONAL RESEARCH METHODS (2012)

Article Statistics & Probability

P-Value Precision and Reproducibility

Dennis D. Boos et al.

AMERICAN STATISTICIAN (2011)

Review Behavioral Sciences

Issues in information theory-based statistical inference-a commentary from a frequentist's perspective

Roger Mundry

BEHAVIORAL ECOLOGY AND SOCIOBIOLOGY (2011)

Article Behavioral Sciences

Cryptic multiple hypotheses testing in linear models: overestimated effect sizes and the winner's curse

Wolfgang Forstmeier et al.

BEHAVIORAL ECOLOGY AND SOCIOBIOLOGY (2011)

Article Public, Environmental & Occupational Health

Magnitude of effects in clinical trials published in high-impact general medical journals

Konstantinos C. M. Siontis et al.

INTERNATIONAL JOURNAL OF EPIDEMIOLOGY (2011)

Article Psychology, Multidisciplinary

Bayes Factor Approaches for Testing Interval Null Hypotheses

Richard D. Morey et al.

PSYCHOLOGICAL METHODS (2011)

Article Statistics & Probability

Fisher, Neyman, and the Creation of Classical Statistics

Erich L. Lehmann

FISHER, NEYMAN, AND THE CREATION OF CLASSICAL STATISTICS (2011)

Editorial Material Psychiatry

How reliable are scientific studies?

Marcus R. Munafo et al.

BRITISH JOURNAL OF PSYCHIATRY (2010)

Article Health Care Sciences & Services

Dissemination and publication of research findings: an updated review of related biases

F. Song et al.

HEALTH TECHNOLOGY ASSESSMENT (2010)

Review Mathematical & Computational Biology

Meta-research: The art of getting it wrong

John P. A. Ioannidisa

RESEARCH SYNTHESIS METHODS (2010)

Article Psychology, Multidisciplinary

Confidence intervals permit, but do not guarantee, better inference than statistical significance testing

Melissa Coulson et al.

FRONTIERS IN PSYCHOLOGY (2010)

Letter Biochemistry & Molecular Biology

Bias in genetic association studies and impact factor

M. R. Munafo et al.

MOLECULAR PSYCHIATRY (2009)

Article Psychology

The Importance of Proving the Null

C. R. Gallistel

PSYCHOLOGICAL REVIEW (2009)

Review Psychology, Mathematical

What is the probability of replicating a statistically significant effect?

Jeff Miller

PSYCHONOMIC BULLETIN & REVIEW (2009)

Article Statistics & Probability

P-values are random variables

Duncan J. Murdoch et al.

AMERICAN STATISTICIAN (2008)

Review Public, Environmental & Occupational Health

Why most discovered true associations are inflated

John P. A. Ioannidis

EPIDEMIOLOGY (2008)

Editorial Material Medicine, General & Internal

Why Current Publication Practices May Distort Science

Neal S. Young et al.

PLOS MEDICINE (2008)

Article Psychology, Multidisciplinary

Replication and p Intervals p Values Predict the Future Only Vaguely, but Confidence Intervals Do Much Better

Geoff Cumming

PERSPECTIVES ON PSYCHOLOGICAL SCIENCE (2008)

Article Communication

A communication researchers' guide to null hypothesis significance testing and alternatives

Timothy R. Levine et al.

HUMAN COMMUNICATION RESEARCH (2008)

Article Social Sciences, Mathematical Methods

Publication bias in empirical sociological research - Do arbitrary significance levels distort published results?

Alan S. Gerber et al.

SOCIOLOGICAL METHODS & RESEARCH (2008)

Article Education & Educational Research

Inference by Eye: Pictures of Confidence Intervals and Thinking About Levels of Confidence

Geoff Cumming

TEACHING STATISTICS (2007)

Article Genetics & Heredity

Upward bias in odds ratio estimates from genome-wide association studies

Chad Garner

GENETIC EPIDEMIOLOGY (2007)

Article Genetics & Heredity

Overcoming the winner's curse:: Estimating penetrance parameters from case-control data

Sebastian Zollner et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2007)

Article Psychology, Mathematical

Probability as certainty:: Dichotomous thinking and the misuse of p values

Rink Hoekstra et al.

PSYCHONOMIC BULLETIN & REVIEW (2006)

Article Statistics & Probability

The difference between significant and not significant is not itself statistically significant

Andrew Gelman et al.

AMERICAN STATISTICIAN (2006)

Article Ecology

Why do we still use stepwise modelling in ecology and behaviour?

Mark J. Whittingham et al.

JOURNAL OF ANIMAL ECOLOGY (2006)

Article History & Philosophy Of Science

Models and statistical inference: The controversy between Fisher and Neyman-Pearson

J Lenhard

BRITISH JOURNAL FOR THE PHILOSOPHY OF SCIENCE (2006)

Article Psychology, Clinical

Misuse of statistical tests in Archives of Clinical Neuropsychology publications

P Schatz et al.

ARCHIVES OF CLINICAL NEUROPSYCHOLOGY (2005)

Review Medicine, General & Internal

Contradicted and initially stronger effects in highly cited clinical research

JPA Ioannidis

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2005)

Article Psychology, Experimental

The p-value fallacy and how to avoid it

P Dixon

CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE (2003)

Article Statistics & Probability

Confusion over measures of evidence (p's) versus errors (a's) in classical statistical testing

R Hubbard et al.

AMERICAN STATISTICIAN (2003)

Article Behavioral Sciences

A survey of the statistical power of research in behavioral ecology and animal behavior

MD Jennions et al.

BEHAVIORAL ECOLOGY (2003)

Article Psychology, Multidisciplinary

Even statisticians are not immune to misinterpretations of null hypothesis significance tests

MP Lecoutre et al.

INTERNATIONAL JOURNAL OF PSYCHOLOGY (2003)

Letter Mathematical & Computational Biology

A comment on replication, p-values and evidence

S Senn

STATISTICS IN MEDICINE (2002)

Article Biology

Relationships fade with time: a meta-analysis of temporal trends in publication in ecology and evolution

MD Jennions et al.

PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES (2002)

Article Psychology, Mathematical

Interpretation of significance levels by psychological researchers: The .05 cliff effect may be overstated

J Poitevineau et al.

PSYCHONOMIC BULLETIN & REVIEW (2001)

Article Genetics & Heredity

Large upward bias in estimation of locus-specific effects from genomewide scans

HHH Göring et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2001)

Article Statistics & Probability

Calibration of p values for testing precise null hypotheses

T Sellke et al.

AMERICAN STATISTICIAN (2001)

Article Medicine, General & Internal

Sifting the evidence - what's wrong with significance tests?

JAC Sterne et al.

BMJ-BRITISH MEDICAL JOURNAL (2001)

Article Psychology, Multidisciplinary

Null hypothesis significance testing - On the survival of a flawed method

J Krueger

AMERICAN PSYCHOLOGIST (2001)

Article Ecology

Null hypothesis testing: Problems, prevalence, and an alternative

DR Anderson et al.

JOURNAL OF WILDLIFE MANAGEMENT (2000)