4.7 Article

Principal Component Analyses (PCA)-based findings in population genetic studies are highly biased and must be reevaluated

Related references

Note: Only part of the references are listed.
Review Biotechnology & Applied Microbiology

Population genetic considerations for using biobanks as international resources in the pandemic era and beyond

Hannah Carress et al.

Summary: The over-representation of Europeans in genomic studies restricts global understanding of disease risk and inhibits research into genomic differences between carriers and patients. To address this, more diverse samples are needed, with diversity quantified, compared and annotated for insight.

BMC GENOMICS (2021)

Article Multidisciplinary Sciences

Learning from reproducing computational results: introducing three principles and the Reproduction Package

M. S. Krafczyk et al.

Summary: This article discusses efforts to reproduce computational results for seven published articles and outlines three principles to guide reproducible computational research: transparency, ease of (re-)executability, and determinism. Additionally, 12 specific guidelines are provided for implementing these principles in practice.

PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES (2021)

Article Genetics & Heredity

On the Unfounded Enthusiasm for Soft Selective Sweeps III: The Supervised Machine Learning Algorithm That Isn't

Eran Elhaik et al.

Summary: Soft selective sweep mechanisms have gained significant importance in evolutionary studies, but caution should be taken when interpreting results involving the use of supervised machine learning techniques due to the lack of legitimate training datasets, potentially leading to misleading conclusions.

GENES (2021)

Article Evolutionary Biology

Why Clusters and Other Patterns Can Seem to be Found in Analyses of High-Dimensional Data

F. James Rohlf

Summary: Recent studies have found that scatterplots generated using between-groups principal components analysis can suggest significant group differences even when samples are from the same multivariate normal distribution, particularly when there are many variables and small sample sizes. Canonical variates analysis demonstrates an even greater separation of groups, despite correct test statistics results, with these issues persisting even in large samples with uncorrelated variables. These problems are attributed to sampling from high-dimensional spaces and the curse of dimensionality, with a useful index for predicting false clusters being the ratio of variables to sample size. Additionally, multiple regression analysis can encounter similar issues when dealing with large numbers of independent variables due to the incompatibility of representing both full p-dimensional distances and low-dimensional projections in the same plot. These findings have implications for multivariate analyses in biology, including geometric morphometrics.

EVOLUTIONARY BIOLOGY (2021)

Article Biochemistry & Molecular Biology

High-resolution inference of genetic relationships among Jewish populations

Naama M. Kopelman et al.

EUROPEAN JOURNAL OF HUMAN GENETICS (2020)

Article Multidisciplinary Sciences

Factor analysis of ancient population genomic samples

Olivier Francois et al.

NATURE COMMUNICATIONS (2020)

Article Biochemistry & Molecular Biology

Substructured Population Growth in the Ashkenazi Jews Inferred with Approximate Bayesian Computation

Ariella L. Gladstein et al.

MOLECULAR BIOLOGY AND EVOLUTION (2019)

Article Multidisciplinary Sciences

The genomic history of the Iberian Peninsula over the past 8000 years

Inigo Olalde et al.

SCIENCE (2019)

Article Neurosciences

Evidence of Assortative Mating in Autism Spectrum Disorder

Siobhan Connolly et al.

BIOLOGICAL PSYCHIATRY (2019)

Editorial Material Ecology

Be careful with your principal components

Mats Bjorklund

EVOLUTION (2019)

Article Genetics & Heredity

A Prospective Analysis of Genetic Variants Associated with Human Lifespan

Kevin M. Wright et al.

G3-GENES GENOMES GENETICS (2019)

News Item Multidisciplinary Sciences

Genetics lab accused of misusing African DNA

Erik Stokstad

SCIENCE (2019)

Article Biochemical Research Methods

Pair Matcher (PaM): fast model-based optimization of treatment/case-control matches

Eran Elhaik et al.

BIOINFORMATICS (2019)

Article Multidisciplinary Sciences

A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots

Daniel J. Lawson et al.

NATURE COMMUNICATIONS (2018)

Article Multidisciplinary Sciences

The UK Biobank resource with deep phenotyping and genomic data

Clare Bycroft et al.

NATURE (2018)

Article Biochemistry & Molecular Biology

Across-cohort QC analyses of GWAS summary statistics from complex traits

Guo-Bo Chen et al.

EUROPEAN JOURNAL OF HUMAN GENETICS (2017)

Article Genetics & Heredity

Differences in the rare variant spectrum among human populations

Iain Mathieson et al.

PLOS GENETICS (2017)

Editorial Material Genetics & Heredity

Editorial: Population Genetics of Worldwide Jewish People

Eran Elhaik

FRONTIERS IN GENETICS (2017)

Article Genetics & Heredity

A GWAS in uveal melanoma identifies risk polymorphisms in the CLPTM1L locus

Lenha Mobuchon et al.

NPJ GENOMIC MEDICINE (2017)

Article Biochemistry & Molecular Biology

Detecting Genomic Signatures of Natural Selection with Principal Component Analysis: Application to the 1000 Genomes Data

Nicolas Duforet-Frebourg et al.

MOLECULAR BIOLOGY AND EVOLUTION (2016)

Article Genetics & Heredity

Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia

Kevin J. Galinsky et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2016)

Article Evolutionary Biology

Localizing Ashkenazic Jews to Primeval Villages in the Ancient Iranian Lands of Ashkenaz

Ranajit Das et al.

GENOME BIOLOGY AND EVOLUTION (2016)

Article Multidisciplinary Sciences

Genomic insights into the origin of farming in the ancient Near East

Iosif Lazaridis et al.

NATURE (2016)

Editorial Material Multidisciplinary Sciences

IS THERE A REPRODUCIBILITY CRISIS?

Monya Baker

NATURE (2016)

Article Multidisciplinary Sciences

Reconstructing Druze population history

Scarlett Marshall et al.

SCIENTIFIC REPORTS (2016)

Article Multidisciplinary Sciences

Who has your DNA--or wants it

J. Kaiser

SCIENCE (2015)

Article Multidisciplinary Sciences

Quantitating and Dating Recent Gene Flow between European and East Asian Populations

Pengfei Qin et al.

SCIENTIFIC REPORTS (2015)

Article Multidisciplinary Sciences

Genome flux and stasis in a five millennium transect of European prehistory

Cristina Gamba et al.

NATURE COMMUNICATIONS (2014)

Article Genetics & Heredity

Genome-wide analysis of the role of copy-number variation in pancreatic cancer risk

Jason A. Willis et al.

FRONTIERS IN GENETICS (2014)

Article Genetics & Heredity

RFMix: A Discriminative Modeling Approach for Rapid and Robust Local-Ancestry Inference

Brian K. Maples et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2013)

Article Multidisciplinary Sciences

Reconstructing Roma History from Genome-Wide Data

Priya Moorjani et al.

PLOS ONE (2013)

Article Multidisciplinary Sciences

Genome-wide data substantiate Holocene gene flow from India to Australia

Irina Pugach et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2013)

Article Genetics & Heredity

Ethiopian Genetic Diversity Reveals Linguistic Stratification and Complex Influences on the Ethiopian Gene Pool

Luca Pagani et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2012)

Article Genetics & Heredity

A model-based approach for analysis of spatial structure in genetic data

Wen-Yun Yang et al.

NATURE GENETICS (2012)

Article Multidisciplinary Sciences

Empirical Distributions of FST from Large-Scale Human Polymorphism Data

Eran Elhaik

PLOS ONE (2012)

Article Multidisciplinary Sciences

North African Jewish and non-Jewish populations form distinctive, orthogonal clusters

Christopher L. Campbell et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2012)

Article Multidisciplinary Sciences

Origins and Genetic Legacy of Neolithic Farmers and Hunter-Gatherers in Europe

Pontus Skoglund et al.

SCIENCE (2012)

Article Multidisciplinary Sciences

A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes

Daniel G. MacArthur et al.

SCIENCE (2012)

Article Multidisciplinary Sciences

The genetic prehistory of southern Africa

Joseph K. Pickrell et al.

NATURE COMMUNICATIONS (2012)

Article Genetics & Heredity

The History of African Gene Flow into Southern Europeans, Levantines, and Jews

Priya Moorjani et al.

PLOS GENETICS (2011)

Article Genetics & Heredity

Clustering by genetic ancestry using genome-wide SNP data

Nadia Solovieff et al.

BMC GENETICS (2010)

Article Biochemistry & Molecular Biology

Genetic structure of a unique admixed population: implications for medical research

Nick Patterson et al.

HUMAN MOLECULAR GENETICS (2010)

Article Biochemistry & Molecular Biology

Principal Component Analysis under Population Genetic Models of Range Expansion and Admixture

Olivier Francois et al.

MOLECULAR BIOLOGY AND EVOLUTION (2010)

Article Multidisciplinary Sciences

The genome-wide structure of the Jewish people

Doron M. Behar et al.

NATURE (2010)

Article Multidisciplinary Sciences

Integrating common and rare genetic variation in diverse human populations

David M. Altshuler et al.

NATURE (2010)

Article Multidisciplinary Sciences

Genetic history of an archaic hominin group from Denisova Cave in Siberia

David Reich et al.

NATURE (2010)

Article Genetics & Heredity

Common SNPs explain a large proportion of the heritability for human height

Jian Yang et al.

NATURE GENETICS (2010)

Article Multidisciplinary Sciences

Signatures of founder effects, admixture, and selection in the Ashkenazi Jewish population

Steven M. Bray et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2010)

Article Genetics & Heredity

Genome-wide Insights into the Patterns and Determinants of Fine-Scale Population Structure in Humans

Shameek Biswas et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2009)

Article Multidisciplinary Sciences

Reconstructing Indian population history

David Reich et al.

NATURE (2009)

Article Multidisciplinary Sciences

Analysis of genomic diversity in Mexican Mestizo populations to develop genomic medicine in Mexico

Irma Silva-Zolezzi et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2009)

Editorial Material Multidisciplinary Sciences

The Illusive Gold Standard in Genetic Ancestry Testing

Sandra Soo-Jin Lee et al.

SCIENCE (2009)

Article Genetics & Heredity

A Genealogical Interpretation of Principal Components Analysis

Gil McVean

PLOS GENETICS (2009)

Article Multidisciplinary Sciences

Genes mirror geography within Europe

John Novembre et al.

NATURE (2008)

Editorial Material Genetics & Heredity

Principal component analysis of genetic data

David Reich et al.

NATURE GENETICS (2008)

Article Genetics & Heredity

Interpreting principal component analyses of spatial population genetic variation

John Novembre et al.

NATURE GENETICS (2008)

Article Genetics & Heredity

Discerning the ancestry of European Americans in genetic association studies

Alkes L. Price et al.

PLOS GENETICS (2008)

Article Multidisciplinary Sciences

The Druze: A Population Genetic Refugium of the Near East

Liran I. Shlush et al.

PLOS ONE (2008)

Article Multidisciplinary Sciences

Analysis of East Asia Genetic Substructure Using Genome-Wide SNP Arrays

Chao Tian et al.

PLOS ONE (2008)

Article Multidisciplinary Sciences

Worldwide human relationships inferred from genome-wide patterns of variation

Jun Z. Li et al.

SCIENCE (2008)

Article Multidisciplinary Sciences

A second generation human haplotype map of over 3.1 million SNPs

Kelly A. Frazer et al.

NATURE (2007)

Article Genetics & Heredity

Population structure and eigenanalysis

Nick Patterson et al.

PLOS GENETICS (2006)

Article Genetics & Heredity

A worldwide survey of haplotype variation and linkage disequilibrium in the human genome

Donald F. Conrad et al.

NATURE GENETICS (2006)

Article Genetics & Heredity

Principal components analysis corrects for stratification in genome-wide association studies

Alkes L. Price et al.

NATURE GENETICS (2006)

Review Multidisciplinary Sciences

A haplotype map of the human genome

D Altshuler et al.

NATURE (2005)

Article Medicine, General & Internal

Why most published research findings are false

JPA Ioannidis

PLOS MEDICINE (2005)