4.5 Article

Biogeographical ancestry, variable selection, and PLS-DA method: a new panel to assess ancestry in forensic samples via MPS technology

Journal

FORENSIC SCIENCE INTERNATIONAL-GENETICS
Volume 62, Issue -, Pages -

Publisher

ELSEVIER IRELAND LTD
DOI: 10.1016/j.fsigen.2022.102806

Keywords

Biogeographical ancestry; Forensic samples; Victim?s identification; SNPs; Machine learning; Features selection

Ask authors/readers for more resources

Forensic scientists are using DNA analysis to infer externally visible features and biogeographical ancestry. With advancements in sequencing technologies, new tools have been developed to simultaneously analyze millions of markers. This study created a new BGA panel using over 3000 SNPs and evaluated its accuracy in inferring ancestry. The best SNPs were selected to predict biogeographical ancestry and applied in routine forensic cases using PCR amplification.
As evidenced by the large number of articles recently published in the literature, forensic scientists are making great efforts to infer externally visible features and biogeographical ancestry (BGA) from DNA analysis. Just as phenotypic, ancestry information obtained from DNA can provide investigative leads to identify the victims (missing/unidentified persons, crime/armed conflict/mass disaster victims) or trace their perpetrators when no matches were found with the reference profile or in the database. Recently, the advent of Massively Parallel Sequencing technologies associated with the possibility of harnessing high-throughput genetic data allowed us to investigate the associations between phenotypic and genomic variations in worldwide human populations and develop new BGA forensic tools capable of simultaneously analyzing up to millions of markers if for example the ancient DNA approach of hybridization capture was adopted to target SNPs of interest. In the present study, a selection of more than 3000 SNPs was performed to create a new BGA panel and the accuracy of the new panel to infer ancestry from unknown samples was evaluated by the PLS-DA method. Subsequently, the panel created was assessed using three variable selection techniques (Backward variable elimination, Genetic Algorithm and Regularized elimination procedure), and the best SNPs in terms of inferring bio-geographical ancestry at inter -and intra-continental level were selected to obtain panels to predict BGA with a reduced number of selected markers to be applied in routine forensic cases where PCR amplification is the best choice to target SNPs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available