The capability to achieve biogeographic ancestry (BGA) information from DNA profiles have been largely explored in forensic genetics because of its potential usefulness in providing investigative clues. For law enforcement and security purposes, when genetic data have been obtained from unknown evidence, but no reference samples are available and no hints come out from DNA databases, it would be extremely useful at least to infer the ethno-geographic origin of the stain donor by just examining traditional STRs DNA profiles. Current protocols for ethnic origin estimation using STRs profiles are usually based on Principal Component Analysis approaches and Bayesian methods. The present study provides an alternative approach that involves the use of target multivariate data analysis strategies for estimation of the BGA information from unknown biological traces. A powerful multivariate technique such as Partial Least Squares-Discriminant Analysis (PLS-DA) has been applied on NIST U.S. population datasets containing, for instance, the allele frequencies of African-American, Asian, Caucasian and Hispanic individuals. PLS-DA approach provided robust classifications, yielding high sensitivity and specificity models capable of discriminating the populations on ethnic basis. Finally, a real casework has been examined by extending the developed model to smaller and more geographically-restricted populations involving, for instance, Albanian, Italian and Montenegrian individuals.

A multivariate statistical approach to for the evaluation of the biogeographical ancestry information from traditional STRs

Alladio E.;Vincenti M.;
2019-01-01

Abstract

The capability to achieve biogeographic ancestry (BGA) information from DNA profiles have been largely explored in forensic genetics because of its potential usefulness in providing investigative clues. For law enforcement and security purposes, when genetic data have been obtained from unknown evidence, but no reference samples are available and no hints come out from DNA databases, it would be extremely useful at least to infer the ethno-geographic origin of the stain donor by just examining traditional STRs DNA profiles. Current protocols for ethnic origin estimation using STRs profiles are usually based on Principal Component Analysis approaches and Bayesian methods. The present study provides an alternative approach that involves the use of target multivariate data analysis strategies for estimation of the BGA information from unknown biological traces. A powerful multivariate technique such as Partial Least Squares-Discriminant Analysis (PLS-DA) has been applied on NIST U.S. population datasets containing, for instance, the allele frequencies of African-American, Asian, Caucasian and Hispanic individuals. PLS-DA approach provided robust classifications, yielding high sensitivity and specificity models capable of discriminating the populations on ethnic basis. Finally, a real casework has been examined by extending the developed model to smaller and more geographically-restricted populations involving, for instance, Albanian, Italian and Montenegrian individuals.
2019
7
1
253
255
http://www.elsevier.com
Autosomal STRs; Biogeographic ancestry (BGA); Forensim; Partial Least Squares – Discriminant Analysis (PLS-DA); Prediction
Alladio E.; Rocca C.D.; Cruciani F.; Vincenti M.; Garofano P.; Berti A.; Barni F.
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S1875176819303105-main.pdf

Accesso riservato

Descrizione: Articolo principale
Tipo di file: PDF EDITORIALE
Dimensione 2.38 MB
Formato Adobe PDF
2.38 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1727844
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact