4.5 Article

Assessing non-LUS stutter in DNA sequence data

Journal

FORENSIC SCIENCE INTERNATIONAL-GENETICS
Volume 59, Issue -, Pages -

Publisher

ELSEVIER IRELAND LTD
DOI: 10.1016/j.fsigen.2022.102706

Keywords

Stutter; Massively parallel sequencing (MPS); Short tandem repeats (STR); LUS; Non-LUS; DNA sequence

Funding

  1. National Institute of Justice, Office of Justice Programs, U.S. Department of Justice, USA [2018-DU-BX-0202]
  2. New York City Office of the Chief Medical Examiner - Department of Forensic Biology, Wash-ington DC Department of Forensic Sciences
  3. Promega Corporation

Ask authors/readers for more resources

Forensic DNA analysis is a well-developed field that uses short tandem repeats (STRs) to discriminate individuals, but the challenge of stutter artifacts during analysis needs to be better understood. Massively parallel sequencing (MPS) data can accurately detect and characterize stutter, including non-longest uninterrupted stretch (non-LUS) stutter. This study highlights the importance of characterizing motif-and allele-specific stutter, especially for analyzing DNA profiles with low-level contributors.
Forensic DNA analysis is among the most well-recognized and well-developed forensic disciplines. The field's use of DNA markers known as short tandem repeats (STRs) offer a robust means of discriminating individuals while also introducing challenges to the analysis. One of these challenges, stutter, is the result of a non-biological artifact introduced during PCR. The formation and amplification of these stutter products can occur at rates as high as 15-20% of the parent allele. The challenge inherent in this process is differentiating stutter artifacts from true alleles, particularly in the presence of a minor contributor. Traditionally, DNA profiles are obtained using capillary electrophoresis (CE), where amplified DNA fragments are separated by size, not sequence, and the identification of stutter is performed on a locus-specific level. The use of CE-based fragment data rather than sequence-based data, has limited the community's understanding of the precise behavior of stutter. Massively parallel sequencing (MPS) data provides an opportunity to better characterize stutter, permitting a more accurate means of detecting both size-or longest uninterrupted stretch (LUS)-based stutter but also allele and motif specific stutter characteristics. This study sheds light on the value of characterizing motif-and allele-specific stutter, including non-LUS stutter, when using MPS methods. Analysis and characterization of stutter sequences was performed using data generated from 539 samples amplified with the ForenSeq and PowerSeq 46GY library preparation kit and sequenced on the Illumina MiSeq FGx. Assessment of non-LUS stutter begins with calculating stutter rates for all potential stutter products at a given locus (and allele), additionally, the occurrence of these discrete stutter products were quantified. Results show that although the LUS sequence stutters at a higher rate than non-LUS motifs, the non-LUS stutter products do occur at detectable levels and potentially influence sequence-based mixture analysis. The data indicate that the stutter from one motif or allele can be distinguished from another motif or allele based on their unique stutter rates; however, the number of stutter products from each motif or allele may similarly make up the overall pool of stutter products. Motif-and allele specific stutter models provide the most comprehensive analysis of sequence stutter rates and provide the ability to differentiate stutter sequences more accurately from true allele stutter. This information provides a foundation for including the characterization of non-LUS stutter products when analyzing DNA profiles, specifically mixtures with potential low-level contributors.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available