4.6 Article

A Compositional Model to Predict the Aggregated Isotope Distribution for Average DNA and RNA Oligonucleotides

期刊

METABOLITES
卷 11, 期 6, 页码 -

出版社

MDPI
DOI: 10.3390/metabo11060400

关键词

DNA; RNA; oligonucleotide; prediction; isotope distribution; mass spectrometry; software

资金

  1. Special Research Fund [BOF19DOC33]
  2. Hasselt University
  3. Janssen Pharmaceutical Companies of Johnson and Johnson

向作者/读者索取更多资源

Structural modifications of DNA and RNA molecules are crucial in epigenetic and posttranscriptional regulation. Various MS and MS/MS-based tools are being developed for the analysis of nucleic acids. A modeling approach has been presented to predict the isotope distribution of DNA and RNA molecules based on their elemental composition.
Structural modifications of DNA and RNA molecules play a pivotal role in epigenetic and posttranscriptional regulation. To characterise these modifications, more and more MS and MS/MS- based tools for the analysis of nucleic acids are being developed. To identify an oligonucleotide in a mass spectrum, it is useful to compare the obtained isotope pattern of the molecule of interest to the one that is theoretically expected based on its elemental composition. However, this is not straightforward when the identity of the molecule under investigation is unknown. Here, we present a modelling approach for the prediction of the aggregated isotope distribution of an average DNA or RNA molecule when a particular (monoisotopic) mass is available. For this purpose, a theoretical database of all possible DNA/RNA oligonucleotides up to a mass of 25 kDa is created, and the aggregated isotope distribution for the entire database of oligonucleotides is generated using the BRAIN algorithm. Since this isotope information is compositional in nature, the modelling method is based on the additive log-ratio analysis of Aitchison. As a result, a univariate weighted polynomial regression model of order 10 is fitted to predict the first 20 isotope peaks for DNA and RNA molecules. The performance of the prediction model is assessed by using a mean squared error approach and a modified Pearson's chi(2) goodness-of-fit measure on experimental data. Our analysis has indicated that the variability in spectral accuracy contributed more to the errors than the approximation of the theoretical isotope distribution by our proposed average DNA/RNA model. The prediction model is implemented as an online tool. An R function can be downloaded to incorporate the method in custom analysis workflows to process mass spectral data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据