3.8 Article

HPMPdb: A machine learning-ready database of protein molecular phenotypes associated to human missense variants

期刊

CURRENT RESEARCH IN STRUCTURAL BIOLOGY
卷 4, 期 -, 页码 167-174

出版社

ELSEVIER
DOI: 10.1016/j.crstbi.2022.04.004

关键词

Single aminoacid variants; Molecular phenotype; Variant-effect predictor; Database; Bioinformatics

资金

  1. FWO post-doctoral fellowship
  2. Flanders Institute for Biotechnology (VIB) [C0401]
  3. KU Leuven (postdoctoral mandate) [PDM/19/157]
  4. Research Foundation - Flanders (FWO) [G053420N, SBO S000722N]

向作者/读者索取更多资源

This article presents HPMPdb, a database containing detailed descriptions of human Single Amino acid Variants (SAVs) and their effects on protein molecular phenotypes. The database allows researchers to go beyond the existing Deleterious/Neutral prediction paradigm and build molecular phenotype predictors. Necessary means for training and testing models on the database are provided.
Current human Single Amino acid Variants (SAVs) databases provide a link between a SAVs and their effect on the carrier individual phenotype, often dividing them into Deleterious/Neutral variants. This is a very coarse-grained description of the genotype-to-phenotype relationship because it relies on un-realistic assumptions such as the perfect Mendelian behavior of each SAV and considers only dichotomic phenotypes. Moreover, the link between the effect of a SAV on a protein (its molecular phenotype) and the individual phenotype is often very complex, because multiple level of biological abstraction connect the protein and individual level phenotypes. Here we present HPMPdb, a manually curated database containing human SAVs associated with the detailed description of the molecular phenotype they cause on the affected proteins. With particular regards to machine learning (ML), this database can be used to let researchers go beyond the existing Deleterious/Neutral prediction paradigm, allowing them to build molecular phenotype predictors instead. Our class labels describe in a succinct way the effects that each SAV has on 15 protein molecular phenotypes, such as protein-protein interaction, small molecules binding, function, post-translational modifications (PTMs), sub-cellular localization, mimetic PTM, folding and protein expression. Moreover, we provide researchers with all necessary means to re-producibly train and test their models on our database. The webserver and the data described in this paper are available at hpmp.esat.kuleuven.be.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据