4.5 Article

Large scale genotype- and phenotype-driven machine learning in Von Hippel-Lindau disease

期刊

HUMAN MUTATION
卷 43, 期 9, 页码 1268-1285

出版社

WILEY-HINDAWI
DOI: 10.1002/humu.24392

关键词

CIViC; genotype-phenotype; machine learning; spectral clustering; Von Hippel-Lindau

资金

  1. Cancer Moonshot and Childhood Cancer Data Initiative
  2. National Human Genome Research Institute [R00HG007940]
  3. National Center for Advancing Translational Sciences [UL1TR002345]
  4. National Cancer Institute [U01CA209936, U24CA237719]
  5. Starbucks Clinical Genetics/Genomics Research Studentship Award
  6. VHL Alliance
  7. National Institutes of Health [K22CA188163]
  8. Children's Discovery Institute

向作者/读者索取更多资源

The study aims to collect information on VHL disease by screening articles and analyzing genotype-phenotype data of VHL patients. Relationship trends and existing associations were identified, which can help identify new patterns and associations in VHL disease.
Von Hippel-Lindau (VHL) disease is a hereditary cancer syndrome where individuals are predisposed to tumor development in the brain, adrenal gland, kidney, and other organs. It is caused by pathogenic variants in the VHL tumor suppressor gene. Standardized disease information has been difficult to collect due to the rarity and diversity of VHL patients. Over 4100 unique articles published until October 2019 were screened for germline genotype-phenotype data. Patient data were translated into standardized descriptions using Human Genome Variation Society gene variant nomenclature and Human Phenotype Ontology terms and has been manually curated into an open-access knowledgebase called Clinical Interpretation of Variants in Cancer. In total, 634 unique VHL variants, 2882 patients, and 1991 families from 427 papers were captured. We identified relationship trends between phenotype and genotype data using classic statistical methods and spectral clustering unsupervised learning. Our analyses reveal earlier onset of pheochromocytoma/paraganglioma and retinal angiomas, phenotype co-occurrences and genotype-phenotype correlations including hotspots. It confirms existing VHL associations and can be used to identify new patterns and associations in VHL disease. Our database serves as an aggregate knowledge translation tool to facilitate sharing information about the pathogenicity of VHL variants.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据