☆ 4.7 Article

ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2023)

期刊

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL

卷 21, 期 -, 页码 4743-4758

出版社

ELSEVIER

DOI: 10.1016/j.csbj.2023.09.017

关键词

Single amino acid variations; Biomolecular representations; Protein sequence annotations; Protein domains; Bioinformatics tools/models; Variant effect prediction; Machine learning

类别

Biochemistry & Molecular Biology Biotechnology & Applied Microbiology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this study, a method called ASCARIS is proposed for the quantitative representation of single amino acid variations (SAVs) to predict their functional effects or build integrative models. ASCARIS utilizes positional feature annotations, structural features, and physicochemical properties to construct reusable numerical representations of SAVs. Experimental results show that ASCARIS has competitive and complementary performance in variant effect prediction.

Background: Genomic variations may cause deleterious effects on protein functionality and perturb biological processes. Elucidating the effects of variations is critical for developing novel treatment strategies for diseases of genetic origin. Computational approaches have been aiding the work in this field by modeling and analyzing the mutational landscape. However, new approaches are required, especially for accurate representation and datacentric analysis of sequence variations. Method: In this study, we propose ASCARIS (Annotation and StruCture-bAsed RepresentatIon of Single amino acid variations), a method for the featurization (i.e., quantitative representation) of single amino acid variations (SAVs), which could be used for a variety of purposes, such as predicting their functional effects or building multi-omics-based integrative models. ASCARIS utilizes the direct and spatial correspondence between the location of the SAV on the sequence/structure and 30 different types of positional feature annotations (e.g., active/lipidation/glycosylation sites; calcium/metal/DNA binding, inter/transmembrane regions, etc.), along with structural features and physicochemical properties. The main novelty of this method lies in constructing reusable numerical representations of SAVs via functional annotations. Results: We statistically analyzed the relationship between these features and the consequences of variations and found that each carries information in this regard. To investigate potential applications of ASCARIS, we trained variant effect prediction models that utilize our SAV representations as input. We carried out an ablation study and a comparison against the state-of-the-art methods and observed that ASCARIS has a competing and complementary performance against widely-used predictors. ASCARIS can be used alone or in combination with other approaches to represent SAVs from a functional perspective. ASCARIS is available as a programmatic tool at https://github.com/HUBioDataLab/ASCARIS and as a web-service at https://huggingface.co/spaces/HUBioDataLab/ASCARIS.

ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations

期刊

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations

期刊

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文