4.6 Article

Gene Association Classification for Autism Spectrum Disorder: Leveraging Gene Embedding and Differential Gene Expression Profiles to Identify Disease-Related Genes

Journal

APPLIED SCIENCES-BASEL
Volume 13, Issue 15, Pages -

Publisher

MDPI
DOI: 10.3390/app13158980

Keywords

gene associations; embeddings; network analysis; machine learning

Ask authors/readers for more resources

Through the integration of multiple data sources, we conducted a systematic analysis to reveal genes associated with Autism Spectrum Disorder (ASD). We generated a gene embedding profile and a differential gene expression profile, and utilized the XGBoost classifier to identify potential gene-gene associations and candidate genes. Our analysis identified 10,848 potential gene-gene associations and inferred 125 candidate genes, with DNA Topoisomerase I, ATP Synthase F1 Subunit Gamma, and Neuronal Calcium Sensor 1 being the top candidates. We also assessed the relevance of candidate genes to specific functions and pathways and identified sub-networks within the candidate network. Overall, our analysis contributes to ASD research and facilitates the discovery of new targets for molecularly targeted therapies.
Identifying genes associated with autism spectrum disorder (ASD) is crucial for understanding the underlying mechanisms of the disorder. However, ASD is a complex condition involving multiple mechanisms, and this has resulted in an unclear understanding of the disease and a lack of precise knowledge concerning the genes associated with ASD. To address these challenges, we conducted a systematic analysis that integrated multiple data sources, including associations among ASD-associated genes and gene expression data from ASD studies. With these data, we generated both a gene embedding profile that captured the complex relationships between genes and a differential gene expression profile (built from the gene expression data). We utilized the XGBoost classifier and leveraged these profiles to identify novel ASD associations. This approach revealed 10,848 potential gene-gene associations and inferred 125 candidate genes, with DNA Topoisomerase I, ATP Synthase F1 Subunit Gamma, and Neuronal Calcium Sensor 1 being the top three candidates. We conducted a statistical analysis to assess the relevance of candidate genes to specific functions and pathways. Additionally, we identified sub-networks within the candidate network to uncover sub-groups of associations that could facilitate the identification of potential ASD-related genes. Overall, our systematic analysis, which integrated multiple data sources, represents a significant step towards unraveling the complexities of ASD. By combining network-based gene associations, gene expression data, and machine learning, we contribute to ASD research and facilitate the discovery of new targets for molecularly targeted therapies.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available