Journal
FRONTIERS IN GENETICS
Volume 12, Issue -, Pages -Publisher
FRONTIERS MEDIA SA
DOI: 10.3389/fgene.2021.722198
Keywords
essential genes; networks; machine learning; interaction network; database
Categories
Funding
- Intel India
Ask authors/readers for more resources
Essential gene prediction models traditionally rely on sequence-based features, but this study showed the importance of network-based features for accurate predictions. The NetGenes database, containing essential gene predictions for over 2,700 bacteria, offers various features for each gene, including essentiality scores, annotations, and feature vectors.
Essential gene prediction models built so far are heavily reliant on sequence-based features, and the scope of network-based features has been narrow. Previous work from our group demonstrated the importance of using network-based features for predicting essential genes with high accuracy. Here, we apply our approach for the prediction of essential genes to organisms from the STRING database and host the results in a standalone website. Our database, NetGenes, contains essential gene predictions for 2,700+ bacteria predicted using features derived from STRING protein-protein functional association networks. Housing a total of over 2.1 million genes, NetGenes offers various features like essentiality scores, annotations, and feature vectors for each gene. NetGenes database is available from https://rbc-dsai-iitm.github.io/NetGenes/.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available