Journal
BIOINFORMATICS
Volume 38, Issue 9, Pages 2633-2635Publisher
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac121
Keywords
-
Categories
Funding
- National Science Centre, Poland [2017/27/N/NZ1/00716]
- First TEAM program of the Foundation for Polish Science
- European Union under the European Regional Development Fund [POIR.04.04.00-00-5CF1/18-00]
Ask authors/readers for more resources
This study developed localpdb, a versatile Python library for managing protein structures and annotations. It features a flexible plugin system for unifying structural data with diverse auxiliary resources, suitable for bioinformatic tasks, especially large-scale protein structural analysis and machine learning.
Motivation: The wealth of protein structures collected in the Protein Data Bank enabled large-scale studies of their function and evolution. Such studies, however, require the generation of customized datasets combining the structural data with miscellaneous accessory resources providing functional, taxonomic and other annotations. Unfortunately, the functionality of currently available tools for the creation of such datasets is limited and their usage frequently requires laborious surveying of various data sources and resolving inconsistencies between their versions. Results: To address this problem, we developed localpdb, a versatile Python library for the management of protein structures and their annotations. The library features a flexible plugin system enabling seamless unification of the structural data with diverse auxiliary resources, full version control and powerful functionality of creating highly customized datasets. The localpdb can be used in a wide range of bioinformatic tasks, in particular those involving large-scale protein structural analyses and machine learning.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available