4.6 Article

A workflow for deriving chemical entities from crystallographic data and its application to the Crystallography Open Database

Journal

JOURNAL OF CHEMINFORMATICS
Volume 15, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s13321-023-00780-2

Keywords

Crystallography Open Database; PubChem; Molecular perception; Chemical structure assignment

Ask authors/readers for more resources

Knowledge of the 3-dimensional structure, orientation, and interaction of chemical compounds is crucial in many fields. This publication presents an automatic pipeline to derive chemical attributes from crystallographic models and applies it to build a catalog in an open-access crystallographic database.
Knowledge about the 3-dimensional structure, orientation and interaction of chemical compounds is important in many areas of science and technology. X-ray crystallography is one of the experimental techniques capable of providing a large amount of structural information for a given compound, and it is widely used for characterisation of organic and metal-organic molecules. The method provides precise 3D coordinates of atoms inside crystals, however, it does not directly deliver information about certain chemical characteristics such as bond orders, delocalization, charges, lone electron pairs or lone electrons. These aspects of a molecular model have to be derived from crystallographic data using refined information about interatomic distances and atom types as well as employing general chemical knowledge. This publication describes a curated automatic pipeline for the derivation of chemical attributes of molecules from crystallographic models. The method is applied to build a catalogue of chemical entities in an open-access crystallographic database, the Crystallography Open Database (COD). The catalogue of such chemical entities is provided openly as a derived database. The content of this catalogue and the problems arising in the fully automated pipeline are discussed, along with the possibilities to introduce manual data curation into the process.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available