A Searchable Map of PubChem

Journal

JOURNAL OF CHEMICAL INFORMATION AND MODELING
Volume 50, Issue 11, Pages 1924-1934

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/ci100237q

Funding

  1. University of Berne
  2. Swiss National Science Foundation
  3. Office Federal Suisse de l'Education et de la Science
  4. COST program Angiokem

The PubChem database was classified using 42 integer-valued descriptors of molecular structure, here called molecular quantum numbers (MQNs), which count atoms and bond types, polar groups, and topological features. Principal component analysis of the MQN data set shows that PubChem compounds occupy a partially filled elliptical cone in (PC1, PC2, PC3) space. The axis of the cone is the first principal component, PC1 (65% of the variability), which represents molecular size; the axes of the ellipse are PC2 (18% of the variability, representing structural flexibility) and PC3 (7% of the variability, representing polarity). Color-coded representations of the (PC2, PC3) plane provide a visual overview of PubChem. The MQNs form a scalar fingerprint that can be used to measure the similarity between pairs of molecules and that enables ligand-based virtual screening, as illustrated by the enrichment of bioactives of the DUD data set from PubChem. An MQN-annotated version of PubChem with an MQN-similarity search tool is available at www.gdb.unibe.ch.
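
The two operations named in the abstract, principal component analysis of integer descriptor vectors and similarity ranking on a scalar fingerprint, can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not name the distance measure, so the city-block (Manhattan) distance used here is an assumption. The function names are hypothetical, and the toy vectors have four made-up columns instead of the 42 MQNs.

```python
import numpy as np


def pca_explained_variance(X):
    """Return (scores, explained_variance_ratio) for the rows of X."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)  # center each descriptor column
    U, S, _ = np.linalg.svd(Xc, full_matrices=False)
    scores = U * S  # coordinates of each row along PC1, PC2, ...
    var = S ** 2
    total = var.sum()
    ratio = var / total if total > 0 else np.zeros_like(var)
    return scores, ratio


def city_block_distance(a, b):
    """Sum of absolute differences between two integer descriptor vectors."""
    return int(np.abs(np.asarray(a) - np.asarray(b)).sum())


def rank_by_similarity(query, library):
    """Sort library rows by increasing city-block distance to the query."""
    d = np.abs(np.asarray(library) - np.asarray(query)).sum(axis=1)
    order = np.argsort(d, kind="stable")  # nearest (most similar) first
    return order, d[order]


# Hypothetical toy descriptors (illustrative counts, not real MQN values).
library = np.array([
    [6, 0, 1, 1],
    [6, 0, 1, 0],
    [12, 2, 3, 1],
])
query = [6, 0, 1, 1]

order, distances = rank_by_similarity(query, library)
scores, ratio = pca_explained_variance(library)
print(order, distances)
print(ratio)
```

In a virtual-screening setting, the rows nearest to a known active would be examined first. The explained-variance ratios play the role of the percentages quoted for PC1, PC2, and PC3 above, although the toy data will not reproduce those values.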
