4.5 Article

Clustering files of chemical structures using the Szekely-Rizzo generalization of Ward's method

Journal

JOURNAL OF MOLECULAR GRAPHICS & MODELLING
Volume 28, Issue 2, Pages 187-195

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.jmgm.2009.06.006

Keywords

Clustering method; Distance coefficient; Energy clustering; Fingerprint; Fragment substructure; Joint between-within distance; Minimum variance clustering method; Soergel coefficient; Szekely-Rizzo clustering method; Ward's clustering method

Funding

  1. Conseil Regional de Basse Normandie

Ask authors/readers for more resources

Ward's method is extensively used for clustering chemical structures represented by 2D fingerprints. This paper compares Ward clusterings of 14 datasets (containing between 278 and 4332 molecules)with those obtained using the Szekely-Rizzo clustering method, a generalization of Ward's method. The clusters resulting from these two methods were evaluated by the extent to which the various classifications were able to group active molecules together, using a novel criterion of clustering effectiveness. Analysis of a total of 1400 classifications (Ward and Szekely-Rizzo clustering methods, 14 different datasets, 5 different fingerprints and 10 different distance coefficients) demonstrated the general superiority of the Szekely-Rizzo method. The distance coefficient first described by Soergel performed extremely well in these experiments, and this was also the case when it was used in simulated virtual screening experiments. (C) 2009 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available