4.2 Article

A new manner to use application of Shannon Entropy in similarity computation

Journal

JOURNAL OF MATHEMATICAL CHEMISTRY
Volume 49, Issue 10, Pages 2330-2344

Publisher

SPRINGER
DOI: 10.1007/s10910-011-9889-1

Keywords

Shannon Entropy; Tanimoto; Chemical similarity; Representative sample

Ask authors/readers for more resources

The paper proposes a new manner to use application of Shannon Entropy in similarity computation for any objects and any objects groups. In computation of the chemical similarity cases the proposed formulas use the values of one or more molecular descriptors, divided into classes (categories) by using a suitable criterion. The paper proposes original criteria to made difference between 'saturated', 'non-saturated non-aromatic' and 'aromatic' molecular fragments and between 'hydrogen-acceptor' and 'hydrogen-donor-acceptor' fragments for the purpose of classifying fragments into classes. According to the proposed formula two molecules A and B are similar enough if the value of Shannon Entropy of A + B aggregate is close to the value of Shannon Entropy of A molecule and close to the value of Shannon Entropy of B molecule. The proposed similarity formula can be used as statistical correlation index, useful if the number of values of two analyzed variables is unequal. The proposed formula is useful in the quantitative evaluation of the 'representative sample' character of any 'sample'. The paper presents the chemical similarity computation in Zimelidine/Fluoxtine/Chloramphenicol/Crufomate/Phoxim group. For comparison purposes, the paper also presents a Tanimoto coefficient calculation for the same molecules group. In addition, the paper presents two non-chemical examples regarding 'Representative Sample Index' calculation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available