Article

XSMILES: interactive visualization for molecules, SMILES and XAI attribution scores

Journal

JOURNAL OF CHEMINFORMATICS
Volume 15, Issue 1

Publisher

BMC
DOI: 10.1186/s13321-022-00673-w

Keywords

SMILES; Molecule; Explainable artificial intelligence; Visualization; Artificial intelligence; Contribution; Attribution


Explainable artificial intelligence (XAI) methods are increasingly applicable in chemistry, and visualization techniques can highlight the influence of molecule regions on predicted properties. However, some XAI techniques face challenges in representing attribution scores for non-atom tokens in SMILES strings. The proposed XSMILES tool provides an interactive visualization technique to address this issue and support the interpretation of SMILES.
Background: Explainable artificial intelligence (XAI) methods have shown increasing applicability in chemistry. In this context, visualization techniques can highlight regions of a molecule to reveal their influence over a predicted property. For this purpose, some XAI techniques calculate attribution scores associated with tokens of SMILES strings or with atoms of a molecule. While a score associated with an atom can be represented directly on a molecule diagram, scores computed for non-atom SMILES tokens cannot. For instance, the substring [N+] contains 3 non-atom tokens, i.e., [, +, and ], and their attributions, depending on the model, do not necessarily reveal an influence of the nitrogen atom over the predicted property; for that reason, these scores cannot be represented on a molecule diagram. Moreover, SMILES notation is complex, which highlights the need for techniques that facilitate the analysis of explanations associated with its tokens.

Results: We propose XSMILES, an interactive visualization technique, to explore explainable artificial intelligence attribution scores and support the interpretation of SMILES. Users can input any type of score attributed to atom and non-atom tokens and visualize them on top of a 2D molecule diagram coordinated with a bar chart that represents a SMILES string. Through two use cases, we demonstrate how interactivity helps users evaluate and better interpret attributions calculated for SMILES strings.

Conclusions: Data scientists can use XSMILES to understand their models' behavior and compare multiple modeling approaches. The tool provides a set of parameters to adapt the visualization to users' needs, and it can be integrated into different platforms. We believe XSMILES can help data scientists develop, improve, and communicate their models by making it easier to identify patterns and compare attributions through interactive exploratory visualization.
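To make the distinction between atom and non-atom tokens concrete, the following is a minimal Python sketch of a character-level SMILES tokenizer. It is not the tokenizer used by XSMILES or by any particular model; the names `tokenize` and `atom_scores` are hypothetical, and the rule that every letter is an atom (with only Cl and Br treated as two-letter symbols) is a simplifying assumption.

```python
import re

# Simplifying assumption: match the two-letter halogens first,
# then fall back to one character per token.
TOKEN_PATTERN = re.compile(r"Cl|Br|.")


def tokenize(smiles):
    """Split a SMILES string into (token, is_atom) pairs.

    A token is flagged as an atom when it starts with a letter;
    brackets, charges, bond symbols, digits, and parentheses are non-atom.
    """
    return [(tok, tok[0].isalpha()) for tok in TOKEN_PATTERN.findall(smiles)]


def atom_scores(smiles, scores):
    """Keep only the scores that belong to atom tokens.

    These are the scores that can be drawn directly on a molecule diagram;
    scores of non-atom tokens have no atom to attach to.
    """
    tokens = tokenize(smiles)
    if len(tokens) != len(scores):
        raise ValueError("expected exactly one score per token")
    return [(tok, s) for (tok, is_atom), s in zip(tokens, scores) if is_atom]


# The substring from the abstract: one atom token and three non-atom tokens.
non_atom = [tok for tok, is_atom in tokenize("[N+]") if not is_atom]
```

Under these assumptions, `non_atom` holds the three tokens [, +, and ], and `atom_scores("[N+]", [0.1, 0.5, -0.2, 0.0])` keeps only the score for N. The remaining three scores are exactly the ones that cannot be placed on a molecule diagram and that a per-token view, such as a bar chart over the SMILES string, has to carry.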
