3.8 Review

Recent advances in the self-referencing embedded strings (SELFIES) library

Related references

Note: Only part of the references are listed.
Article Chemistry, Multidisciplinary

Group SELFIES: a robust fragment-based molecular string representation

Austin H. Cheng et al.

Summary: This article introduces Group SELFIES, a molecular string representation that utilizes group tokens to represent functional groups or entire substructures while ensuring chemical robustness. The advantages of capturing chemical motifs and flexibility are demonstrated in experiments, showing improvement in distribution learning of common molecular datasets. The article also highlights the improvement in the quality of generated molecules compared to regular SELFIES strings through random sampling of Group SELFIES strings.

DIGITAL DISCOVERY (2023)

Article Chemistry, Multidisciplinary

Model agnostic generation of counterfactual explanations for molecules

Geemi P. Wellawatte et al.

Summary: A major challenge in deep learning in chemistry is the lack of interpretability, which hinders the deployment of AI models. Counterfactuals provide a rationale for model predictions and insights into chemical structures. This work demonstrates a universal model-agnostic approach that can explain predictions of any black-box model.

CHEMICAL SCIENCE (2022)

Article Computer Science, Artificial Intelligence

SELFIES and the future of molecular string representations

Mario Krenn et al.

Summary: Artificial intelligence and machine learning have gained popularity in the field of chemistry and materials science, requiring a fluent chemical language. The traditional molecular string representation, SMILES, has limitations, but the introduction of SELFIES in 2020 has solved these issues and enabled new applications in chemistry. Looking ahead, 16 future projects for robust molecular representations are proposed.

PATTERNS (2022)

Article Chemistry, Multidisciplinary

STOUT: SMILES to IUPAC names using neural machine translation

Kohulan Rajan et al.

Summary: STOUT is a tool for compound naming using deep-learning neural machine translation, capable of generating IUPAC names from SMILES strings and performing reverse translation. The system achieves high accuracy, with even incorrect predictions showing high similarity to the true compounds.

JOURNAL OF CHEMINFORMATICS (2021)

Article Chemistry, Multidisciplinary

Beyond generative models: superfast traversal, optimization, novelty, exploration and discovery (STONED) algorithm for molecules using SELFIES

AkshatKumar Nigam et al.

Summary: The study introduces a new algorithm called STONED, which achieves performance comparable to deep generative models in the chemical space through interpolation and exploration without the need for large amounts of data and training time.

CHEMICAL SCIENCE (2021)

Article Chemistry, Multidisciplinary

DECIMER: towards deep learning for chemical image recognition

Kohulan Rajan et al.

JOURNAL OF CHEMINFORMATICS (2020)

Article Chemistry, Multidisciplinary

Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules

Rafael Gomez-Bombarelli et al.

ACS CENTRAL SCIENCE (2018)

Review Multidisciplinary Sciences

Inverse molecular design using machine learning: Generative models for matter engineering

Benjamin Sanchez-Lengeling et al.

SCIENCE (2018)

Article Multidisciplinary Sciences

Deep reinforcement learning for de novo drug design

Mariya Popova et al.

SCIENCE ADVANCES (2018)

Article Chemistry, Multidisciplinary

Molecular de-novo design through deep reinforcement learning

Marcus Olivecrona et al.

JOURNAL OF CHEMINFORMATICS (2017)

Article Chemistry, Multidisciplinary

Enhanced CACTVS browser of the open NCI database

WD Ihlenfeldt et al.

JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES (2002)

Article Chemistry, Multidisciplinary

Comparison of the NCI open database with seven large chemical structural databases

JH Voigt et al.

JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES (2001)