Graph isomorphism-based algorithm for cross-checking chemical and crystallographic descriptions

Abstract Published reports of chemical compounds often contain multiple machine-readable descriptions which may supplement each other in order to yield coherent and complete chemical representations. This publication presents a method to cross-check such descriptions using a canonical representation and isomorphism of molecular graphs. If immediate agreement between compound descriptions is not found, the algorithm derives the minimal set of simplifications required for both descriptions to arrive to a matching form (if any). The proposed algorithm is used to cross-check chemical descriptions from the Crystallography Open Database to identify coherently described entries as well as those requiring further curation..

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:15

Enthalten in:

Journal of cheminformatics - 15(2023), 1 vom: 23. Feb.

Sprache:

Englisch

Beteiligte Personen:

Merkys, Andrius [VerfasserIn]
Vaitkus, Antanas [VerfasserIn]
Grybauskas, Algirdas [VerfasserIn]
Konovalovas, Aleksandras [VerfasserIn]
Quirós, Miguel [VerfasserIn]
Gražulis, Saulius [VerfasserIn]

Links:

Volltext [kostenfrei]

Themen:

Crystallography Open Database
Graph isomorphism
Molecular graphs
SMILES

Anmerkungen:

© The Author(s) 2023

doi:

10.1186/s13321-023-00692-1

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

SPR049438158