Graph isomorphism-based algorithm for cross-checking chemical and crystallographic descriptions
Abstract Published reports of chemical compounds often contain multiple machine-readable descriptions which may supplement each other in order to yield coherent and complete chemical representations. This publication presents a method to cross-check such descriptions using a canonical representation and isomorphism of molecular graphs. If immediate agreement between compound descriptions is not found, the algorithm derives the minimal set of simplifications required for both descriptions to arrive to a matching form (if any). The proposed algorithm is used to cross-check chemical descriptions from the Crystallography Open Database to identify coherently described entries as well as those requiring further curation..
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:15 |
---|---|
Enthalten in: |
Journal of cheminformatics - 15(2023), 1 vom: 23. Feb. |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Merkys, Andrius [VerfasserIn] |
---|
Links: |
Volltext [kostenfrei] |
---|
Themen: |
Crystallography Open Database |
---|
Anmerkungen: |
© The Author(s) 2023 |
---|
doi: |
10.1186/s13321-023-00692-1 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
SPR049438158 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | SPR049438158 | ||
003 | DE-627 | ||
005 | 20230510063846.0 | ||
007 | cr uuu---uuuuu | ||
008 | 230227s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1186/s13321-023-00692-1 |2 doi | |
035 | |a (DE-627)SPR049438158 | ||
035 | |a (SPR)s13321-023-00692-1-e | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Merkys, Andrius |e verfasserin |0 (orcid)0000-0002-7731-6236 |4 aut | |
245 | 1 | 0 | |a Graph isomorphism-based algorithm for cross-checking chemical and crystallographic descriptions |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a © The Author(s) 2023 | ||
520 | |a Abstract Published reports of chemical compounds often contain multiple machine-readable descriptions which may supplement each other in order to yield coherent and complete chemical representations. This publication presents a method to cross-check such descriptions using a canonical representation and isomorphism of molecular graphs. If immediate agreement between compound descriptions is not found, the algorithm derives the minimal set of simplifications required for both descriptions to arrive to a matching form (if any). The proposed algorithm is used to cross-check chemical descriptions from the Crystallography Open Database to identify coherently described entries as well as those requiring further curation. | ||
650 | 4 | |a Molecular graphs |7 (dpeaa)DE-He213 | |
650 | 4 | |a Graph isomorphism |7 (dpeaa)DE-He213 | |
650 | 4 | |a SMILES |7 (dpeaa)DE-He213 | |
650 | 4 | |a Crystallography Open Database |7 (dpeaa)DE-He213 | |
700 | 1 | |a Vaitkus, Antanas |0 (orcid)0000-0002-5944-1391 |4 aut | |
700 | 1 | |a Grybauskas, Algirdas |0 (orcid)0000-0003-3391-9016 |4 aut | |
700 | 1 | |a Konovalovas, Aleksandras |4 aut | |
700 | 1 | |a Quirós, Miguel |0 (orcid)0000-0002-1583-4468 |4 aut | |
700 | 1 | |a Gražulis, Saulius |0 (orcid)0000-0002-7928-5218 |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of cheminformatics |d London : BioMed Central, 2009 |g 15(2023), 1 vom: 23. Feb. |w (DE-627)SPR031335551 |w (DE-600)2486539-4 |x 1758-2946 |7 nnns |
773 | 1 | 8 | |g volume:15 |g year:2023 |g number:1 |g day:23 |g month:02 |
856 | 4 | 0 | |u https://dx.doi.org/10.1186/s13321-023-00692-1 |z kostenfrei |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_SPRINGER | ||
951 | |a AR | ||
952 | |d 15 |j 2023 |e 1 |b 23 |c 02 |