A workflow for deriving chemical entities from crystallographic data and its application to the Crystallography Open Database

Abstract Knowledge about the 3-dimensional structure, orientation and interaction of chemical compounds is important in many areas of science and technology. X-ray crystallography is one of the experimental techniques capable of providing a large amount of structural information for a given compound, and it is widely used for characterisation of organic and metal-organic molecules. The method provides precise 3D coordinates of atoms inside crystals, however, it does not directly deliver information about certain chemical characteristics such as bond orders, delocalization, charges, lone electron pairs or lone electrons. These aspects of a molecular model have to be derived from crystallographic data using refined information about interatomic distances and atom types as well as employing general chemical knowledge. This publication describes a curated automatic pipeline for the derivation of chemical attributes of molecules from crystallographic models. The method is applied to build a catalogue of chemical entities in an open-access crystallographic database, the Crystallography Open Database (COD). The catalogue of such chemical entities is provided openly as a derived database. The content of this catalogue and the problems arising in the fully automated pipeline are discussed, along with the possibilities to introduce manual data curation into the process..

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:15

Enthalten in:

Journal of cheminformatics - 15(2023), 1 vom: 19. Dez.

Sprache:

Englisch

Beteiligte Personen:

Vaitkus, Antanas [VerfasserIn]
Merkys, Andrius [VerfasserIn]
Sander, Thomas [VerfasserIn]
Quirós, Miguel [VerfasserIn]
Thiessen, Paul A. [VerfasserIn]
Bolton, Evan E. [VerfasserIn]
Gražulis, Saulius [VerfasserIn]

Links:

Volltext [kostenfrei]

Themen:

Chemical structure assignment
Crystallography Open Database
Molecular perception
PubChem

Anmerkungen:

© This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2023

doi:

10.1186/s13321-023-00780-2

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

SPR054137470