findMySequence : a neural-network-based approach for identification of unknown proteins in X-ray crystallography and cryo-EM

© Grzegorz Chojnowski et al. 2022..

Although experimental protein-structure determination usually targets known proteins, chains of unknown sequence are often encountered. They can be purified from natural sources, appear as an unexpected fragment of a well characterized protein or appear as a contaminant. Regardless of the source of the problem, the unknown protein always requires characterization. Here, an automated pipeline is presented for the identification of protein sequences from cryo-EM reconstructions and crystallographic data. The method's application to characterize the crystal structure of an unknown protein purified from a snake venom is presented. It is also shown that the approach can be successfully applied to the identification of protein sequences and validation of sequence assignments in cryo-EM protein structures.

Medienart:

E-Artikel

Erscheinungsjahr:

2022

Erschienen:

2022

Enthalten in:

Zur Gesamtaufnahme - volume:9

Enthalten in:

IUCrJ - 9(2022), Pt 1 vom: 01. Jan., Seite 86-97

Sprache:

Englisch

Beteiligte Personen:

Chojnowski, Grzegorz [VerfasserIn]
Simpkin, Adam J [VerfasserIn]
Leonardo, Diego A [VerfasserIn]
Seifert-Davila, Wolfram [VerfasserIn]
Vivas-Ruiz, Dan E [VerfasserIn]
Keegan, Ronan M [VerfasserIn]
Rigden, Daniel J [VerfasserIn]

Links:

Volltext

Themen:

Bioinformatics
Cryo-EM
FindMySequence
Journal Article
Neural networks
Protein sequences
Protein structures
SIMBAD
Structure determination

Anmerkungen:

Date Revised 09.04.2022

published: Electronic-eCollection

Citation Status PubMed-not-MEDLINE

doi:

10.1107/S2052252521011088

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM335938418