HSMotifDiscover : identification of motifs in sequences composed of non-single-letter elements

© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissionsoup.com..

SUMMARY: The functional sub-string(s) of a biopolymer sequence defines the specificity of its interaction with other biomolecules and is often referred to as motifs. Computational algorithms and software have been broadly developed for finding such motifs in sequences in which the individual elements are single characters, such as those in DNA and protein sequences. However, there are more complex scenarios where the motifs exist in non-single-letter contexts, e.g. preferred patterns of chemical modifications on proteins, DNAs, RNAs or polysaccharides. To search for those motifs, we describe a new method that converts the modified sequence elements to representative single-letter codes and then uses a modified Gibbs-sampling algorithm to define the position specific scoring matrix representing the motif(s). As a proof of principle, we describe the implementation and application of an R package for discovering heparan sulfate (HS) motifs in glycan sequences, which are important in regulating protein-protein interactions. This software can be valuable for analyzing high-throughput glycoprotein binding data using microarrays with HS oligosaccharides or other biological polymers.

AVAILABILITY AND IMPLEMENTATION: HSMotifDiscover is freely available as an open source R package released under an MIT license at https://github.com/bioinfoDZ/HSMotifDiscover and also available in the form of an app at https://hsmotifdiscover.shinyapps.io/HSMotifDiscover_ShinyApp/.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Medienart:

E-Artikel

Erscheinungsjahr:

2022

Erschienen:

2022

Enthalten in:

Zur Gesamtaufnahme - volume:38

Enthalten in:

Bioinformatics (Oxford, England) - 38(2022), 16 vom: 10. Aug., Seite 4036-4038

Sprache:

Englisch

Beteiligte Personen:

Singh, Vinod Kumar [VerfasserIn]
Misra, Rohan [VerfasserIn]
Almo, Steven C [VerfasserIn]
Steidl, Ulrich G [VerfasserIn]
Bülow, Hannes E [VerfasserIn]
Zheng, Deyou [VerfasserIn]

Links:

Volltext

Themen:

9007-49-2
DNA
Journal Article
Proteins
Research Support, N.I.H., Extramural

Anmerkungen:

Date Completed 14.11.2022

Date Revised 02.07.2023

published: Print

Citation Status MEDLINE

doi:

10.1093/bioinformatics/btac437

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM342924699