An attention-based hybrid deep neural networks for accurate identification of transcription factor binding sites

Abstract Transcription factors (TF) control gene expression by binding to specific regions of DNA sequence. TF play an important role in various disease processes, and their identification helps in understanding underlying gene regulation leading to disease risk. Currently, the most powerful models used for the predicting binding sites between TF and DNA sequence from ChIP-Seq dataset are lagging in terms of good feature extraction capabilities. We propose two models named PCLAtt and TranAtt for the prediction of 690 TF-cell line pairs from DNA sequence data. PCLAtt consists of two sets of convolutional neural networks—bidirectional long short-term memory (CNN-BiLSTM) layers in parallel followed by a multi-head attention layer and weight-shared dense layer which all contribute towards extracting efficient features from DNA sequence. TranAtt consists of convolution layers of a pre-trained model along with a BiLSTM layer and attention layer. The convolutional layers of the model act as a motif scanner and the BiLSTM layer learns the regulatory grammar of the motifs. Further, the attention mechanism is applied to give more importance to those sequence regions of DNA that consist of transcription factor binding motifs thus resulting in better performance of the proposed models. PCLAtt outperformed other state-of-the-art methods like DeepSEA, DanQ, TBiNet and DeepATT in prediction of binding sites between TF and the DNA sequence..

Medienart:

E-Artikel

Erscheinungsjahr:

2022

Erschienen:

2022

Enthalten in:

Zur Gesamtaufnahme - volume:34

Enthalten in:

Neural computing & applications - 34(2022), 21 vom: 29. Juni, Seite 19051-19060

Sprache:

Englisch

Beteiligte Personen:

Bhukya, Raju [VerfasserIn]
Kumari, Archana [VerfasserIn]
Dasari, Chandra Mohan [VerfasserIn]
Amilpur, Santhosh [VerfasserIn]

Links:

Volltext [lizenzpflichtig]

Themen:

Convolution neural network-bidirectional long short-term memory
Multi-head attention
Transcription factors
Weight-shared dense

Anmerkungen:

© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022

doi:

10.1007/s00521-022-07502-z

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

OLC2132439990