NanoBERTa-ASP : predicting nanobody paratope based on a pretrained RoBERTa model

© 2024. The Author(s)..

BACKGROUND: Nanobodies, also known as VHH or single-domain antibodies, are unique antibody fragments derived solely from heavy chains. They offer advantages of small molecules and conventional antibodies, making them promising therapeutics. The paratope is the specific region on an antibody that binds to an antigen. Paratope prediction involves the identification and characterization of the antigen-binding site on an antibody. This process is crucial for understanding the specificity and affinity of antibody-antigen interactions. Various computational methods and experimental approaches have been developed to predict and analyze paratopes, contributing to advancements in antibody engineering, drug development, and immunotherapy. However, existing predictive models trained on traditional antibodies may not be suitable for nanobodies. Additionally, the limited availability of nanobody datasets poses challenges in constructing accurate models.

METHODS: To address these challenges, we have developed a novel nanobody prediction model, named NanoBERTa-ASP (Antibody Specificity Prediction), which is specifically designed for predicting nanobody-antigen binding sites. The model adopts a training strategy more suitable for nanobodies, based on an advanced natural language processing (NLP) model called BERT (Bidirectional Encoder Representations from Transformers). To be more specific, the model utilizes a masked language modeling approach named RoBERTa (Robustly Optimized BERT Pretraining Approach) to learn the contextual information of the nanobody sequence and predict its binding site.

RESULTS: NanoBERTa-ASP achieved exceptional performance in predicting nanobody binding sites, outperforming existing methods, indicating its proficiency in capturing sequence information specific to nanobodies and accurately identifying their binding sites. Furthermore, NanoBERTa-ASP provides insights into the interaction mechanisms between nanobodies and antigens, contributing to a better understanding of nanobodies and facilitating the design and development of nanobodies with therapeutic potential.

CONCLUSION: NanoBERTa-ASP represents a significant advancement in nanobody paratope prediction. Its superior performance highlights the potential of deep learning approaches in nanobody research. By leveraging the increasing volume of nanobody data, NanoBERTa-ASP can further refine its predictions, enhance its performance, and contribute to the development of novel nanobody-based therapeutics. Github repository: https://github.com/WangLabforComputationalBiology/NanoBERTa-ASP.

Medienart:

E-Artikel

Erscheinungsjahr:

2024

Erschienen:

2024

Enthalten in:

Zur Gesamtaufnahme - volume:25

Enthalten in:

BMC bioinformatics - 25(2024), 1 vom: 21. März, Seite 122

Sprache:

Englisch

Beteiligte Personen:

Li, Shangru [VerfasserIn]
Meng, Xiangpeng [VerfasserIn]
Li, Rui [VerfasserIn]
Huang, Bingding [VerfasserIn]
Wang, Xin [VerfasserIn]

Links:

Volltext

Themen:

Antibodies
Antibody engineering
Journal Article
Nanobodies
Prediction of binding sites
Pretrained model
RoBERTa
Single-Domain Antibodies
Transfer learning
Transformers

Anmerkungen:

Date Completed 25.03.2024

Date Revised 25.03.2024

published: Electronic

Citation Status MEDLINE

doi:

10.1186/s12859-024-05750-5

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM370046293