Digital Repository

Species-Specific Prediction of B Cell Epitopes

Show simple item record

dc.contributor.advisor Raghava, Gajendra
dc.contributor.author SAHNI, RUCHIR
dc.date.accessioned 2025-05-20T04:09:45Z
dc.date.available 2025-05-20T04:09:45Z
dc.date.issued 2025-05
dc.identifier.citation 65 en_US
dc.identifier.uri http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/10023
dc.description.abstract Antibody-Antigen interactions are fundamental in immunological research, influencing vaccines, diagnostics and therapeutic developments. However, existing computation models for predicting the antibody-interacting residues (epitopes) suffer from significant limitations, especially the lack of species specificity. These models are trained on data from diverse species and hence fail to account for the interspecies variation in antibody-antigen interactions. To address these limitations, this study presents a novel framework for species-specific B-Cell epitope prediction. We first establish the existence of interspecies differences in the interactions by comparing the amino acid composition of human-specific and mouse-specific epitopes, highlighting the need for species-specific models. Next, we develop HAIRpred (Human Antibody Interacting Residue predictor), a second-generation human-specific ensemble model based on random forests. HAIRpred integrates information from position-specific scoring matrices (PSSM) and relative solvent accessibility (RSA), which were identified as key features determining antibody-antigen interaction. The model achieves the area under the receiver operating characteristic curve (AUROC) score of 0.72 on the independent test dataset which surpasses the performance of the current state-of-the-art models. Additionally, we use the SHAP explainable AI (XAI) approach to analyse the contribution of residue position in the pattern in the eventual prediction. Furthermore, we recognize the need for mouse-specific prediction tools and we extend our methodology to develop a mouse-specific predictor. However, data scarcity is a significant challenge with only 290 available antibody-antigen complexes forming 94 clusters at 70% sequence identity threshold. To address this, we implement a cluster-based data splitting strategy, preventing overestimation of model performance. Our findings highlight the need for species-specific models and set a new standard for epitope prediction tools. By addressing the limitations of previous studies, we anticipate that these models will be a valuable resource in immunological research. en_US
dc.language.iso en en_US
dc.subject Research Subject Categories::NATURAL SCIENCES en_US
dc.subject Research Subject Categories::MATHEMATICS en_US
dc.title Species-Specific Prediction of B Cell Epitopes en_US
dc.type Thesis en_US
dc.description.embargo No Embargo en_US
dc.type.degree BS-MS en_US
dc.contributor.department Dept. of Data Science en_US
dc.contributor.registration 20201123 en_US


Files in this item

This item appears in the following Collection(s)

  • MS THESES [1980]
    Thesis submitted to IISER Pune in partial fulfilment of the requirements for the BS-MS Dual Degree Programme/MSc. Programme/MS-Exit Programme

Show simple item record

Search Repository


Advanced Search

Browse

My Account