Please use this identifier to cite or link to this item: http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/10318
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSAHNI, RUCHIRen_US
dc.contributor.authorKumar, Nishanten_US
dc.contributor.authorRaghava, Gajendra P. S.en_US
dc.date.accessioned2025-07-25T05:22:59Z-
dc.date.available2025-07-25T05:22:59Z-
dc.date.issued2025-08en_US
dc.identifier.citationProtein Science, 34(08).en_US
dc.identifier.issn0961-8368en_US
dc.identifier.issn1469-896Xen_US
dc.identifier.urihttps://doi.org/10.1002/pro.70212en_US
dc.identifier.urihttp://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/10318-
dc.description.abstractIn the past, several methods have been developed for predicting conformational B-cell epitopes in antigens that are not specific to any host. Our primary analysis of antibody–antigen complexes indicated a need to develop host-specific B-cell epitopes. In this study, we present a novel approach to predict conformational B-cell epitopes specific to human hosts by focusing on human antibody interacting residues in antigens. We trained, tested, and evaluated our models on 277 complexes of human antibody–antigen complexes. Initially, we employed machine learning models based on the one hot encoding sequence profile of antigens, achieving a maximum area under the receiver operating characteristic curve (AUROC) of 0.61. The performance of the model improved significantly with the AUROC increasing from 0.61 to 0.67 when evolutionary profiles were used instead of one hot encoding profile. Models developed using embeddings from fine-tuned protein language models reached an AUROC of 0.61. Additionally, models utilizing predicted surface relative solvent accessibility achieved an AUROC of 0.67. Our ensemble model, which combined relative surface accessibility with evolutionary profiles, achieved the highest precision with an AUROC of 0.72. All models in this study were trained using fivefold cross-validation on a training dataset and evaluated on an independent dataset not used for training or validation. Our method outperforms existing approaches on the independent dataset. Furthermore, we used the SHAP eXplainable AI (XAI) method to interpret the importance of elements in features contributing to the predictions made by our models. To support the scientific community, we have developed a standalone software and web server, HAIRpred, for predicting human antibody interacting residues in proteinsen_US
dc.language.isoenen_US
dc.publisherWileyen_US
dc.subjectAntibody-antigen interactionen_US
dc.subjectAntibody interacting residuesen_US
dc.subjectB-cell epitopesen_US
dc.subjectMachine learningen_US
dc.subjectProtein language modelsen_US
dc.subject2025-JUL-WEEK4en_US
dc.subjectTOC-JUL-2025en_US
dc.subject2025en_US
dc.titleHAIRpred: Prediction of human antibody interacting residues in an antigen from its primary structureen_US
dc.typeArticleen_US
dc.contributor.departmentDept. of Data Scienceen_US
dc.identifier.sourcetitleProtein Scienceen_US
dc.publication.originofpublisherForeignen_US
Appears in Collections:JOURNAL ARTICLES

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.