Please use this identifier to cite or link to this item: http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/9837
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorLaha, Arnab K.-
dc.contributor.authorKARAMPURI, YASH-
dc.date.accessioned2025-05-14T04:15:45Z-
dc.date.available2025-05-14T04:15:45Z-
dc.date.issued2025-05-
dc.identifier.citation125en_US
dc.identifier.urihttp://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/9837-
dc.descriptionAll Python scripts used throughout this research, including those related to the supplementary studies, are available in the following GitHub repository: https://github.com/YashKarampuri/MSThesis-Supplementary.en_US
dc.description.abstractThis study explores several key concepts in graph-based learning and applies them to the problem of ligand-binding pocket prediction and clustering on protein surfaces. First, we investigate graph embedding techniques, scalable feature learning with algorithms like Node2vec, and graph representation learning methods. We then explore neighborhood reconstruction methods and how multi-relational data and knowledge graphs can be used, building a solid foundation for applying graph-based techniques to biological data. Next, we focus on the problem of predicting and clustering ligand-binding pockets on protein surfaces. Using a graph-based approach, we first generate a set of evenly spaced points on the protein’s Solvent Accessible Surface (SAS) with a fast algorithm from the CDK library. For each point, we calculate feature descriptors based on the local chemical environment, including properties of solvent-exposed atoms, distance-weighted properties of nearby atoms (within 6A), and other neighborhood features. These descriptors are used to predict ligandability scores through Graph Neural Networks (GNN) and Graph Convolutional Networks (GCN). Points with high ligandability scores are then clustered using single-linkage clustering with a 3A cut-off to form pocket predictions. The predicted pockets are ranked by their cumulative ligandability scores. This method provides an efficient framework for identifying potential ligand-binding pockets, contributing to drug discovery and protein-ligand interaction studies.en_US
dc.language.isoen_USen_US
dc.subjectGraph Neural Networks (GNNs)en_US
dc.subjectGraph Representation Learningen_US
dc.subjectProtein-Ligand Interactionen_US
dc.subjectStructure-Based Drug Designen_US
dc.subjectNode and Graph Embeddingsen_US
dc.titleGraph Representation Learning for Binding Pocket Prediction in Proteinsen_US
dc.title.alternativeGeometric Pre-Training of GNNs for Structure-Based Drug Designen_US
dc.typeThesisen_US
dc.description.embargoNo Embargoen_US
dc.type.degreeBS-MSen_US
dc.contributor.departmentDept. of Mathematicsen_US
dc.contributor.registration20201105en_US
Appears in Collections:MS THESES

Files in This Item:
File Description SizeFormat 
20201105_Karampuri Yash_MS_Thesis.pdfMS Thesis2.07 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.