Digital Repository

Design and application of scalable machine learning algorithms in molecular recognition, structure prediction and drug discovery

dc.contributor.advisor MUKHERJEE, ARNAB en_US
dc.contributor.author GUPTA, ABHIJIT en_US
dc.date.accessioned 2022-03-25T04:30:01Z
dc.date.available 2022-03-25T04:30:01Z
dc.date.issued 2021-08 en_US
dc.identifier.citation 159 en_US
dc.identifier.uri http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6626
dc.description.abstract Starting with the problem of structure prediction, we leveraged machine learning to accurately predict DNA conformation from its sequence. We developed an end-to-end data-driven approach using machine learning and free energy calculations to offer a fresh perspective on this long-standing problem. Besides accurately predicting the DNA conformation, our model also explains why certain sequences adopt a particular conformation. Transitioning from DNA to the world of proteins, we employed unsupervised learning (hierarchical clustering) and our algebraic fitting algorithm to study the curvature of protein surfaces. We later used surface curvature to assess the shape complementarity among interacting biomolecules, intending to devise a scoring algorithm for the fast selection of binders with complementary curvature for a particular active site. To find out the binding mechanism at the molecular level, one needs to identify the appropriate reaction coordinate. Therefore, our next endeavour was to devise a novel approach based on regularized sparse autoencoders, an energy-based model, to predict a useful and physically intuitive set of reaction coordinates. Although finding strong binders is the first step towards finding a drug, it is not the most crucial step, since not all binders to a receptor can be characterized as drugs; drugs must also satisfy certain conditions known as ADME conditions. Therefore, finally, we tried to address this significant problem: “What makes a molecule a putative drug?” We used representation learning in conjunction with modern graph neural network architectures to learn and predict crucial attributes behind prospective drug-like activity. Overall, the goal of the studies carried out in the thesis is to enable the fast selection of putative drugs. en_US
dc.description.sponsorship IISER Pune, Department of Biotechnology, India (BT/PR34215/AI/133/22/2019). en_US
dc.language.iso en en_US
dc.subject Machine Learning en_US
dc.subject algorithm en_US
dc.subject HPC en_US
dc.subject drug discovery en_US
dc.subject structure prediction en_US
dc.subject deep learning en_US
dc.subject self-supervised learning en_US
dc.subject molecular recognition en_US
dc.title Design and application of scalable machine learning algorithms in molecular recognition, structure prediction and drug discovery en_US
dc.type Thesis en_US
dc.publisher.department Dept. of Chemistry en_US
dc.type.degree Int.Ph.D en_US
dc.contributor.department Dept. of Chemistry en_US
dc.contributor.registration 20152021 en_US


This item appears in the following Collection(s)

  • PhD THESES [583]
    Thesis submitted to IISER Pune in partial fulfilment of the requirements for the degree of Doctor of Philosophy
