Design and application of scalable machine learning algorithms in molecular recognition, structure prediction and drug discovery

GUPTA, ABHIJIT


dc.contributor.advisor	MUKHERJEE, ARNAB	en_US
dc.contributor.author	GUPTA, ABHIJIT	en_US
dc.date.accessioned	2022-03-25T04:30:01Z
dc.date.available	2022-03-25T04:30:01Z
dc.date.issued	2021-08	en_US
dc.identifier.citation	159	en_US
dc.identifier.uri	http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6626
dc.description.abstract	Starting with the problem of structure prediction, we leveraged machine learning to accurately predict DNA conformation from its sequence. We developed an end-to-end data-driven approach using machine learning and free energy calculations to offer a fresh perspective on this long-standing problem. Besides accurately predicting the DNA conformation, our model also explains why certain sequences adopt a particular conformation. Transitioning from DNA to the world of proteins, we employed unsupervised learning (hierarchical clustering) and our algebraic fitting algorithm to study the curvature of protein surfaces. We later used surface curvature to assess the shape complementarity among the interacting biomolecules, intending to devise a scoring algorithm for the fast selection of binders with complementary curvature for a particular active site. To find out the binding mechanism at the molecular level, one needs to identify the appropriate reaction coordinate. Therefore, our next endeavour was to devise a novel approach based on regularized sparse autoencoders, an energy-based model, to predict a useful and physically intuitive set of reaction coordinates. Although finding strong binders is the first step towards finding a drug, it is not the most crucial step, since not all binders to a receptor can be characterized as drugs; drugs must also satisfy certain conditions called the ADME conditions. Therefore, finally, we tried to address this significant problem: “what makes a molecule a putative drug?”. We used representation learning in conjunction with modern graph neural network architectures to learn and predict crucial attributes behind the prospective drug-like activity. Overall, the goal of the studies carried out in the thesis is to enable the fast selection of putative drugs.	en_US
dc.description.sponsorship	IISER Pune, Department of Biotechnology, India (BT/PR34215/AI/133/22/2019).	en_US
dc.language.iso	en	en_US
dc.subject	Machine Learning	en_US
dc.subject	algorithm	en_US
dc.subject	HPC	en_US
dc.subject	drug discovery	en_US
dc.subject	structure prediction	en_US
dc.subject	deep learning	en_US
dc.subject	self-supervised learning	en_US
dc.subject	molecular recognition	en_US
dc.title	Design and application of scalable machine learning algorithms in molecular recognition, structure prediction and drug discovery	en_US
dc.type	Thesis	en_US
dc.publisher.department	Dept. of Chemistry	en_US
dc.type.degree	Int.Ph.D	en_US
dc.contributor.department	Dept. of Chemistry	en_US
dc.contributor.registration	20152021	en_US