Please use this identifier to cite or link to this item: http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/10101
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorMartinez-Corral, Rosa-
dc.contributor.authorSHIVHARE, PRARABDH-
dc.date.accessioned2025-05-27T10:38:28Z-
dc.date.available2025-05-27T10:38:28Z-
dc.date.issued2025-05-
dc.identifier.citation58en_US
dc.identifier.urihttp://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/10101-
dc.description.abstractIn the past 2 decades of literature dealing with modeling complex systems, there has been a balance, or rather, a tension between the predictive power and the interpretability of machine learning models using vast amounts of data. Biological complex systems are no different. The past decade has seen an astonishing increase in the amount of publicly available functional genomics data. While the adoption of deep learning techniques to determine the sequence patterns, syntax and grammar in DNA sequence elements that govern gene regulatory activity has been a natural consequence, most of these investigations have adopted a ‘black box’ approach, with model predictions that are hard to interpret mechanistically. Multiple attribution strategies, which seek to extract meaningful post-hoc interpretations from neural networks have been proposed for addressing this problem. However, there remains a substantial gap in the literature between the outputs of such post-hoc methods and fully mechanistic models, specifically in the context of gene regulation. This problem can be at least partially overcome by including some level of mechanistic detail in the internal structure of deep learning algorithms. This can enable us to better understand the predictions of the model to obtain mechanistic insight. Here, we use a cell-state specific Massively Paraellel Reporter Assay dataset from hemotopoeitic stem cells to model gene regulation using deep learning to predict transcription factor (TF) binding on DNA sequence employing cell-state specific Chip-Seq data and graph-based representations of markov processes to model effects of bound TFs on different rate-limiting steps in the transcriptional cycle. Our model assumptions are grounded in recent biophysical findings in literature.en_US
dc.description.sponsorshipDr. Rosa Martinez-Corral, Dr. Lars Veltenen_US
dc.language.isoenen_US
dc.subjectDeep Learningen_US
dc.subjectBiophysicsen_US
dc.titleDeveloping a Biophysically Grounded Deep Learning Model for Gene Expression Predictionen_US
dc.typeThesisen_US
dc.description.embargoOne Yearen_US
dc.type.degreeBS-MSen_US
dc.contributor.departmentDept. of Data Scienceen_US
dc.contributor.registration20201147en_US
Appears in Collections:MS THESES

Files in This Item:
File Description SizeFormat 
20201147_Prarabdh_Shivhare_MS_Thesis.pdfMS Thesis12.73 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.