Digital Repository

Biomedical Data Enhancement Using Deep Learning

Show simple item record

dc.contributor.advisor Shi, Xinghua en_US
dc.contributor.advisor MADHUSUDHAN, M.S. en_US
dc.contributor.author DAS, SUPRATIM en_US
dc.date.accessioned 2022-05-09T05:24:57Z
dc.date.available 2022-05-09T05:24:57Z
dc.date.issued 2022-08
dc.identifier.citation 83 en_US
dc.identifier.uri http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6807
dc.description.abstract Genomic data and other omics data have been used for the prediction of disease phenotype in precision medicine for a long time. In recent years, many such prediction models have been built using Machine Learning (ML) algorithms. As of today, Genomic data and other biomedical data suffer from sampling bias in terms of peoples' ethnicity, as most data comes from people of European ancestry. A smaller sample size for other population groups causes suboptimal results in ML-based prediction models for those populations. As data collection for those populations is time-consuming and costly, we developed Deep Learning-based models for in-silico data enhancement. We propose Offspring- Generative Adversarial Network (Offspring GAN), which is trained on heavily biased real data to generate realistic data and augment to existing biased real data to alleviate biases and disparities in real data. Contrary to traditional conditional GANs, Offspring GAN consists of four players, one generator, one discriminator and two novel F1 generators. We evaluated the data fidelity and variation of synthetic data using principal component analysis, correlation matrix, and comparison of Gaussian components. Our results showed Offspring GAN's ability to mitigate mode collapse problems and generate realistic data of good variation even when trained on biased data. en_US
dc.language.iso en en_US
dc.subject Deep Learning en_US
dc.subject Generative Adversarial Networks en_US
dc.subject GAN en_US
dc.subject Mode Collapse en_US
dc.subject Data Augmentaion en_US
dc.subject Genomics en_US
dc.subject SNP en_US
dc.subject Data Bias en_US
dc.title Biomedical Data Enhancement Using Deep Learning en_US
dc.type Thesis en_US
dc.type.degree BS-MS en_US
dc.contributor.department Dept. of Biology en_US
dc.contributor.registration 20171038 en_US


Files in this item

This item appears in the following Collection(s)

  • MS THESES [1705]
    Thesis submitted to IISER Pune in partial fulfilment of the requirements for the BS-MS Dual Degree Programme/MSc. Programme/MS-Exit Programme

Show simple item record

Search Repository


Advanced Search

Browse

My Account