Please use this identifier to cite or link to this item: http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6807
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorShi, Xinghuaen_US
dc.contributor.advisorMADHUSUDHAN, M.S.en_US
dc.contributor.authorDAS, SUPRATIMen_US
dc.date.accessioned2022-05-09T05:24:57Z
dc.date.available2022-05-09T05:24:57Z
dc.date.issued2022-08
dc.identifier.citation83en_US
dc.identifier.urihttp://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6807
dc.description.abstractGenomic data and other omics data have been used for the prediction of disease phenotype in precision medicine for a long time. In recent years, many such prediction models have been built using Machine Learning (ML) algorithms. As of today, Genomic data and other biomedical data suffer from sampling bias in terms of peoples' ethnicity, as most data comes from people of European ancestry. A smaller sample size for other population groups causes suboptimal results in ML-based prediction models for those populations. As data collection for those populations is time-consuming and costly, we developed Deep Learning-based models for in-silico data enhancement. We propose Offspring- Generative Adversarial Network (Offspring GAN), which is trained on heavily biased real data to generate realistic data and augment to existing biased real data to alleviate biases and disparities in real data. Contrary to traditional conditional GANs, Offspring GAN consists of four players, one generator, one discriminator and two novel F1 generators. We evaluated the data fidelity and variation of synthetic data using principal component analysis, correlation matrix, and comparison of Gaussian components. Our results showed Offspring GAN's ability to mitigate mode collapse problems and generate realistic data of good variation even when trained on biased data.en_US
dc.language.isoenen_US
dc.subjectDeep Learningen_US
dc.subjectGenerative Adversarial Networksen_US
dc.subjectGANen_US
dc.subjectMode Collapseen_US
dc.subjectData Augmentaionen_US
dc.subjectGenomicsen_US
dc.subjectSNPen_US
dc.subjectData Biasen_US
dc.titleBiomedical Data Enhancement Using Deep Learningen_US
dc.typeThesisen_US
dc.type.degreeBS-MSen_US
dc.contributor.departmentDept. of Biologyen_US
dc.contributor.registration20171038en_US
Appears in Collections:MS THESES

Files in This Item:
File Description SizeFormat 
MS_Thesis_20171038_final_signed_both.pdfThesis11.42 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.