Please use this identifier to cite or link to this item: http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/7753
Full metadata record
DC FieldValueLanguage
dc.contributor.authorDAS, SUPRATIM
dc.contributor.authorShi, Xinghua
dc.coverage.spatialNorthbrooken_US
dc.date.accessioned2023-04-26T09:11:40Z
dc.date.available2023-04-26T09:11:40Z
dc.date.issued2022-08
dc.identifier.citationBCB '22: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health, 50, 1-10.en_US
dc.identifier.isbn9781450393867
dc.identifier.urihttps://doi.org/10.1145/3535508.3545537en_US
dc.identifier.urihttp://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/7753
dc.description.abstractGenomic data have been used for trait association and disease risk prediction for a long time. In recent years, many such prediction models are built using machine learning (ML) algorithms. As of today, human genomic data and other biomedical data suffer from sampling biases in terms of people's ethnicity, as most of the data come from people of European ancestry. Smaller sample sizes for other population groups can cause suboptimal results in ML-based prediction models for those populations. Suboptimal predictions in precision medicine for some particular group can cause serious consequences limiting the model's applicability in real-world problems. As data collection for those populations is time-consuming and costly, we suggest deep learning-based models for in-silico data enhancement. Existing Generative Adversarial Network (GAN) models for genomic data like Population scale Genomic conditional-GAN (PG-cGAN) can generate realistic genomic data while trained on fairly unbiased data but fails while trained on biased data and encounters severe mode collapse. Our proposed model, Offspring GAN, can resolve the mode collapse issue even when trained in strongly biased genomic datasets. Our results demonstrate the ability of Offspring GAN to generate realistic and diverse label-aware data, which can augment limited real data to alleviate biases and disparities in genomic data. We also propose a privacy-preserving protocol using Offspring GAN to protect the privacy of genomic data.en_US
dc.language.isoenen_US
dc.publisherAssociation for Computing Machineryen_US
dc.subjectHuman genomic dataen_US
dc.subjectOffspring GANen_US
dc.subject2022en_US
dc.titleOffspring GAN augments biased human genomic dataen_US
dc.typeConference Papersen_US
dc.contributor.departmentDept. of Biologyen_US
dc.identifier.sourcetitleBCB '22: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Healthen_US
dc.publication.originofpublisherForeignen_US
Appears in Collections:CONFERENCE PAPERS

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.