Please use this identifier to cite or link to this item: http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/7753
Title: Offspring GAN augments biased human genomic data
Authors: DAS, SUPRATIM
Shi, Xinghua
Dept. of Biology
Keywords: Human genomic data
Offspring GAN
2022
Issue Date: Aug-2022
Publisher: Association for Computing Machinery
Citation: BCB '22: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health, 50, 1-10.
Abstract: Genomic data have been used for trait association and disease risk prediction for a long time. In recent years, many such prediction models are built using machine learning (ML) algorithms. As of today, human genomic data and other biomedical data suffer from sampling biases in terms of people's ethnicity, as most of the data come from people of European ancestry. Smaller sample sizes for other population groups can cause suboptimal results in ML-based prediction models for those populations. Suboptimal predictions in precision medicine for some particular group can cause serious consequences limiting the model's applicability in real-world problems. As data collection for those populations is time-consuming and costly, we suggest deep learning-based models for in-silico data enhancement. Existing Generative Adversarial Network (GAN) models for genomic data like Population scale Genomic conditional-GAN (PG-cGAN) can generate realistic genomic data while trained on fairly unbiased data but fails while trained on biased data and encounters severe mode collapse. Our proposed model, Offspring GAN, can resolve the mode collapse issue even when trained in strongly biased genomic datasets. Our results demonstrate the ability of Offspring GAN to generate realistic and diverse label-aware data, which can augment limited real data to alleviate biases and disparities in genomic data. We also propose a privacy-preserving protocol using Offspring GAN to protect the privacy of genomic data.
URI: https://doi.org/10.1145/3535508.3545537
http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/7753
ISBN: 9781450393867
Appears in Collections:CONFERENCE PAPERS

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.