Digital Repository

MolGPT: Molecular Generation Using a Transformer-Decoder Model

Show simple item record

dc.contributor.author BAGAL, VIRAJ en_US
dc.contributor.author Aggarwal, Rishal en_US
dc.contributor.author Vinod, P. K. en_US
dc.contributor.author Priyakumar, U. Deva en_US
dc.date.accessioned 2022-04-04T08:56:45Z
dc.date.available 2022-04-04T08:56:45Z
dc.date.issued 2021-10 en_US
dc.identifier.citation Journal of Chemical Information and Modeling en_US
dc.identifier.issn 1549-9596 en_US
dc.identifier.issn 1549-960X en_US
dc.identifier.uri https://doi.org/10.1021/acs.jcim.1c00600 en_US
dc.identifier.uri http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6711
dc.description.abstract Application of deep learning techniques for de novo generation of molecules, termed as inverse molecular design, has been gaining enormous traction in drug design. The representation of molecules in SMILES notation as a string of characters enables the usage of state of the art models in natural language processing, such as Transformers, for molecular design in general. Inspired by generative pre-training (GPT) models that have been shown to be successful in generating meaningful text, we train a transformer-decoder on the next token prediction task using masked self-attention for the generation of druglike molecules in this study. We show that our model, MolGPT, performs on par with other previously proposed modern machine learning frameworks for molecular generation in terms of generating valid, unique, and novel molecules. Furthermore, we demonstrate that the model can be trained conditionally to control multiple properties of the generated molecules. We also show that the model can be used to generate molecules with desired scaffolds as well as desired molecular properties by conditioning the generation on scaffold SMILES strings of desired scaffolds and property values. Using saliency maps, we highlight the interpretability of the generative process of the model en_US
dc.language.iso en en_US
dc.publisher American Chemical Society en_US
dc.subject Molecular properties en_US
dc.subject Partition coefficient en_US
dc.subject Molecular modeling en_US
dc.subject Scaffolds en_US
dc.subject Molecules en_US
dc.subject 2021 en_US
dc.title MolGPT: Molecular Generation Using a Transformer-Decoder Model en_US
dc.type Article en_US
dc.contributor.department Dept. of Chemistry en_US
dc.identifier.sourcetitle Journal of Chemical Information and Modeling en_US
dc.publication.originofpublisher Foreign en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account