Asset Detail:
Trained JTVAE Autoencoder Model on ZINC Dataset
Asset Detail:
Trained JTVAE Autoencoder Model on ZINC Dataset
Overview
ASSET LINK: | https://modac.cancer.gov/assetDetails?dme_data_id=NCI-DME-MS01-86228520 |
PROGRAM NAME: | Accelerating Therapeutics for Opportunities in Medicine (ATOM) |
STUDY NAME: | Generalized Generative Molecular Design |
ASSET NAME: | Trained JTVAE Autoencoder Model on ZINC Dataset |
ASSET PATH: | /NCI_DOE_Archive/ATOM/generalized_GMD_input_and_documentation/jtvae_zinc |
Asset Attributes
ATTRIBUTE | VALUE |
---|---|
ASSET NAME | Trained JTVAE Autoencoder Model on ZINC Dataset |
ASSET DESCRIPTION | This asset contains a Junction Tree Variational Autoencoder (JTVAE) model trained on the ZINC dataset, along with the train/test split used for training and validation, the datasets vocabulary file, and a documentation file further describing the model. The JTVAE method was originally released in a 2018 paper called ���Junction Tree Variational Autoencoder for Molecular Graph Generation���. The JTVAE method is an autoencoder that is trained to encode molecules from their SMILES string representation into a latent vector then decode the vector, and then decode the vector back to the original SMILES string. This allows for the application of math-based operators on the vector representation, which can be decoded into a new SMILES string. The main purpose of this tool is to facilitate the use of generative AI methods for creating new optimized molecules. This model was trained using our implementation of the JTVAE method which can be found in the associated GitHub link. We modified the original code to implement GPU���s for training. To speed up the training process, we upgraded the code from Python2 to Python3, and increased the reconstruction accuracy. Once you have a trained JTVAE model using our source code, the model can be plugged into our Generalized Generative Molecular Design Loop. This means that anyone can use this model as an autoencoder for molecular optimization. Please refer to the documentation to learn more about getting started with our model and JTVAE code. |
ASSET IDENTIFIER | jtvae_zinc |
ASSET TYPE | Model |
MODEL DOMAIN | AutEncoder |
MODEL FRAMEWORK | PyTorch |
MODEL PLATFORM | None |
PLATFORM VERSION | None |
POC NAME | Black, Sean |
POC EMAIL | sean.black2@nih.gov |
IS MODEL DEPLOYED | No |
COLLECTION SIZE | 33.8 MB |
CURATION STATUS | Unverified |
GITHUB | https://github.com/CBIIT/JTVAE |
Asset Files
To download files, please login.
FILE/COLLECTION | FILE SIZE | ACTIONS |
---|