Exploring Chemical Reaction Space With Reaction Difference Fingerprints and Parametric t-SNE

Exploring Chemical Reaction Space With Reaction Difference Fingerprints and Parametric t-SNE
Mikhail G. Andronov,Maxim V. Fedorov,and Sergey Sosnin. Full text: https://pubs.acs.org/doi/full/10.1021/acsomega.1c04778

This is the repositary accompanying the paper. It contains a pytorch implementation of the parametric t-SNE algorithm and the pretrained models to visualize chemical reaction space.

Usage

Before using the code, create a new conda environment and install requred packages:

conda create -n <environment name> anaconda python=3.7
conda activate <environment name>
conda install -c conda-forge rdkit
conda install pytorch lightgbm

Training of new models

Use the script train.py to train a new model. This requires a train dataset in the form of a file with reaction SMILES and a config file. Rename the file example.config.yaml as config.yaml and set the instructions and parameters in it. The config file contains the following instructions and hyperparameters:

"device": "cpu" or "cuda" - device to train a model on
"seed": random seed for pytorch
"save_model": a flag indicating whather to save the model or not
"problem_settings":
- "filename": path to the train dataset
- "fp_method": "structural" or "difference" - what type of reaction fingerprints to use
- "n_bits": the length of the reaction fingerprint
- "fp_type": "MorganFP", "AtomPairFP" or "TopologicalTorsion" - three options for the type of fingerprints
- "include_agents": whether to include agents in a reaction fingerprint or not
- "agent_weight": agent weight in difference fingerprints if agents are included
- "non_agent_weight": non-agent weight in difference fingerprints
- "bit_ratio_agent": the ratio of agent bits in structural fingerprints
"optimization":
- "lr": learning rate for the Adam optimizer
"training": section for training hyperparameters, which is used in model.fit_model function

Name		Name	Last commit message	Last commit date
Latest commit History 156 Commits
data		data
preparation		preparation
utils		utils
.gitignore		.gitignore
README.md		README.md
config.py		config.py
cross_validation.py		cross_validation.py
datasets.py		datasets.py
example_config.yaml		example_config.yaml
file_builder.py		file_builder.py
model.py		model.py
projection_example.png		projection_example.png
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Exploring Chemical Reaction Space With Reaction Difference Fingerprints and Parametric t-SNE

Usage

Training of new models

About

Releases

Packages

Languages

Academich/reaction_space_ptsne

Folders and files

Latest commit

History

Repository files navigation

Exploring Chemical Reaction Space With Reaction Difference Fingerprints and Parametric t-SNE

Usage

Training of new models

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages