GitHub - RekerLab/ProdrugDesignPipeline: A novel machine learning pipeline for rapid and systematic design of prodrugs with desired properties.

Overview

Prodrugs are easily deployable chemical entities with beneficial pharmacokinetic properties; however, their rational design requires careful crafting of release mechanisms and holistic optimization of pharmacokinetic properties. Machine learning is poised to support rational design of prodrugs by efficiently filtering millions of generated designs down to the most promising candidates. Here, we designed and validated a novel machine learning pipeline for rapid and systematic design of prodrugs with desired properties. We also developed a subsampling approach for efficient application of our pair-wise DeepDelta approach to larger datasets (>1500 datapoints).

The associated publication is currently under review.

We would like to thank the Chemprop, Llama, Unsloth, Molecule-RNN, SmilesGPT, MolGan, Scikit-learn, and Chemical VAE developers for making their code publicly available.

Descriptions of Folders

DeepDelta_For_Prodrugs

Datasets, saved models, and code for applying DeepDelta to the two example prodrug case studies. Because the files containing the generated prodrugs and their predicted values are large, these results are stored on Zenodo (DOI: 10.5281/zenodo.18079221).

DeepDelta_Subsampling

Datasets and code for subsampling strategies for efficient application of the pair-wise DeepDelta approach to larger datasets (>1500 datapoints). Because the result files are large, they are stored on Zenodo (DOI: 10.5281/zenodo.14894034).
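
To illustrate why subsampling matters for a pair-wise approach, the sketch below draws a fixed number of molecule pairs from a dataset instead of enumerating all of them. It is not the code in this folder. The function name `subsample_pairs`, the uniform random sampling, the use of ordered pairs (including a molecule paired with itself), and the toy SMILES and property values are all assumptions made for the example; the strategies actually implemented in the repository may differ.

```python
import random


def subsample_pairs(smiles, values, max_pairs, seed=0):
    """Draw up to max_pairs distinct ordered pairs (a, b) and their deltas.

    Hypothetical helper: each pair is labeled with values[b] - values[a].
    Pairs are sampled uniformly without replacement from the n * n
    ordered pairs, so the full pair table is never built in memory.
    """
    if len(smiles) != len(values):
        raise ValueError("smiles and values must have the same length")
    n = len(smiles)
    total = n * n  # number of ordered pairs grows quadratically with n
    k = min(max_pairs, total)
    rng = random.Random(seed)
    # Sampling from range(total) avoids materializing every pair.
    flat_indices = rng.sample(range(total), k)
    pairs = []
    for flat in flat_indices:
        i, j = divmod(flat, n)  # decode the flat index into (row, column)
        pairs.append((smiles[i], smiles[j], values[j] - values[i]))
    return pairs


# Toy data, invented for illustration only.
toy_smiles = ["CCO", "CCN", "CCC", "c1ccccc1"]
toy_values = [1.0, 2.5, 0.5, 3.0]

# 4 molecules give 16 ordered pairs; keep only 6 of them.
toy_pairs = subsample_pairs(toy_smiles, toy_values, max_pairs=6, seed=42)
for mol_a, mol_b, delta in toy_pairs:
    print(mol_a, mol_b, delta)
```

With 1500 molecules, all ordered pairs would amount to 2.25 million training rows, which is why capping the number of pairs keeps training tractable as datasets grow.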

Existing_Prodrug_Analysis

Datasets, code, and results for the analysis of currently approved and investigational prodrugs (Figure S1).

Generative_Models

Datasets and code for generative models to directly build promoieties onto drug structures (based on Molecule-RNN).

Generative_Prodrug_Analysis

Datasets, code, and results for the analysis of novel prodrugs designed using generative models (Figure 1).

License

The copyrights of the software are owned by Duke University. As such, two licenses for this software are offered:

An open-source license under the GPLv2 license for non-commercial academic use.
A custom license with Duke University, for commercial use or uses without the GPLv2 license restrictions.

