This repository contains the codebase for training a scientific reasoning model to generate protein-binding ligands: small drug-like molecules that bind to diverse protein targets. It includes code for chain-of-thought (CoT) supervised fine-tuning (SFT) and reinforcement learning (RL) training.

For RL we use group relative policy optimization (GRPO) with custom reward functions that optimize the chemical validity, synthesizability, and binding affinity of generated molecules across diverse protein targets.
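As an illustration, the three reward components could be combined along these lines. This is a minimal sketch, not the implementation in this repository: the function name, weights, and score ranges are assumptions. In practice, validity would come from an RDKit parse, synthesizability from an SA score, and affinity from a docking or ML-based predictor.

```python
def combined_reward(validity, sa_score, affinity,
                    w_valid=1.0, w_sa=0.5, w_aff=1.0):
    """Weighted combination of per-molecule reward components (illustrative).

    validity: 1.0 if the generated SMILES parses to a valid molecule, else 0.0
              (e.g. via rdkit.Chem.MolFromSmiles)
    sa_score: synthetic accessibility score in [1, 10], lower = easier to make
    affinity: predicted binding score, assumed normalized to [0, 1]

    The weights and normalizations here are assumptions for the sketch.
    """
    if validity == 0.0:
        # An invalid molecule earns nothing, regardless of its other scores.
        return 0.0
    sa_reward = (10.0 - sa_score) / 9.0  # map SA [1, 10] -> reward [1, 0]
    return w_valid * validity + w_sa * sa_reward + w_aff * affinity
```

Gating the whole reward on validity keeps the policy from collecting partial credit for unparsable strings.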
To install the packages we recommend creating two separate conda environments: one for SFT and RL training, and one for running inference with vLLM.

To create the training environment, run:

```
conda env create -f molflow.yml
```

To create the vLLM inference environment, run:

```
conda env create -f vllm.yml
```

For both SFT and RL training we use accelerate to distribute training across multiple GPUs.
The SFT training parameters are:

```
max_length = 512
per_device_batch_size = 2
gradient_accumulation_steps = 2
max_steps = 25000
learning_rate = 5e-6
weight_decay = 0.05
warmup_steps = 1000
seed = 42
```

To start training, run:
```
accelerate launch sft_train.py
```

For RL training we use LoRA and trl's GRPOTrainer. Details about the training parameters can be found in grpo_train.py.
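The "group relative" part of GRPO can be sketched as follows: each prompt gets a group of sampled completions, and each completion's reward is normalized against its own group's statistics. This is a simplified illustration of the idea behind trl's GRPOTrainer, not its actual code:

```python
def group_relative_advantages(rewards, eps=1e-8):
    """Normalize one group of rewards (one group = completions for one prompt).

    Each completion's advantage is its reward minus the group mean, divided
    by the group standard deviation; eps guards against zero-variance groups.
    """
    n = len(rewards)
    mean = sum(rewards) / n
    std = (sum((r - mean) ** 2 for r in rewards) / n) ** 0.5
    return [(r - mean) / (std + eps) for r in rewards]
```

Because advantages are computed relative to the group, a completion is only rewarded for beating its siblings, and no separate value network is needed.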
To start training, run:
```
accelerate launch grpo_train.py
```

All training runs were conducted on 4x NVIDIA H100 80GB GPUs.
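For reference, the SFT warmup and step settings imply a learning-rate trajectory along these lines, assuming linear warmup followed by linear decay to zero; the actual scheduler used by sft_train.py may differ:

```python
def lr_at_step(step, base_lr=5e-6, warmup_steps=1000, max_steps=25000):
    """Learning rate at a given optimizer step (illustrative schedule).

    Linear warmup from 0 over `warmup_steps`, then linear decay to 0 at
    `max_steps`. Defaults mirror the SFT parameters listed above.
    """
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    return base_lr * max(0.0, (max_steps - step) / (max_steps - warmup_steps))
```

Note also that with per_device_batch_size = 2, gradient_accumulation_steps = 2, and 4 GPUs, the effective global batch size is 2 x 2 x 4 = 16 sequences per optimizer step.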