AMG

💥 The official repository of our paper "Deep reinforcement learning as an interaction agent to steer fragment-based 3D molecular generation for protein pockets".

Introduction

Designing high-affinity molecules for protein targets (especially novel protein families) is a crucial yet challenging task in drug discovery. Recently, there has been tremendous progress in structure-based 3D molecular generative models that incorporate structural information of protein pockets. However, the capacity for molecular representation learning and the generalization needed to capture interaction patterns still require substantial development. Here, we propose AMG, a framework that leverages deep reinforcement learning as a pocket-ligand interaction agent to gradually steer fragment-based 3D molecular generation targeting protein pockets. AMG is trained using a two-stage strategy to capture interaction features and explicitly optimize the interaction agent. The framework also introduces a pair of separate encoders for pockets and ligands, coupled with a dedicated pre-training strategy. This enables AMG to enhance its generalization ability by leveraging a vast repository of undocked pockets and molecules, thus mitigating the constraints posed by the limited quantity and quality of available datasets. Extensive evaluations demonstrate that AMG significantly outperforms five state-of-the-art baselines in affinity performance while maintaining proper drug-likeness properties. Furthermore, visual analysis confirms the superiority of AMG in capturing 3D molecular geometrical features and interaction patterns within pocket-ligand complexes, indicating its considerable promise for various structure-based downstream tasks.

Framework

(Overview figure of the AMG framework; see the paper for the full pipeline.)

Prerequisites

The conda environment file is provided in ./environment.yml.

We evaluate our models using external tools, including Qvina and Pyscreener.

Install via Conda and Pip

conda create -n AMG python=3.7
conda activate AMG
conda install pytorch==1.13.1  pytorch-cuda=11.7 -c pytorch -c nvidia
conda install -c conda-forge pdbfixer
conda install conda-forge::openbabel

pip install tensorboard==1.15.0
pip install protobuf==3.19.6
pip install networkx==2.6.3
pip install rdkit==2023.3.2
pip install biopython==1.81
pip install pyscreener==1.1.1
pip install -U "ray[default]"

cd ADFRsuite_x86_64Linux_1.0
./install.sh -d myFolder -c 0

cd spinningup
pip install -e .
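
After installation, a quick sanity check like the sketch below (our own, not part of the repository) can confirm that the core dependencies import correctly and that PyTorch can see the GPU.

# check_env.py -- minimal environment sanity check (not part of AMG)
import torch
import rdkit
from rdkit import Chem

print("PyTorch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("RDKit:", rdkit.__version__)

# Parse a trivial molecule to confirm RDKit works end to end.
mol = Chem.MolFromSmiles("c1ccccc1O")
print("Parsed phenol with", mol.GetNumAtoms(), "heavy atoms")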

Dataset

🌟 We pre-trained our model using the natural product dataset COCONUT and the Pocket3D dataset collected from the Protein Data Bank. The dataset used for fine-tuning was obtained from CrossDocked2020.

🌟 To facilitate your implementation, we have provided the raw datasets used by AMG. Download the dataset archive from AMG-DATA.
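
As a quick post-download check, a script along the following lines can confirm that the extracted archive contains readable ligand SDF files; the ./data path below is an assumption, not the layout required by AMG.

# inspect_data.py -- hypothetical sketch; adjust paths to your extracted archive
from pathlib import Path
from rdkit import Chem

data_root = Path("./data")                    # assumed extraction directory
sdf_files = sorted(data_root.rglob("*.sdf"))
print(f"Found {len(sdf_files)} SDF files under {data_root}")

if sdf_files:
    # Parse the first ligand file to confirm it is readable.
    mols = [m for m in Chem.SDMolSupplier(str(sdf_files[0]), sanitize=False) if m is not None]
    print(f"{sdf_files[0].name}: {len(mols)} molecule(s) parsed")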

Training

Ligand encoder and fragment-based decoder pre-training:

python scripts/pretrain_ligand.py  

Pocket encoder pre-training:

python scripts/pretrain_pocket.py

The first training stage:

python scripts/train_rec.py

The second training stage:

python scripts/train_agent.py
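
For convenience, the four training steps above can be chained; the wrapper below is our own sketch that simply invokes the same scripts in order (any additional configuration flags the scripts expect are omitted here).

# run_training.py -- hypothetical convenience wrapper around the commands above
import subprocess

STAGES = [
    ["python", "scripts/pretrain_ligand.py"],  # ligand encoder + fragment-based decoder pre-training
    ["python", "scripts/pretrain_pocket.py"],  # pocket encoder pre-training
    ["python", "scripts/train_rec.py"],        # first training stage
    ["python", "scripts/train_agent.py"],      # second training stage (interaction agent)
]

for cmd in STAGES:
    print("Running:", " ".join(cmd))
    subprocess.run(cmd, check=True)            # abort if any stage fails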

Sampling

python scripts/sample_testset.py --config configs/rl.yml --start_index 0  --end_index 99 
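
To split sampling into smaller jobs, the --start_index/--end_index flags shown above can be swept over chunks of the test set; the chunk size and total number of targets below are placeholders, and we assume --end_index is inclusive as in the example command.

# sample_chunks.py -- hypothetical helper that runs the sampling script in chunks
import subprocess

CHUNK = 20   # placeholder chunk size
TOTAL = 100  # placeholder number of test-set targets (indices 0..99 above)

for start in range(0, TOTAL, CHUNK):
    end = min(start + CHUNK, TOTAL) - 1       # inclusive end index
    subprocess.run(
        ["python", "scripts/sample_testset.py",
         "--config", "configs/rl.yml",
         "--start_index", str(start),
         "--end_index", str(end)],
        check=True,
    )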

Evaluation

Evaluation from sampling results

python scripts/evaluate_amg.py
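
Independently of the script above, drug-likeness properties of the sampled molecules can be inspected directly with RDKit; the sketch below assumes the samples are available as an SDF file (the path is a placeholder) and reports mean QED and LogP.

# quick_props.py -- hypothetical drug-likeness summary for sampled molecules
from rdkit import Chem
from rdkit.Chem import QED, Crippen

# Placeholder path: point this at your sampled molecules in SDF format.
mols = [m for m in Chem.SDMolSupplier("sampled_molecules.sdf") if m is not None]

if mols:
    qed_scores = [QED.qed(m) for m in mols]
    logp_values = [Crippen.MolLogP(m) for m in mols]
    print(f"{len(mols)} molecules")
    print(f"mean QED:  {sum(qed_scores) / len(qed_scores):.3f}")
    print(f"mean LogP: {sum(logp_values) / len(logp_values):.3f}")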

Evaluation from meta files

We provide the sampling results of our model and of the Pocket2Mol, TargetDiff, DecompDiff, ResGen, and FLAG baselines here.

You can quickly reproduce the results reported in the paper with summary.ipynb.
