SCORE

A PyTorch implementation for our paper "False Correlation Reduction for Offline Reinforcement Learning" published on IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). Our code is built off of TD3-BC and rlkit.

Link to our paper:

IEEE TPAMI: https://ieeexplore.ieee.org/document/10301548/
arXiv: https://arxiv.org/abs/2110.12468

Prerequisites

PyTorch 1.4.0 with Python 3.7
MuJoCo 2.00 with mujoco-py 2.0.2.13
d4rl 1.1 or higher (with v2 datasets)
rlkit 0.2.1

Usage

For training SCORE on Envname (e.g. walker2d-medium-v2), run:

python main.py --env_name=Envname --version=VersionName --gpu=0

The results are collected in ./output/Envname/SCORE(VersionName)/, where

debug.log records the log data,
params.pkl records the final model parameters,
progress.csv records the log data in the .csv format for analysis purpose,
variant.json records the hyperparameters.

Bibtex

@article{deng2023score,
  author={Deng, Zhihong and Fu, Zuyue and Wang, Lingxiao and Yang, Zhuoran and Bai, Chenjia and Zhou, Tianyi and Wang, Zhaoran and Jiang, Jing},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence}, 
  title={False Correlation Reduction for Offline Reinforcement Learning}, 
  year={2023},
  volume={},
  number={},
  pages={1-12},
  doi={10.1109/TPAMI.2023.3328397}}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
rlkit		rlkit
score		score
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SCORE

Prerequisites

Usage

Bibtex

About

Releases

Packages

Languages

familyld/SCORE

Folders and files

Latest commit

History

Repository files navigation

SCORE

Prerequisites

Usage

Bibtex

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages