guanjiayi/poce

POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning

This project provides the open-source implementation of the POCE method introduced in the paper "POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning".

Install

Create a conda environment and install the package:

cd poce/
conda env create --file conda-recipe.yaml
conda activate poce
pip install -e .

Install the bundled Safety-Gymnasium environment:

cd poce/envs/safety-gymnasium-poce/
pip install -e .
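
As a quick sanity check after both editable installs, the following minimal sketch reports which modules are still not importable from the active conda environment. The helper name `missing_modules` is hypothetical, and the module names `poce` and `safety_gymnasium` in the usage comment are assumptions about what the two `pip install -e .` steps install. Adjust them if the packages are named differently.

```python
import importlib.util


def missing_modules(names):
    """Return the entries of `names` that cannot be found on the import path."""
    missing = []
    for name in names:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised for a missing parent package or a module with a broken spec.
            found = False
        if not found:
            missing.append(name)
    return missing


# Example usage (module names are assumptions, not confirmed by the repository):
#   print(missing_modules(["poce", "safety_gymnasium"]))
# An empty list means every listed module was found.
```

If the returned list is non-empty, re-running the corresponding `pip install -e .` step inside the activated `poce` environment is the first thing to try.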

Load the dataset

Download the dataset from the Google Drive link given in the TXT file and place it in the dataset directory (poce/dataset/):

cd poce/dataset/
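
To confirm that the download landed in the right place before training, a small check along these lines may help. It is only a sketch: the helper name `list_dataset_entries` is hypothetical, and the dataset's file names and format are not specified here, so the code simply lists everything in the directory except `.txt` files (such as the one holding the download link).

```python
from pathlib import Path


def list_dataset_entries(dataset_dir):
    """Return sorted names of entries in `dataset_dir`, skipping .txt files."""
    root = Path(dataset_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"dataset directory not found: {root}")
    return sorted(
        p.name
        for p in root.iterdir()
        # Skip plain .txt files; keep data files and subdirectories.
        if not (p.is_file() and p.suffix.lower() == ".txt")
    )


# Example usage, run from the repository root (the path is an assumption):
#   entries = list_dataset_entries("dataset")
#   print(entries if entries else "No dataset files found yet.")
```

An empty result suggests the dataset has not been downloaded yet or was placed in a different directory.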

Training and testing the model

cd poce/
python script/train_policy.py
