QinwenLuo/OCR-CFT

OCR-CFT

The official code for Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL.

Acknowledgments

This project makes use of the following open-source projects:

  • CORL: Implementation of the offline training process.
  • Uni-O4: Implementation of PPO.

Install

For installation instructions, please refer to the CORL repository.


Run

As an example, the steps below run O2SAC starting from the results of CQL offline pre-training.

Offline pre-training

To run the offline pre-training, use the following command:

cd offline
python cql.py --env hopper-medium-v2 --seed 0

Online fine-tuning

To perform online fine-tuning, use the following command:

cd finetune
python O2SAC.py --env hopper-medium-v2 --seed 0
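
The two steps above can be chained for a single environment and seed. The following is a minimal sketch, not part of the repository: `run_step`, `run_pipeline`, and the `DRY_RUN` variable are hypothetical names introduced here for illustration. It assumes the script is invoked from the repository root and that `offline` and `finetune` are directories directly under it. Whether `O2SAC.py` locates the pre-trained CQL results automatically is not stated in this README, so check the scripts' arguments before relying on it.

```shell
# Hypothetical helper: run a command inside a directory without changing
# the caller's working directory. With DRY_RUN=1 it only prints the command.
run_step() {
    local dir="$1"; shift
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "(cd $dir && $*)"
    else
        # The subshell keeps the cd local, so the next step starts
        # from the same directory as this one.
        ( cd "$dir" && "$@" )
    fi
}

# Hypothetical wrapper: offline pre-training, then online fine-tuning.
# Fine-tuning starts only if pre-training exits successfully.
run_pipeline() {
    local env_name="$1" seed="$2"
    run_step offline python cql.py --env "$env_name" --seed "$seed" &&
    run_step finetune python O2SAC.py --env "$env_name" --seed "$seed"
}
```

For example, `DRY_RUN=1 run_pipeline hopper-medium-v2 0` prints the two commands without executing them, and `run_pipeline hopper-medium-v2 0` runs them in order.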
