T2TL

Exploiting Transformer Sparse Reward in Reinforcement Learning for Interpretable Temporal Logic Motion Planning

If any questions, feel free to contact: [email protected].

Installation instructions

We recommend using Python 3.7 or Python 3.8 to run this code.

pip install -r requirements.txt or conda env create -f conda_env.yml (Note that .yml is only effective for Python 3.7)
Install Spot-2.9:
- For Python 3.7, follow the installation instructions at the link. Spot should be installed in /usr/local/lib/python3.7/site-packages/spot. This step usually takes around 20 mins.
- For Python 3.8+, conda install -c conda-forge spot (Note that this command needs Python is 3.8+)
To train the agent in ZoneEnv, you will need Mujoco installed, as well as an active license.
- pip install mujoco-py==2.0.2.9
- pip install -e src/envs/safety/safety-gym/
- pip install numpy==1.21.5, if Python 3.7 or pip install numpy==1.23, if Python 3.8
Install dgl:
- If you only reproduce the results from Transformer-encoded method, just pip install dgl-cu113 -f https://data.dgl.ai/wheels/repo.html
- If you also want to reproduce the results from GNN-encoded method, we found the dgl team has not maintained dgl_cu111==0.7a210520. You can install it as follows:
  - download the dgl file from google drive
  - unzip dgl.zip and unzip dgl_cu111-0.7a210520.dist-info.zip
  - mv -f dgl/ /home/{your_workspace_name}/anaconda3/envs/T2TL/lib/python3.7/site-packages/
  - mv -f dgl_cu111-0.7a210520.dist-info/ /home/{your_workspace_name}/anaconda3/envs/T2TL/lib/python3.7/site-packages/

Examples

cd dis_src/
To train the agent with GNN method, run:
- python T1TL.py --gnn RGCN_8x32_ROOT_SHARED
To train the agent with pretrained GNN method, run:
- python train_PreGNNAgent.py
To train the agent with T1TL method, run:
- python T1TL.py
To train the agent with pretrained T1TL method, run:
- python T1TL_pretrain.py
To train the agent with T2TL method, run:
- python T2TL.py
To train the agent with pretrained T2TL method, run:
- python T2TL_pretrain..py

WaterWorld

ZoneEnv

Dimension Comparison

Interpretability via Attention

Citation

@ARTICLE{10167731,
  author={Zhang, Hao and Wang, Hao and Kan, Zhen},
  journal={IEEE Robotics and Automation Letters}, 
  title={Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning}, 
  year={2023},
  volume={8},
  number={8},
  pages={4831-4838},
  doi={10.1109/LRA.2023.3290511}}

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
README_file		README_file
src		src
README.md		README.md
conda_env.yml		conda_env.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

T2TL

Installation instructions

Examples

WaterWorld

ZoneEnv

Dimension Comparison

Interpretability via Attention

Citation

About

Releases

Packages

Languages

Charlie0257/T2TL

Folders and files

Latest commit

History

Repository files navigation

T2TL

Installation instructions

Examples

WaterWorld

ZoneEnv

Dimension Comparison

Interpretability via Attention

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages