RL_Legged-Wheeled-sim

Introduction

This project is a simulation of a two-legged-wheeled robot based on a deep reinforcement learning approach. The simulator used is MuJoco. The specific approach is to firstly encapsulate the robot as a standard reinforcement learning Python interface according to the gym style, including state feedback, action, reward function, reset, etc. The robot is then used to interact with the mujoco model through the Pytorch framework. Secondly, the SAC algorithm is built through Pytorch framework to interact with the MuJoco model to make the robot reach the goal point.

Environment Configuration

The project was done on Ubuntu20.04 with the following Python versions and library versions:

Python 3.7.10
MuJoco 210
gym 0.21.0
mujoco-py 2.1.2.14
Pytorch 1.10.0
numpy 1.21.6

Clarification

The project consists of three folders, a main file, a test file, and four other function files. These programs can be divided into three parts: the robot environment, the results, and the algorithms.

1. Environmental files

The robot simulation environment is mainly written in the envs folder, which consists of the asset folder, the file named biped.py, and the file named register.py.

biped.py encapsulates environment reset, step environment update (including reward function), status feedback, etc. The main function is to encapsulate the information of the robot inside the mujoco emulator through Python interface. register.py is the bootstrap file to guide the project to locate biped.py.

The asset folder mainly contains the mujoco simulator, Legged_wheel.xml, Legged_wheel.xml1, Legged_wheel.xml2, Legged_wheel.xml3 are four different simulation scenarios. meshes folder stores the robot parts in stl form.

2. Results file of the training

The trained neural network model is recorded in the models folder and called by test.py. The runtime data is recorded in the runs folder. To view the data, go to the runs folder and use the command:

tensorboard --logdir=[filename]

3. Reinforcement learning algorithm file

main.py is the main training program, the algorithm is SAC(sac.py) and the neural network model is written in Pytorch(model.py). Since SAC is offline reinforcement learning, it requires replay_memory(replay_memory.py). utils.py defines some script functions.

Usage

Install MuJoco 210
Install Anaconda
Create a conda environment
Install mujoco-py, gym and Pytorch
Download the project locally and run the following command for saving the running data:

cd RL_Legged-Wheeled-sim/legged_wheeled_mujoco
mkdir runs

Start a training session.

python main.py

Testing the trained model

python test.py

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
legged_wheeled_mujoco		legged_wheeled_mujoco
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL_Legged-Wheeled-sim

Introduction

Environment Configuration

Clarification

1. Environmental files

2. Results file of the training

3. Reinforcement learning algorithm file

Usage

About

Releases

Packages

Languages

hrxsd/RL_Legged-Wheeled-sim

Folders and files

Latest commit

History

Repository files navigation

RL_Legged-Wheeled-sim

Introduction

Environment Configuration

Clarification

1. Environmental files

2. Results file of the training

3. Reinforcement learning algorithm file

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages