Skip to content

Latest commit

 

History

History
32 lines (21 loc) · 1.27 KB

README.md

File metadata and controls

32 lines (21 loc) · 1.27 KB

Transformer-based Multi-agent Reinforcement Learning for Multiple Unmanned Aerial Vehicle Coordination in Air Corridors

submiteed to IEEE International Conference on Communications

Animation on cylinder-torus-torus-cylinder

cylinder-torus-torus-cylinder.gif

D3MOVE_v4.py for visualization of UAVs coordination in air corridors.

Air Corridor Modeling

UAVs need to traverse several air corridors to reach their destinations. Air corridors are modelled as cylinder and partial torus.

Cylinder and Torus

Air_corridor.jpg corridor.py

RL Training

Network Structure

  • H(), embedding layer, normalizes the input values and standardize the input dimensions.
  • G(), transformer layer, deals with stochastic neighbors information
  • F(), actor-critic network combined. TransRL.jpg

network function.png

Training

related package can be found in environment.yml

main.py