Energy Harvesting Reconfigurable Intelligent Surface for UAV Based on Robust Deep Reinforcement Learning

Introduction

This repository is the implementation of "Energy Harvesting Reconfigurable Intelligent Surface for UAV Based on Robust Deep Reinforcement Learning". Paper
This study proposed a dual (time and spcace)-domain energy harvesting (EH) approach to maximize EH efficiency of the UAV—RIS system by jointly optimizing the RIS phase shifts vector, the RIS scheduling martix, the length of energy harvesting phase, and the transmit power.
For the UAV trajectory design, we considered the density-aware and Fermat point-based algorithms.
The implementation of DDPG and TD3 using Stable-Baseline3.
The implementation of SD3 is based on Dr. Pan's research: Softmax Deep Double Deterministic Policy Gradients.

There are some limitations to this work. If you have any questions or suggestions, please feel free to contact me (haoranpeng@cuhk.edu.hk). Your suggestions are greatly appreciated.

Citing

Please consider citing our paper if this repository is helpful to you.

Haoran Peng, and Li-Chun Wang, “Energy Harvesting Reconfigurable Intelligent Surface for UAV Based on Robust Deep Reinforcement Learning”, IEEE Trans. Wireless Commun., vol. 22, no. 10, pp. 6826——6838, Oct., 2023, doi: 10.1109/TWC.2023.3245820

Bibtex:

  @ARTICLE{10051712,
  author={Peng, Haoran and Wang, Li-Chun},
  journal={IEEE Trans. Wireless Commun.}, 
  title={Energy Harvesting Reconfigurable Intelligent Surface for {UAV} Based on Robust Deep Reinforcement Learning}, 
  year={2023},
  month={Oct.}
  volume={22},
  number={10},
  pages={6826--6838},
 }

@INPROCEEDINGS{peng1570767WCNC,
  author={Peng, Haoran and Wang, Li-Chun and Li, Geoffrey Ye and Tsai, Ang-Hsun},
  booktitle={Proc. IEEE Wireless Commun. Netw. Conf. (WCNC)}, 
  title={Long-Lasting {UAV}-aided {RIS} Communications based on {SWIPT}},
  address={Austin, TX},
  year={2022},
  month = {Apr.}
}

Requirements

Python: 3.6.13
Pytorch: 1.10.1
gym: 0.15.3
numpy: 1.19.2
matplotlib
pandas
Stable-Baselines3

Usage

Descriptions of folders

The folder "XXXX-MultiUT-Time" is the source code for the time-domain EH scheme in the multiple user scenario.
The folder "XXXX-MultiUT-Two" is the source code for the two-domain (Time and Space) EH scheme in the multiple user scenario.
The folder "XXXX-SingleUT-Time" is the source code for the time-domain EH scheme in the single user scenario.
The folder "XXXX-SingleUT-Two" is the source code for the two-domain (Time and Space) EH scheme in the single user scenario.
The folder "CreateData" is the source code for generating dataset of trajectories files for users and the UAV.

Descriptions of files

For the Exhaustive Algorithm, the communication environment is impletemented in 'ARIS_ENV.py'.
For DRL-based algorithms, the communication environment is impletemented in 'gym_foo/envs/foo_env.py'.
You can change the dataset and the scenario in 'gym_foo/envs/foo_env.py'.

Training phase

For the TD3 and DDPG, please execute the TD3.py and DDPG.py to train the model, such as

python TD3.py / python DDPG.py

Please change the training mode in the file "gym_foo/envs/foo_env.py" before you executing the training progress. For example:

class FooEnv(gym.Env):
    metadata = {'render.modes': ['human']}
    def __init__(self, LoadData = True, Train = False, multiUT = True, Trajectory_mode = 'Fermat', MaxStep = 41):

If you want to conduct the training phase, the value of "Train" should be "True", otherwise, the value of "Train" should be "Flase" when excuting the testing phase.

For the exhaustive search, please execute the ExhaustiveSearch.py to reproduce the simulation results.
For the SD3, please execute main.py to train a new model.

Please use the version of 0.15.3 for Gym, otherwise there may have some issues in the training phase.

Testing phase

Please execute test.py to evaluate DRL models. Before you produce the testing results, please change the dataset and scenario in 'gym_foo/envs/foo_env.py'.

The EH efficiency

The EH efficiency = the harvested energy / the received energy from RF signals

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
CreateData		CreateData
DDPG-MultiUT-Time		DDPG-MultiUT-Time
DDPG-MultiUT-Two		DDPG-MultiUT-Two
DDPG-SingleUT-Time		DDPG-SingleUT-Time
DDPG-SingleUT-Two		DDPG-SingleUT-Two
Exhaustive-MultiUT-Time		Exhaustive-MultiUT-Time
Exhaustive-MultiUT-Two		Exhaustive-MultiUT-Two
Exhaustive-SingleUT-Time		Exhaustive-SingleUT-Time
Exhaustive-SingleUT-Two		Exhaustive-SingleUT-Two
SD3-MultiUT-Time		SD3-MultiUT-Time
SD3-MultiUT-Two		SD3-MultiUT-Two
SD3-SingleUT-Time		SD3-SingleUT-Time
SD3-SingleUT-Two		SD3-SingleUT-Two
TD3-MultiUT-Time		TD3-MultiUT-Time
TD3-MultiUT-Two		TD3-MultiUT-Two
TD3-SingleUT-Time		TD3-SingleUT-Time
TD3-SingleUT-Two		TD3-SingleUT-Two
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Energy Harvesting Reconfigurable Intelligent Surface for UAV Based on Robust Deep Reinforcement Learning

Introduction

Citing

Requirements

Usage

Descriptions of folders

Descriptions of files

Training phase

Testing phase

The EH efficiency

About

Releases

Packages

Languages

Haoran-Peng/UAV-RIS_EnergyHarvesting

Folders and files

Latest commit

History

Repository files navigation

Energy Harvesting Reconfigurable Intelligent Surface for UAV Based on Robust Deep Reinforcement Learning

Introduction

Citing

Requirements

Usage

Descriptions of folders

Descriptions of files

Training phase

Testing phase

The EH efficiency

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages