Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Actions

PyTorch implementation of our action candidate based clipped double estimator (AC-CDE), action candidate based clipped Double Q-learning (AC-CDQ), action candidate based clipped Double DQN (AC-CDDQN) and action candidate based TD3 (AC-TD3).

Paper link arXiv.

Usage

For AC-CDE, we evaluate it on the multi-armed bandits problem. The result can be reproduced by running:
```
cd AC_CDE_code
python3 main.py
```
For AC-CDQ, we evaluate it on the grid world game. The result can be reproduced by running:
```
cd AC_CDQ_code
python3 main.py
```
For AC-CDDQN, we evaluate it on the MinAtar benchmark. The result can be reproduced by running:
```
cd AC_CDDQN_code
CUDA_VISIBLE_DEVICES=0 python3 main.py
```
For AC-TD3, we evaluate it on MuJoCo continuous control tasks. The result can be reproduced by running:
```
cd AC_TD3_code
CUDA_VISIBLE_DEVICES=0 python3 main.py
```

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
AC_CDDQN_code		AC_CDDQN_code
AC_CDE_code		AC_CDE_code
AC_CDQ_code		AC_CDQ_code
AC_TD3_code		AC_TD3_code
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Actions

Usage

About

Releases

Packages

Languages

License

Jiang-HB/AC_CDQ

Folders and files

Latest commit

History

Repository files navigation

Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Actions

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages