hierarchical-DQN in PyTorch. [paper]
Only Q-learning and h-DQN for StochasticMDPEnv are implemented.
https://github.com/EthanMacdonald/h-DQN: Another implementation of the hierarchical-DQN paper, from which I copied the StochasticMDPEnv.
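
For readers unfamiliar with StochasticMDPEnv, here is a minimal sketch of the kind of six-state stochastic chain the h-DQN paper describes. It is an illustration only, not the code in this repository or in the one linked above. The class name `SketchStochasticMDPEnv`, the `seed` argument, and the three-value return of `step` are all assumptions made for this example. The assumed dynamics are: the agent starts in state 2, "left" always moves left, "right" moves right with probability 0.5 and left otherwise, and the episode ends in state 1 with reward 1.0 if state 6 was visited and 0.01 if it was not.

```python
import random


class SketchStochasticMDPEnv:
    """Hypothetical six-state stochastic chain (illustration, not the repo's code)."""

    LEFT, RIGHT = 0, 1

    def __init__(self, seed=None):
        # A private RNG keeps episodes reproducible when a seed is given.
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        self.state = 2            # assumed start state
        self.visited_six = False  # tracks whether the far end was reached
        return self.state

    def step(self, action):
        if self.state != 1:  # state 1 is terminal; do not move past it
            if action == self.RIGHT and self.state < 6 and self.rng.random() < 0.5:
                self.state += 1
            else:
                # "left", a failed "right", or any action taken in state 6
                self.state -= 1
            if self.state == 6:
                self.visited_six = True

        done = self.state == 1
        reward = 0.0
        if done:
            reward = 1.0 if self.visited_six else 0.01
        return self.state, reward, done


# Usage: one episode that always tries to go right.
env = SketchStochasticMDPEnv(seed=0)
state, done, reward = env.reset(), False, 0.0
while not done:
    state, reward, done = env.step(SketchStochasticMDPEnv.RIGHT)
print("final reward:", reward)
```

The larger reward is only reachable by first travelling away from the terminal state, which is why this task is a common test of whether a hierarchical agent can pursue an intermediate goal.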