- Learning from sparse feedback in complex environments is challenging and requires very efficient exploration strategies.
- Long-term credit assignment remains a major challenge.
- Meta Learning Shared Hierarchies [arXiv]
- TreeQN and ATreeC: Differentiable Tree Planning for Deep Reinforcement Learning [arXiv]
- Learning Goal-Directed Behaviour [Pdf]
- Multi-Level Discovery of Deep Options [arXiv]
- FeUdal Networks for Hierarchical Reinforcement Learning [arXiv]
- The Option-Critic Architecture [arXiv]