- 🔭 I’m currently working on MARL in swarm
- 🌱 I’m currently learning RL, attention, information
㊙️
secret
Highlights
- Pro
Pinned Loading
-
SIDM
SIDM PublicRepository for SIDM, a novel unsupervised adaptive framework improving policy quality, stability, and sample efficiency in RL scenarios (Deep RL, Hierarchical RL, Multi-Agent RL). Includes code, ex…
Python
-
hsd3
hsd3 PublicForked from facebookresearch/hsd3
Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines
Python
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.