Implement Generative Adversarial Imitation Learning (GAIL) #550

Merged
merged 19 commits into thu-ml:master from the gail branch
Mar 6, 2022

nuance1979
Collaborator

  • I have marked all applicable categories:
    • exception-raising fix
    • algorithm implementation fix
    • documentation modification
    • new feature
  • I have reformatted the code using make format (required)
  • I have checked the code using make commit-checks (required)
  • If applicable, I have mentioned the relevant/related issue(s)
  • If applicable, I have listed every item in this Pull Request below

Implement GAIL based on PPO and provide example script and sample (i.e., most likely not the best) results with Mujoco tasks. (#531, #173)
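
For readers unfamiliar with GAIL, the idea of building it on PPO can be sketched as follows: a discriminator is trained to tell expert (state, action) pairs from policy ones, and its output replaces the environment reward that PPO then optimizes. The snippet below is a minimal pure-Python illustration of that idea, not the code merged in this PR. The function names (`log_sigmoid`, `discriminator_loss`, `surrogate_reward`) and the reward convention `-log(1 - D(s, a))` are assumptions chosen for illustration; other sign conventions are also in use.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    # Identity: log(sigmoid(x)) = min(x, 0) - log(1 + exp(-|x|))
    return min(x, 0.0) - math.log1p(math.exp(-abs(x)))


def discriminator_loss(policy_logits, expert_logits):
    """Binary cross-entropy with expert pairs labeled 1 and policy pairs labeled 0.

    Both arguments are lists of raw discriminator logits (hypothetical inputs).
    """
    loss_pi = -sum(log_sigmoid(-z) for z in policy_logits) / len(policy_logits)
    loss_exp = -sum(log_sigmoid(z) for z in expert_logits) / len(expert_logits)
    return loss_pi + loss_exp


def surrogate_reward(logit: float) -> float:
    """Assumed reward -log(1 - D(s, a)); larger when the pair looks expert-like."""
    return -log_sigmoid(-logit)


if __name__ == "__main__":
    # An undecided discriminator (logit 0) gives each term a loss of log(2).
    print(discriminator_loss([0.0], [0.0]))
    # Policy samples the discriminator rates as expert-like earn more reward.
    print(surrogate_reward(5.0) > surrogate_reward(-5.0))
```

In a full training loop, the surrogate reward would overwrite the reward field of each collected batch before the usual PPO advantage computation, and the discriminator would be updated alongside the policy.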

@codecov-commenter
codecov-commenter commented Mar 2, 2022

Codecov Report

Merging #550 (77ae3c9) into master (d976a5a) will decrease coverage by 0.00%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #550      +/-   ##
==========================================
- Coverage   93.82%   93.81%   -0.01%     
==========================================
  Files          63       64       +1     
  Lines        4322     4368      +46     
==========================================
+ Hits         4055     4098      +43     
- Misses        267      270       +3     

| Flag | Coverage Δ | |
| --- | --- | --- |
| unittests | 93.81% <100.00%> (-0.01%) | ⬇️ |

| Impacted Files | Coverage Δ | |
| --- | --- | --- |
| tianshou/policy/__init__.py | 100.00% <100.00%> (ø) | |
| tianshou/policy/imitation/gail.py | 100.00% <100.00%> (ø) | |
| tianshou/policy/modelfree/trpo.py | 88.52% <0.00%> (-4.92%) | ⬇️ |

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data

Resolved review threads (outdated):
tianshou/policy/imitation/gail.py
tianshou/utils/net/continuous.py
examples/atari/atari_wrapper.py
examples/atari/README.md
examples/atari/atari_wrapper.py
tianshou/policy/imitation/gail.py
tianshou/utils/net/continuous.py
@Trinkle23897 Trinkle23897 merged commit 2377f2f into thu-ml:master Mar 6, 2022
@nuance1979 nuance1979 deleted the gail branch April 24, 2022 21:00
BFAnas pushed a commit to BFAnas/tianshou that referenced this pull request May 5, 2024
Implement GAIL based on PPO and provide example script and sample (i.e., most likely not the best) results with Mujoco tasks. (thu-ml#531, thu-ml#173)
Development

Successfully merging this pull request may close these issues.

Is there any bc or gail in the tianshou
Inverse reinforcement learning
3 participants