Remap action to fit gym's action space #313

ChenDRAG · 2021-03-18T12:33:33Z

See #312 for details.

ChenDRAG · 2021-03-18T12:35:49Z

The validity of this implementation is verified by my version of PPO as stated in #307.
I didn't check tests and docs, so it won't pass tests. Will fix this tomorrow.

danagi

Is the action of SAC mapped in forward and then mapped again in map_action?

Trinkle23897 · 2021-03-19T14:38:44Z

Is the action of SAC mapped in forward and then mapped again in map_action?

just once, the code in forward is to correct the log_prob for entropy term

danagi

great job

codecov-io · 2021-03-21T08:25:47Z

Codecov Report

Merging #313 (ecff62b) into master (0c7117d) will increase coverage by 0.01%.
The diff coverage is 97.22%.

❗ Current head ecff62b differs from pull request most recent head 60dee40. Consider uploading reports for the commit 60dee40 to get more accurate results

@@            Coverage Diff             @@
##           master     #313      +/-   ##
==========================================
+ Coverage   93.91%   93.92%   +0.01%     
==========================================
  Files          51       51              
  Lines        3270     3278       +8     
==========================================
+ Hits         3071     3079       +8     
  Misses        199      199

Flag	Coverage Δ
unittests	`93.92% <97.22%> (+0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
tianshou/policy/modelfree/a2c.py	`86.20% <ø> (ø)`
tianshou/policy/modelfree/sac.py	`84.88% <91.66%> (-0.84%)`	⬇️
tianshou/data/collector.py	`94.93% <100.00%> (+0.04%)`	⬆️
tianshou/policy/base.py	`79.56% <100.00%> (+2.32%)`	⬆️
tianshou/policy/modelfree/ddpg.py	`98.68% <100.00%> (-0.10%)`	⬇️
tianshou/policy/modelfree/discrete_sac.py	`87.69% <100.00%> (ø)`
tianshou/policy/modelfree/pg.py	`97.36% <100.00%> (ø)`
tianshou/policy/modelfree/ppo.py	`97.59% <100.00%> (+1.07%)`	⬆️
tianshou/policy/modelfree/td3.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 0c7117d...60dee40. Read the comment docs.

Co-authored-by: Trinkle23897 <trinkle23897@gmail.com>

ChenDRAG added 2 commits March 18, 2021 12:33

half

508cc15

remap actions

4d3102d

ChenDRAG requested a review from Trinkle23897 March 18, 2021 12:33

ChenDRAG marked this pull request as draft March 18, 2021 12:37

ChenDRAG and others added 6 commits March 18, 2021 20:40

minor fix

129087c

fix part of tests

5458acc

pep8

b35585d

update

9cf1653

fix bugs

27fcfb8

fix docs

1154f29

Trinkle23897 marked this pull request as ready for review March 19, 2021 13:32

Trinkle23897 requested a review from danagi March 19, 2021 13:33

low high

28b16bc

danagi reviewed Mar 19, 2021

View reviewed changes

danagi previously approved these changes Mar 19, 2021

View reviewed changes

Trinkle23897 linked an issue Mar 20, 2021 that may be closed by this pull request

Buffer should not store remapped action #312

Closed

minor

8d680fe

ChenDRAG dismissed danagi’s stale review via 8d680fe March 21, 2021 08:12

ChenDRAG and others added 3 commits March 21, 2021 16:14

minor

ecff62b

pep8 fix

f08d389

docs

b42b709

Trinkle23897 added 2 commits March 21, 2021 16:26

docs

6293deb

Merge branch 'master' into clip_action

60dee40

Trinkle23897 approved these changes Mar 21, 2021

View reviewed changes

Trinkle23897 merged commit 4d92952 into thu-ml:master Mar 21, 2021

ChenDRAG deleted the clip_action branch March 21, 2021 10:28

Trinkle23897 mentioned this pull request Apr 1, 2021

SAC's loss explode on Hopper-v3 #332

Closed

8 tasks

Trinkle23897 linked an issue Apr 21, 2021 that may be closed by this pull request

Plans of releasing mujoco benchmark of onpolicy algorithms(VPG, A2C, PPO) #307

Closed

BFAnas pushed a commit to BFAnas/tianshou that referenced this pull request May 5, 2024

Remap action to fit gym's action space (thu-ml#313)

626549a

Co-authored-by: Trinkle23897 <trinkle23897@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remap action to fit gym's action space #313

Remap action to fit gym's action space #313

ChenDRAG commented Mar 18, 2021

ChenDRAG commented Mar 18, 2021 •

edited by Trinkle23897

Loading

danagi left a comment

Trinkle23897 commented Mar 19, 2021

danagi left a comment

codecov-io commented Mar 21, 2021 •

edited

Loading

Remap action to fit gym's action space #313

Remap action to fit gym's action space #313

Conversation

ChenDRAG commented Mar 18, 2021

ChenDRAG commented Mar 18, 2021 • edited by Trinkle23897 Loading

danagi left a comment

Choose a reason for hiding this comment

Trinkle23897 commented Mar 19, 2021

danagi left a comment

Choose a reason for hiding this comment

codecov-io commented Mar 21, 2021 • edited Loading

Codecov Report

ChenDRAG commented Mar 18, 2021 •

edited by Trinkle23897

Loading

codecov-io commented Mar 21, 2021 •

edited

Loading