multi-actions environment #738

gitlabspy · 2022-09-01T06:29:12Z


Are there any tutorials on wrapping a (multi-action) custom gym env for tianshou?

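Is there an official discussion group? I think setting up a group chat would make it easier for everyone to use the library.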

Trinkle23897 (Maintainer) · 2022-09-02T17:10:46Z

First, I assume your environment's action format is a dict, e.g.,

{"a": [3, 4], "b": 5.0}

The next step is to subclass an existing policy class and override its forward function so that the return value is a Batch containing the desired action format, e.g.,

    def forward(self, ...):
        ...
        return Batch(act=Batch(a=..., b=...), ...)

Feel free to change how a and b are computed; for example, your network's forward can return a and b directly. And that's it.

Note: in the forward function, actions a and b are batched data, i.e., a has shape [bsz, 2] and b has shape [bsz].
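To make the shape convention concrete, here is a minimal NumPy-only sketch of how batched a and b relate to the per-environment dict shown earlier. The helper name split_batched_action is hypothetical and is not part of Tianshou; it only illustrates that the leading axis of each array indexes the environment.

```python
import numpy as np


def split_batched_action(a, b):
    """Turn batched actions into one dict per environment (hypothetical helper)."""
    a = np.asarray(a)
    b = np.asarray(b)
    # a is expected as [bsz, 2] and b as [bsz]; the leading axes must agree.
    if a.ndim != 2 or b.ndim != 1 or a.shape[0] != b.shape[0]:
        raise ValueError(f"unexpected shapes: a={a.shape}, b={b.shape}")
    return [{"a": a[i].tolist(), "b": float(b[i])} for i in range(a.shape[0])]


bsz = 3
a = np.array([[3, 4], [1, 2], [0, 5]])  # shape [bsz, 2]
b = np.array([5.0, 0.5, -1.0])          # shape [bsz]
actions = split_batched_action(a, b)
print(actions[0])  # {'a': [3, 4], 'b': 5.0}
```

Each element of actions has the same layout as the single-environment dict above, so the arrays you place in the nested Batch should keep bsz as their first dimension.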
