support for deterministic inference in onnx #5593

maryamhonari · 2021-10-22T18:36:03Z

Proposed change(s)

Serialize two additional output tensors to the ONNX model for deterministic actions:

"deterministic_continuous_actions"
"deterministic_discrete_actions"

ℹ️ Changes to com.unity.ml-agents are in a separate PR.
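As a rough illustration of what the two extra outputs might carry, here is a minimal pure-Python sketch. It assumes the deterministic continuous action is the Gaussian mean and the deterministic discrete action is the per-branch argmax. That rule, the helper name `policy_outputs`, and the stochastic output names `continuous_actions` and `discrete_actions` are assumptions for illustration, not taken from this PR. Plain lists stand in for the torch tensors that the real `actor.forward` returns.

```python
import math
import random


def softmax(logits):
    """Convert raw logits to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_outputs(mean, std, branch_logits, rng):
    """Return stochastic and deterministic actions side by side.

    mean, std: per-dimension Gaussian parameters for continuous actions.
    branch_logits: one list of logits per discrete action branch.
    rng: a random.Random instance used only for the stochastic outputs.
    """
    # Stochastic actions: draw from the policy distributions.
    continuous = [rng.gauss(m, s) for m, s in zip(mean, std)]
    discrete = [
        rng.choices(range(len(logits)), weights=softmax(logits), k=1)[0]
        for logits in branch_logits
    ]
    # Deterministic actions (assumed rule): Gaussian mean, per-branch argmax.
    det_continuous = list(mean)
    det_discrete = [
        max(range(len(logits)), key=logits.__getitem__)
        for logits in branch_logits
    ]
    return {
        "continuous_actions": continuous,  # hypothetical output name
        "discrete_actions": discrete,  # hypothetical output name
        "deterministic_continuous_actions": det_continuous,
        "deterministic_discrete_actions": det_discrete,
    }
```

Calling `policy_outputs([0.5, -1.0], [0.2, 0.1], [[0.1, 2.0, -1.0], [3.0, 0.0]], random.Random(0))` gives the same two deterministic entries for any seed, while the stochastic entries vary with the seed. Emitting both sets of outputs lets the consumer choose between them at inference time without re-exporting the model.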

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

miguelalonsojr

Can you add tests to cover these changes?

ml-agents/mlagents/trainers/torch/action_model.py

ml-agents/mlagents/trainers/torch/distributions.py

ml-agents/mlagents/trainers/torch/networks.py

* Init: actor.forward outputs separate deterministic actions
* changelog
* Renaming
* Add more tests

* Progress on propagating the setting to the action model.
* Added the _sample_action logic and tests.
* Add information to the changelog.
* Prioritize the CLI over the configuration file.
* Update documentation for config file.
* CR refactor.
* Update docs/Training-Configuration-File.md
  Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
  Update com.unity.ml-agents/CHANGELOG.md
  Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
  Update com.unity.ml-agents/CHANGELOG.md
  Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
  Update com.unity.ml-agents/CHANGELOG.md
  Co-authored-by: Maryam Honari <honari.m94@gmail.com>
  Update ml-agents/mlagents/trainers/settings.py
  Co-authored-by: Maryam Honari <honari.m94@gmail.com>
  Update ml-agents/mlagents/trainers/cli_utils.py
  Co-authored-by: Maryam Honari <honari.m94@gmail.com>
* Fix CR requests
* Add tests for discrete.
* Update ml-agents/mlagents/trainers/torch/distributions.py
  Co-authored-by: Maryam Honari <honari.m94@gmail.com>
* Added more stable test.
* Return deterministic actions for training (#5615)
* Added more stable test.
* Fix the tests.
* Fix pre-commit
* Fix help line to pass precommit.
* support for deterministic inference in onnx (#5593)
* Init: actor.forward outputs separate deterministic actions
* changelog
* Renaming
* Add more tests
* Package changes to support deterministic inference (#5599)
* Init: actor.forward outputs separate deterministic actions
* fix tensor shape for discrete actions
* Add test and editor flag
  - Add tests for deterministic sampling
  - update editor and tooltips
* Reverting to "Deterministic Inference"
* dissect tests
* Update docs
* Update CHANGELOG.md

Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>

* Progress on propagating the setting to the action model.
* Added the _sample_action logic and tests.
* Add information to the changelog.
* Prioritize the CLI over the configuration file.
* Update documentation for config file.
* CR refactor.
* Update docs/Training-Configuration-File.md
  Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
  Update com.unity.ml-agents/CHANGELOG.md
  Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
  Update com.unity.ml-agents/CHANGELOG.md
  Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
  Update com.unity.ml-agents/CHANGELOG.md
  Co-authored-by: Maryam Honari <honari.m94@gmail.com>
  Update ml-agents/mlagents/trainers/settings.py
  Co-authored-by: Maryam Honari <honari.m94@gmail.com>
  Update ml-agents/mlagents/trainers/cli_utils.py
  Co-authored-by: Maryam Honari <honari.m94@gmail.com>
* Fix CR requests
* Add tests for discrete.
* Update ml-agents/mlagents/trainers/torch/distributions.py
  Co-authored-by: Maryam Honari <honari.m94@gmail.com>
* Added more stable test.
* Return deterministic actions for training (#5615)
* Added more stable test.
* Fix the tests.
* Fix pre-commit
* Fix help line to pass precommit.
* support for deterministic inference in onnx (#5593)
* Init: actor.forward outputs separate deterministic actions
* changelog
* Renaming
* Add more tests
* Package changes to support deterministic inference (#5599)
* Init: actor.forward outputs separate deterministic actions
* fix tensor shape for discrete actions
* Add test and editor flag
  - Add tests for deterministic sampling
  - update editor and tooltips
* Reverting to "Deterministic Inference"
* dissect tests
* Update docs
* Update CHANGELOG.md
* Fix the deterministic showing up all the time (#5621)

Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>

Init: actor.forward outputs separate deterministic actions

b340c85

maryamhonari changed the title ~~Init: actor.forward outputs separate deterministic actions~~ support for deterministic inference in onnx Oct 22, 2021

maryamhonari changed the base branch from main to develop-staging-determinstic-action October 25, 2021 21:12

cmard force-pushed the develop-staging-determinstic-action branch from 9160d85 to 7e7c3e2 October 27, 2021 13:54

maryamhonari added 2 commits October 28, 2021 12:30

fix tensor shape for discrete actions

07c11d8

clean up

b047e5d

maryamhonari changed the base branch from develop-staging-determinstic-action to deterministic-actions-python-training October 28, 2021 19:43

changelog

085e56e

maryamhonari marked this pull request as ready for review October 28, 2021 19:51

maryamhonari requested review from andrewcoh, cmard and miguelalonsojr October 28, 2021 21:53

miguelalonsojr requested changes Nov 2, 2021

Renaming

fb7849f

maryamhonari force-pushed the develop-deterministic-policy-editor branch from 20fce24 to fb7849f November 2, 2021 23:30

Add more tests

afa4d83

maryamhonari requested a review from miguelalonsojr November 3, 2021 16:24

miguelalonsojr approved these changes Nov 10, 2021

cmard approved these changes Nov 15, 2021

cmard force-pushed the deterministic-actions-python-training branch from bf15d2e to 604d7c1 November 15, 2021 23:06

merge feature branch

b989a09

maryamhonari merged commit 0f5cd2b into deterministic-actions-python-training Nov 16, 2021

delete-merged-branch bot deleted the develop-deterministic-policy-editor branch November 16, 2021 22:17

maryamhonari added a commit that referenced this pull request Nov 18, 2021

support for deterministic inference in onnx (#5593)

e984488

* Init: actor.forward outputs separate deterministic actions
* changelog
* Renaming
* Add more tests

maryamhonari added a commit that referenced this pull request Nov 18, 2021

support for deterministic inference in onnx (#5593)

13e9a88

* Init: actor.forward outputs separate deterministic actions
* changelog
* Renaming
* Add more tests

maryamhonari mentioned this pull request Nov 18, 2021

Deterministic actions python training #5619

Merged

10 tasks

maryamhonari mentioned this pull request Nov 30, 2021

Deterministic actions python training #5626

Merged

10 tasks

github-actions bot locked as resolved and limited conversation to collaborators Nov 17, 2022
