Update v2-staging from main (March 15) #5123

chriselion · 2021-03-16T01:42:22Z

Proposed change(s)

Merge main into v2-staging. This still needs a conflict fix from @vincentpierre.

* Make buffer type-agnostic * Edit types of Apped method * Change comment * Collaborative walljump * Make collab env harder * Add group ID * Add collab obs to trajectory * Fix bug; add critic_obs to buffer * Set group ids for some envs * Pretty broken * Less broken PPO * Update SAC, fix PPO batching * Fix SAC interrupted condition and typing * Fix SAC interrupted again * Remove erroneous file * Fix multiple obs * Update curiosity reward provider * Update GAIL and BC * Multi-input network * Some minor tweaks but still broken * Get next critic observations into value estimate * Temporarily disable exporting * Use Vince's ONNX export code * Cleanup * Add walljump collab YAML * Lower max height * Update prefab * Update prefab * Collaborative Hallway * Set num teammates to 2 * Add config and group ids to HallwayCollab * Fix bug with hallway collab * Edits to HallwayCollab * Update onnx file meta * Make the env easier * Remove prints * Make Collab env harder * Fix group ID * Add cc to ghost trainer * Add comment to ghost trainer * Revert "Add comment to ghost trainer" This reverts commit 292b6ce. * Actually add comment to ghosttrainer * Scale size of CC network * Scale value network based on num agents * Add 3rd symbol to hallway collab * Make comms one-hot * Fix S tag * Additional changes * Some more fixes * Self-attention Centralized Critic * separate entity encoder and RSA * clean up args in mha * more cleanups * fixed tests * entity embeddings work with no max Integrate into CC * remove group id * very rough sketch for TeamManager interface * One layer for entity embed * Use 4 heads * add defaults to linear encoder, initialize ent encoders * add team manager id to proto * team manager for hallway * add manager to hallway * send and process team manager id * remove print * small cleanup * default behavior for baseTeamManager * add back statsrecorder * update * Team manager prototype (#4850) * remove group id * very rough sketch for TeamManager interface * add team manager id to proto * team manager for hallway * add manager to hallway * send and process team manager id * remove print * small cleanup Co-authored-by: Chris Elion <chris.elion@unity3d.com> * Remove statsrecorder * Fix AgentProcessor for TeamManager Should work for variable decision frequencies (untested) * team manager * New buffer layout, TeamObsUtil, pad dead agents * Use NaNs to get masks for attention * Add team reward to buffer * Try subtract marginalized value * Add Q function with attention * Some more progress - still broken * use singular entity embedding (#4873) * I think it's running * Actions added but untested * Fix issue with team_actions * Add next action and next team obs * separate forward into q_net and baseline * might be right * forcing this to work * buffer error * COMAA runs * add lambda return and target network * no target net * remove normalize advantages * add target network back * value estimator * update coma config * add target net * no target, increase lambda * remove prints * cloud config * use v return * use target net * adding zombie to coma2 brnch * add callbacks * cloud run with coma2 of held out zombie test env * target of baseline is returns_v * remove target update * Add team dones * ntegrate teammate dones * add value clipping * try again on cloud * clipping values and updated zombie * update configs * remove value head clipping * update zombie config * Add trust region to COMA updates * Remove Q-net for perf * Weight decay, regularizaton loss * Use same network * add base team manager * Remove reg loss, still stable * Black format * add team reward field to agent and proto * set team reward * add maxstep to teammanager and hook to academy * check agent by agent.enabled * remove manager from academy when dispose * move manager * put team reward in decision steps * use 0 as default manager id * fix setTeamReward Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * change method name to GetRegisteredAgents * address comments * Revert C# env changes * Remove a bunch of stuff from envs * Remove a bunch of extra files * Remove changes from base-teammanager * Remove remaining files * Remove some unneeded changes * Make buffer typing neater * AgentProcessor fixes * Back out trainer changes * use delegate to avoid agent-manager cyclic reference * put team reward in decision steps * fix unregister agents * add teamreward to decision step * typo * unregister on disabled * remove OnTeamEpisodeBegin * change name TeamManager to MultiAgentGroup * more team -> group * fix tests * fix tests * Use attention tests from master * Revert "Use attention tests from master" This reverts commit 78e052b. * Use attention from master * Renaming fest * Use NamedTuples instead of attrs classes * Bug fixes * remove GroupMaxStep * add some doc * Fix mock brain * np float32 fixes * more renaming * Test for team obs in agentprocessor * Test for group and add team reward * doc improve Co-authored-by: Ervin T. <ervin@unity3d.com> * store registered agents in set * remove unused step counts * Global group ids * Fix Trajectory test * Remove duplicated files * Add team methods to AgentAction * Buffer fixes (cherry picked from commit 2c03d2b) * Add test for GroupObs * Change AgentAction back to 0 pad and add tests * Addressed some comments * Address some comments * Add more comments * Rename internal function * Move padding method to AgentBufferField * Fix slicing typing and string printing in AgentBufferField * Fix to-flat and add tests * Rename GroupmateStatus to AgentStatus * Update comments * Added GroupId, GlobalGroupId, GlobalAgentId types * Update comment * Make some agent processor properties internal * Rename add_group_status * Rename store_group_status, fix test * Rename clear_group_obs Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>

* Removing some scenes, All the Static and all the non variable speed environments. Also removed Bouncer, PushBlock, WallJump and reacher. Removed a bunch of visual environements as well. Removed 3DBallHard and FoodCollector (kept Visual and Grid FoodCollector) * readding 3DBallHard * readding pushblock and walljump * Removing tennis * removing mentions of removed environments * removing unused images * Renaming Crawler demos * renaming some demo files * removing and modifying some config files * new examples image? * removing Bouncer from build list * replacing the Bouncer environment with Match3 for llapi tests * Typo in yamato test

…5041)

…5041) (#5043) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>

* Fix typo * Add test

* Fix padding for List entries in buffer * Revert to coonverting to np.array * Fix dtype in PPO trainer

* Detach memory before storing * Add test * Evaluate with no_grad

…nes. (#5052)

…-main Release 14 branch to main

* Fix save/restore critic, add test * Rename module for PPO * Use correct policy in test

…json files in our examples. (#5077)

…back (#5091) Co-authored-by: Chris Elion <chris.elion@unity3d.com>

Co-authored-by: Ervin Teng <ervin@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>

Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>

* Add pushblock collab * Make SimpleMultiAgentGroup public * Remove GoalDetectTrigger * Remove GDT meta file * Remove some comments * Add training configuration * Rename behavior * Add to docs * Change the reward structure in docs * Add back GoalDetectTrigger Co-authored-by: HH <brandonh@unity3d.com>

* Add multiAgentGroup capabilities flag * Add proto * Fix compiler error * Add warning for multiagent group * Add comment * Fix spelling mistake

* use get step to determine curriculum * add to CHANGELOG * Make step in trainer private (#5099) Co-authored-by: Ervin T <ervin@unity3d.com>

Increment versions after release 15 branch split

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

…5113) * Fix end episode for POCA, add warning for group reward if not POCA * Add missing imports

Chris Elion and others added 30 commits March 3, 2021 15:56

Update cattrs dependencies to support python3.9 (#4821)

a5b324a

Fix issue with queuing input events that stomp on others. (#5034)

3fb97b4

Update cattrs dependencies to support python3.9 (#4821)

541916c

Fix issue with queuing input events that stomp on others. (#5034)

b6e8469

renaming of behavior name for imitation crawler (#5039)

686518f

[Fix][Documentation] Replace vis_encodeR_type with vis_encode_type (#…

967d917

…5041)

Update readme. (#5042)

2c1acd4

Update versions for release 14 hotfix. (#5040)

eac07df

[Fix][Documentation] Replace vis_encodeR_type with vis_encode_type (#…

9dd0203

…5041) (#5043) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>

[bug-fix] Fix typo (#5035)

9e3cd91

* Fix typo * Add test

master -> main. (#5010) (#5044)

40c34a9

Update changelog. (#5045)

35e7a27

[bug-fix] Fix padding for List entries in buffer (#5046)

5e87e2c

* Fix padding for List entries in buffer * Revert to coonverting to np.array * Fix dtype in PPO trainer

[bug-fix] Fix memory leak when using LSTMs (#5048)

347852b

* Detach memory before storing * Add test * Evaluate with no_grad

[MLA-1809] catch mismatched observation sizes (#5030)

d37a2af

Enable the exporting of unitypackage files from a list of curated sce…

edcf2df

…nes. (#5052)

Update changelog. (#5055)

7862a29

Fix xml docs. (#5057)

5b8cbd2

Merge branch 'main' into release_14_branch-to-main

a0265be

Update readme table for 1.0.7 verified release (#5059)

768b7c3

Merge branch 'main' into release_14_branch-to-main

6add502

Merge pull request #5061 from Unity-Technologies/release_14_branch-to…

d0b3e99

…-main Release 14 branch to main

add release tag to git packman command (#5063)

60497c5

Adding model overrider to match3 prefabs (#5067)

4b91f48

pass sensor name through to ObservationSpec (#5036)

dd6575d

[bug-fix] Fix save/restore critic, add test (#5062)

78cb833

* Fix save/restore critic, add test * Rename module for PPO * Use correct policy in test

Remove unused allocation (#5068)

22a45cc

[MLA-1731] update pip instructions (#5074)

d18d5ff

Musubee and others added 23 commits March 10, 2021 16:28

Updated incomplete sentence in installation docs. (#5078)

eced65c

Automatically generate samples based on placement of mlagents-sample.…

1158df0

…json files in our examples. (#5077)

Upgrade PyTorch version for python 3.9 (#5028)

65d1983

Update barracuda to 1.3.2-preview. (#5084)

df9ecde

disable dotnet format temporarily (#5088)

1369b76

Update install docs to mention samples and link to the forum for feed…

65971da

…back (#5091) Co-authored-by: Chris Elion <chris.elion@unity3d.com>

POCA trainer (#5005)

d63a9d7

Co-authored-by: Ervin Teng <ervin@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>

[docs] Documentation for POCA and cooperative behaviors (#5056)

847d723

Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>

[docs] Update changelog (#5095)

f67ed30

Add multiAgentGroup capabilities flag (#5096)

192b5e6

* Add multiAgentGroup capabilities flag * Add proto * Fix compiler error * Add warning for multiagent group * Add comment * Fix spelling mistake

Move PushBlockCollab config to poca directory (#5097)

8545a0d

Fix ghost curriculum and make steps private (#5098)

f200b90

* use get step to determine curriculum * add to CHANGELOG * Make step in trainer private (#5099) Co-authored-by: Ervin T <ervin@unity3d.com>

Update changelog for samples. (#5103)

12afe95

Update versions on main (#5102)

dae6195

Increment versions after release 15 branch split

Fix for validate release links (#5100)

2073807

Integrate Group Manager to soccer/retrain with POCA (#5115)

450e522

Pragma warnings into the void. (#5117)

02c5b9a

Make analytics module an optional dependency. (#5109)

1b0577e

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

Fix end episode for POCA, add warning for group reward if not POCA (#…

f7d6dc3

…5113) * Fix end episode for POCA, add warning for group reward if not POCA * Add missing imports

Redo dotnet format (#5119)

ff3d608

Merge remote-tracking branch 'origin/main' into v2s-update-main

c7b5812

Resolving conflicts between vs-staging and main (#5125)

5affb3c

vincentpierre approved these changes Mar 16, 2021

View reviewed changes

chriselion merged commit 9093939 into v2-staging Mar 16, 2021

delete-merged-branch bot deleted the v2s-update-main branch March 16, 2021 18:18

surfnerd pushed a commit that referenced this pull request Mar 18, 2021

Update v2-staging from main (March 15) (#5123)

9a4a4bf

surfnerd pushed a commit that referenced this pull request Mar 18, 2021

Update v2-staging from main (March 15) (#5123)

e88f2a5

github-actions bot locked as resolved and limited conversation to collaborators Mar 16, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update v2-staging from main (March 15) #5123

Update v2-staging from main (March 15) #5123

Uh oh!

chriselion commented Mar 16, 2021

Uh oh!

Uh oh!

Update v2-staging from main (March 15) #5123

Update v2-staging from main (March 15) #5123

Uh oh!

Conversation

chriselion commented Mar 16, 2021

Proposed change(s)

Uh oh!

Uh oh!