Make the Agent reset immediately after Done #3291

vincentpierre · 2020-01-25T00:39:41Z

No description provided.

chriselion · 2020-01-25T00:51:09Z

UnitySDK/Assets/ML-Agents/Scripts/Agent.cs

+            _AgentReset();
+            m_RequestAction = false;
+            m_RequestDecision = false;
+            m_Reward = 0f;
+            m_CumulativeReward = 0f;


Feels like this could be moved into NotifyAgentDone() (or maybe combine Done and NotifyAgentDone, unless you don't want to the user to set maxStepReached)

Moved some things around

UnitySDK/Assets/ML-Agents/Scripts/Agent.cs

chriselion · 2020-01-27T20:13:32Z

UnitySDK/Assets/ML-Agents/Scripts/Agent.cs

@@ -44,7 +44,7 @@ public struct AgentInfo
        /// Unique identifier each agent receives at initialization. It is used
        /// to separate between different agents in the environment.
        /// </summary>
-        public int id;
+        public int episodeId;


Update this comment too.

ervteng · 2020-01-27T21:41:33Z

ml-agents/mlagents/trainers/agent_processor.py

@@ -144,7 +144,7 @@ def add_experiences(
                    )
                    for traj_queue in self.trajectory_queues:
                        traj_queue.put(trajectory)
-                    self.experience_buffers[global_id] = []
+                    del self.experience_buffers[global_id]


Need to del last_step_result and last_take_action_outputs as well. Probably can do it right after the experience buffer del

hmmm, unfortunately, it seems last_step_result as well as policy.previous_actions are modified after the check for done. (So even if I delete them, they will be re-added). I need to do more experiments...

vincentpierre · 2020-01-27T23:11:40Z

ml-agents/mlagents/trainers/agent_processor.py

+        for terminated_id in terminated_agents:
+            self._clean_agent_data(terminated_id)
+
+    def _clean_agent_data(self, global_id: str) -> None:


@ervteng Tell me what you think

ervteng

Looks good - we'll keep an eye out for mem leaks

vincentpierre added 5 commits January 24, 2020 11:28

Made the Agent reset immediately

e1385f7

fixing the C# tests

043819a

Fixing the tests still

90cfbb2

Trying with incremental episode ids

43a1185

deleting buffer rather than using an empty list

adcffb0

vincentpierre requested review from surfnerd, ervteng and chriselion January 25, 2020 00:39

vincentpierre self-assigned this Jan 25, 2020

chriselion reviewed Jan 25, 2020

View reviewed changes

UnitySDK/Assets/ML-Agents/Scripts/Agent.cs Show resolved Hide resolved

chriselion reviewed Jan 25, 2020

View reviewed changes

UnitySDK/Assets/ML-Agents/Scripts/Agent.cs Show resolved Hide resolved

vincentpierre added 2 commits January 24, 2020 17:06

Addressing the comments

a930143

Merge branch 'master' into develop-trimdone

de0303c

chriselion reviewed Jan 27, 2020

View reviewed changes

chriselion approved these changes Jan 27, 2020

View reviewed changes

vincentpierre added 2 commits January 27, 2020 13:25

Forgot to edit the comment on AgentInfo

98dc22b

Updating the migrating doc

25b7483

surfnerd approved these changes Jan 27, 2020

View reviewed changes

ervteng reviewed Jan 27, 2020

View reviewed changes

vincentpierre added 2 commits January 27, 2020 14:11

Fixed an obvious bug

b88f7ae

cleaning after an agent is done in agent processor

b094180

vincentpierre commented Jan 27, 2020

View reviewed changes

Fixing the pytest errors

aecb9e2

ervteng approved these changes Jan 28, 2020

View reviewed changes

vincentpierre merged commit d06ac2c into master Jan 28, 2020

delete-merged-branch bot deleted the develop-trimdone branch January 28, 2020 00:16

github-actions bot locked as resolved and limited conversation to collaborators May 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make the Agent reset immediately after Done #3291

Make the Agent reset immediately after Done #3291

Uh oh!

vincentpierre commented Jan 25, 2020

Uh oh!

chriselion Jan 25, 2020

Uh oh!

vincentpierre Jan 25, 2020

Uh oh!

Uh oh!

Uh oh!

chriselion Jan 27, 2020

Uh oh!

ervteng Jan 27, 2020

Uh oh!

vincentpierre Jan 27, 2020

Uh oh!

vincentpierre Jan 27, 2020

Uh oh!

ervteng left a comment

Uh oh!

Uh oh!

Make the Agent reset immediately after Done #3291

Make the Agent reset immediately after Done #3291

Uh oh!

Conversation

vincentpierre commented Jan 25, 2020

Uh oh!

chriselion Jan 25, 2020

Choose a reason for hiding this comment

Uh oh!

vincentpierre Jan 25, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

chriselion Jan 27, 2020

Choose a reason for hiding this comment

Uh oh!

ervteng Jan 27, 2020

Choose a reason for hiding this comment

Uh oh!

vincentpierre Jan 27, 2020

Choose a reason for hiding this comment

Uh oh!

vincentpierre Jan 27, 2020

Choose a reason for hiding this comment

Uh oh!

ervteng left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!