
restore hidden states added #306

Merged
dapatil211 merged 20 commits into dev from rnn_features on Jun 1, 2023
Conversation

hnekoeiq (Collaborator)

No description provided.

hnekoeiq requested a review from dapatil211 on November 10, 2022
hnekoeiq requested a review from kshitijkg on November 29, 2022
hnekoeiq (Collaborator, Author) commented Dec 1, 2022

This PR adds two features to the DRQN agent (a rough sketch of both follows below):
1. Restoring hidden states from the replay buffer.
2. Burn-in frames to warm up the RNN module.
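
For context, a minimal sketch of how the two features combine in a recurrent Q-learning update. Everything here is illustrative; the names q_values_with_burn_in, stored_hidden, and burn_frames are not the PR's actual API:

import torch
import torch.nn as nn

def q_values_with_burn_in(rnn, obs_seq, stored_hidden, burn_frames):
    # obs_seq: (seq_len, batch, feat); stored_hidden: the (h0, c0) pair that
    # was saved to the replay buffer when the sequence was collected.
    with torch.no_grad():
        # Burn-in: replay the first burn_frames steps only to refresh the
        # hidden state, without contributing gradients.
        _, hidden = rnn(obs_seq[:burn_frames], stored_hidden)
    # Train on the remaining steps, starting from the warmed-up hidden state.
    out, _ = rnn(obs_seq[burn_frames:], hidden)
    return out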

Comment on lines 204 to 218
if self._store_hidden == True:
if self._rnn_type == "lstm":
preprocessed_update_info.update(
{
"hidden_state": self._prev_hidden_state,
"cell_state": self._prev_cell_state,
}
)

elif self._rnn_type == "gru":
preprocessed_update_info.update(
{
"hidden_state": self._prev_hidden_state,
}
)
kshitijkg (Collaborator)

This works too, but it might be easier to adapt this code to other memory-based architectures if we instead store a generic "memory" that can contain any kind of recurrent state (hidden, hidden+cell, TrXL memory, etc.) and have a function that can pack and unpack this memory. That way, the user only has to modify the pack/unpack function when adding a new memory-based architecture.
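
A rough sketch of the suggested interface (hypothetical, just to illustrate the idea; pack_memory and unpack_memory are not functions in the repository):

def pack_memory(rnn_type, hidden_state, cell_state=None):
    # Pack architecture-specific recurrent state into one opaque dict that the
    # agent can store in the replay buffer without knowing its contents.
    if rnn_type == "lstm":
        return {"hidden_state": hidden_state, "cell_state": cell_state}
    elif rnn_type == "gru":
        return {"hidden_state": hidden_state}
    raise ValueError(f"Unsupported rnn_type: {rnn_type}")

def unpack_memory(rnn_type, memory):
    # Inverse of pack_memory: recover the state in the form the module expects
    # ((h, c) for LSTM, h alone for GRU).
    if rnn_type == "lstm":
        return memory["hidden_state"], memory["cell_state"]
    elif rnn_type == "gru":
        return memory["hidden_state"]
    raise ValueError(f"Unsupported rnn_type: {rnn_type}")

With this, the branch above collapses to a single call, and supporting a new memory-based architecture only means extending the two functions:

preprocessed_update_info.update(
    pack_memory(self._rnn_type, self._prev_hidden_state, self._prev_cell_state)
)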

hnekoeiq (Collaborator, Author) commented Jan 6, 2023

@kshitijkg With the most recent commit, the update and act functions are almost independent of the type of memory (I'm still calling it hidden_state because it is a DRQN agent, but we can replace everything with memory). Could you please review it again?

hnekoeiq requested a review from mrsamsami on February 21, 2023
mask[self._burn_frames :] = 1.0
mask = mask.view(1, -1)
interm_loss *= mask
loss = interm_loss.mean()


Isn't the correct way of doing it interm_loss.sum() / mask.sum()?

hnekoeiq (Collaborator, Author)

That's true. This should be fixed after merging #339.
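
For reference, a standalone sketch of the difference between the two normalizations (toy shapes and values, not the repository's code):

import torch

seq_len, burn_frames = 10, 4
interm_loss = torch.rand(1, seq_len)  # per-step losses, illustrative values

mask = torch.zeros(seq_len)
mask[burn_frames:] = 1.0
mask = mask.view(1, -1)
interm_loss = interm_loss * mask

# .mean() divides by seq_len, so the loss scale shrinks as burn_frames grows:
biased = interm_loss.mean()

# Dividing by mask.sum() averages only over the steps that were kept:
corrected = interm_loss.sum() / mask.sum()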

if self._store_hidden == True:
hidden_state = (
torch.tensor(
batch["hidden_state"][:, 0].squeeze(1).squeeze(1).unsqueeze(0),


I'd suggest using view() or reshape() to potentially make the code cleaner.
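
For instance, assuming the stored hidden states have shape (batch, 1, 1, hidden_size) (an assumption for illustration; the real layout may differ), the chain of squeeze/unsqueeze calls is equivalent to a single reshape:

import torch

batch_size, hidden_size = 32, 256
stored = torch.rand(batch_size, 1, 1, hidden_size)  # assumed layout

h1 = stored.squeeze(1).squeeze(1).unsqueeze(0)  # chained version
# Single reshape to the (num_layers, batch, hidden) layout nn.LSTM expects:
h2 = stored.reshape(1, batch_size, hidden_size)

assert torch.equal(h1, h2)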

hnekoeiq and others added 15 commits May 17, 2023 18:07
* first version. testing.

* pylint

* pylint

* pylint

* pylint

* pylint

* test fix

* pylint

* pylint

* resolve discussion

---------

Co-authored-by: artem.zholus <artem.zholus@login-3.server.mila.quebec>
Co-authored-by: Darshan Patil <dapatil211@gmail.com>
* Added option to initialize separate components differently

* Made minor fixes

* Fixed init and registration

* Fix term trunc (#336) (#341)

* Fixed issues with moving from done to terminated, truncated

* Undo change to logging scales

* Revert changes to this file, SAC agents don't exist yet

* Clean up test file

Co-authored-by: Darshan Patil <dapatil211@gmail.com>

---------

Co-authored-by: Darshan Patil <dapatil211@gmail.com>
dapatil211 merged commit 09a21d2 into dev on Jun 1, 2023
dapatil211 deleted the rnn_features branch on June 1, 2023 16:49