
Implement EnsembleModule #1359

Closed
wants to merge 10 commits into from

Conversation

@smorad (Contributor) commented Jul 5, 2023

Description

This adds support for module ensembles once pytorch/tensordict#478 lands. cc @vmoens, @matteobettini, @Acciorocketships.

Motivation and Context

This is necessary for implementing REDQ and various other Q learning algorithms that use ensembles at collection time. See #1344 for more.
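The ensemble-at-collection-time pattern this PR targets can be sketched with plain `torch.func` primitives (this is the generic PyTorch ensembling recipe, not the PR's `EnsembleModule` API; the network sizes and names below are illustrative assumptions):

```python
import torch
from torch import nn
from torch.func import functional_call, stack_module_state

def make_qnet():
    # Toy Q-network: 4-dim observation -> 2 action values (sizes assumed)
    return nn.Sequential(nn.Linear(4, 32), nn.ReLU(), nn.Linear(32, 2))

# Stack the parameters of N independent copies along a new leading dim
models = [make_qnet() for _ in range(3)]
params, buffers = stack_module_state(models)

# A "meta" skeleton supplies the architecture; weights come from `params`
base = make_qnet().to("meta")

def forward_one(p, b, x):
    return functional_call(base, (p, b), (x,))

x = torch.randn(8, 4)  # batch of 8 observations, shared across members
q_values = torch.vmap(forward_one, in_dims=(0, 0, None))(params, buffers, x)
print(q_values.shape)  # torch.Size([3, 8, 2]): (ensemble, batch, actions)
```

`EnsembleModule` builds essentially this vmap-over-stacked-parameters idea on top of a TensorDict of parameters, which is why the module is functional and needs its parameters passed in explicitly.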

  • I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce?

  • New feature (non-breaking change which adds core functionality)

Checklist

  • I have read the CONTRIBUTION guide (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.

@facebook-github-bot added the CLA Signed label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Jul 5, 2023
@smorad (Contributor, Author) commented Jul 10, 2023

I've rebased this upon the latest reset_parameters_recursive commit. The parameter tensordict approach to reset_parameters_recursive really simplifies this module, thanks for the tips!

@vmoens (Contributor) left a comment:

Superb, I love this feature!
It is meant to be part of RL, but there could be a usage for this in tensordict, wdyt?

Note to self: this should be integrated more broadly in the losses to alleviate the burden of expanding params for multiple Q-value nets.

@@ -0,0 +1,102 @@
import torch
from tensordict import TensorDict, TensorDictBase
Review comment (Contributor): missing headers

from torch import nn


class EnsembleModule(TensorDictModuleBase):
Review comment (Contributor): Should be added manually to the docs under docs/source/reference/modules.rst

torchrl/modules/tensordict_module/ensemble.py: further review threads (resolved)
Comment on lines 92 to 94
assert (
TensorDictBase is not None
), "Ensembles are functional and require passing a TensorDict of parameters to reset_parameters_recursive"
@vmoens (Contributor):
I guess we want to check that parameters are not None?
The function will break before this since we don't have a default value.

Other comment: We don't use assert in the lib, only in the tests. In this case, a TypeError or a ValueError would be appropriate

@smorad (Contributor, Author):
Oops, yes, it should be parameters, and this is indeed unreachable code. I could make the default parameters=None here, then raise an exception if parameters is None, but that seems confusing.

All I wanted was a descriptive error message because it will not be clear to the user why my_ensemble.reset_parameters_recursive() is failing (they need to explicitly pass in params).

@vmoens (Contributor):
I agree, let's make the default None and raise the ValueError if it is the default
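A minimal sketch of the fix agreed above (default `parameters=None`, raise a `ValueError` instead of using `assert`); the class name and error message follow the discussion, but the body is elided since the merged code may differ:

```python
class EnsembleModule:
    """Stripped-down stand-in for the PR's module (illustrative only)."""

    def reset_parameters_recursive(self, parameters=None):
        # Functional module: callers must supply a stacked params TensorDict.
        if parameters is None:
            raise ValueError(
                "Ensembles are functional and require passing a TensorDict "
                "of parameters to reset_parameters_recursive"
            )
        # ... re-initialize each ensemble member from `parameters` ...
        return parameters

# The explicit ValueError gives the descriptive failure mode smorad wanted:
try:
    EnsembleModule().reset_parameters_recursive()
except ValueError as err:
    print(f"raised: {err}")
```

Unlike the original `assert`, this check cannot be stripped by `python -O` and names the exact argument the caller forgot.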

"""Resets the parameters of all the copies of the module.

Args:
stacked_params_td: A TensorDict of parameters for self.module. The batch dimension(s) of the tensordict
Suggested change (Contributor):
stacked_params_td: A TensorDict of parameters for self.module. The batch dimension(s) of the tensordict
parameters: A TensorDict of parameters for self.module. The batch dimension(s) of the tensordict

@smorad (Contributor, Author) commented Jul 10, 2023

Superb I love this feature! It is meant to be part of RL, but there could be a usage for this in tensordict, wdyt?

Are you suggesting to move EnsembleModule into tensordict.nn? Yeah, I suppose that makes sense. Let's get it to the point where you are happy with it here, then I will abort this PR and move it to tensordict.nn.

smorad and others added 3 commits July 10, 2023 17:42
Co-authored-by: Vincent Moens <vincentmoens@gmail.com>
Co-authored-by: Vincent Moens <vincentmoens@gmail.com>
@smorad (Contributor, Author) commented Jul 11, 2023

Closing in favor of pytorch/tensordict#485

@smorad smorad closed this Jul 11, 2023