Clarify CleanRL is a non-modular library #200

vwxyzjn · 2022-06-10T23:05:46Z

Description

Closes #197.

Types of changes

Documentation

vercel · 2022-06-10T23:05:48Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Updated
cleanrl	✅ Ready (Inspect)	Visit Preview	Jun 17, 2022 at 4:33PM (UTC)

cool-RR

Nice. Personally I'd put the phrase "reference implementation" into the title of the readme and the "about" of the repo (where it says "High-quality single file implementation".) But it's your decision.

cool-RR · 2022-06-11T07:36:53Z

docs/index.md

-Good luck have fun 🚀
+Good luck have fun :rocket:
+
+⚠️ **NOTE**: CleanRL is *not* a modular library and therefore it is not meant to be imported. At the cost of duplicate code, we make all implementation details of a DRL algorithm variant easy to understand, so CleanRL comes with its own pros and cons. You should consider using CleanRL if you want to 1) understand all implementation details of an algorithm's varaint or 2) do quick prototypes.


varaint -> variant

I'm not sure that "do quick prototypes" makes sense here. Running from clearrl import PPO would be quick. Reading the algorithm and copy-pasting it into my code is slow.

I think "doing prototypes" is not really a well-defined notion as there are many types of prototypes. Being able to do prototypes quickly largely depends on the use case.

While things like from stable_baselines3 import PPO is quick but if you want to prototype advanced features that SB3 does not support, it could be more difficult as discussed in #197 with the invalid action masking example.

Maybe I can clarify as "do prototypes that can't be achieved by just combining components in modular DRL libraries"? I am really unsure what the phrasing would be.

"if you want to prototype advanced features that SB3 does not support" Keep in mind that 95% of people just want something that works, not advanced features. But in any case, this PR is good and I think you should make any changes that result in it being merged.

cool-RR · 2022-06-17T15:08:41Z

Lost interest?

vwxyzjn · 2022-06-17T15:16:54Z

Hey sorry for the delay. This week is my end-of-internship and a move-out week.

Yes, let me think of a place to put the reference implementation. Please let me know what you think of my reply above regarding the prototypes comment.

README.md

Clarify CleanRL is a non-modular library

b0d00df

vwxyzjn requested a review from dosssman June 10, 2022 23:05

cool-RR approved these changes Jun 11, 2022

View reviewed changes

vwxyzjn commented Jun 17, 2022

View reviewed changes

README.md Outdated Show resolved Hide resolved

emphasize reference implementation

b87adc0

vercel bot deployed to Preview June 17, 2022 15:22 View deployment

Quick fix

190b441

vercel bot deployed to Preview June 17, 2022 15:49 View deployment

typo fix

e7a4fab

vercel bot deployed to Preview June 17, 2022 16:33 View deployment

vwxyzjn merged commit 94a685d into master Jun 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarify CleanRL is a non-modular library #200

Clarify CleanRL is a non-modular library #200

vwxyzjn commented Jun 10, 2022

vercel bot commented Jun 10, 2022 •

edited

Loading

cool-RR left a comment

cool-RR Jun 11, 2022

vwxyzjn Jun 17, 2022

cool-RR Jun 17, 2022

cool-RR commented Jun 17, 2022

vwxyzjn commented Jun 17, 2022

Clarify CleanRL is a non-modular library #200

Clarify CleanRL is a non-modular library #200

Conversation

vwxyzjn commented Jun 10, 2022

Description

Types of changes

vercel bot commented Jun 10, 2022 • edited Loading

cool-RR left a comment

Choose a reason for hiding this comment

cool-RR Jun 11, 2022

Choose a reason for hiding this comment

vwxyzjn Jun 17, 2022

Choose a reason for hiding this comment

cool-RR Jun 17, 2022

Choose a reason for hiding this comment

cool-RR commented Jun 17, 2022

vwxyzjn commented Jun 17, 2022

vercel bot commented Jun 10, 2022 •

edited

Loading