[Proposal] Proper initialization of the custom CNNmodel in the RLlib tutorial #1253
Open
1 task done
Labels
enhancement
New feature or request
Proposal
In PettingZoo/tutorials/Ray/rllib_pistonball.py it appears that the custom CNNModel never gets initialized. This issue proposes to either change the config to explicity use the custom CNN model, or remove it entirely if it is indeed the case that it is not being used.
Maybe even better would be to mirror the PettingZoo tutorials from the RLlib repository. Although I must say their own examples also felt quite buggy when experimenting with them and changing parameters, for example switching to another algorithm than PPO caused a lot of issues.
Motivation
In the example outlined below, you can see that a custom CNN model is specified for the action and value networks:
However, after registering the model using ModelCatalog.register_custom_model, it is never used in the initialization of the config object.
Pitch
According to the documentation of the algorithm_config class, custom policy models should be defined by calling the .training(model=[dict]) function within the config. Therefore I recommend the following change in the config creator:
Alternatives
Alternatively we can rewrite the code to just remove the custom CNN model:
Additional context
No response
Checklist
The text was updated successfully, but these errors were encountered: