[WIP] Make adding new Policy Models flexible #327

engmubarak48 · 2024-06-18T19:07:35Z

Fixes #293

This PR tries to make adding new function approximations (policy models) as flexible as possible.

Modification of the base.py file to dynamically import and instantiate policy models based on configuration
Allowing the addition of new model types (e.g., CNN, GNN, Transformer) without altering the core logic.
Implementation of a clean interface in the ModelBase class

josephdviviano

lgtm - a comment, but non blocking.

gflownet/policy/mlp.py

alexhernandezgarcia · 2024-06-25T09:40:07Z

Great that you're taking a stab at this, which is one of the important things that remain to be done! Looks good so far!

…cnn policy

engmubarak48 · 2024-06-28T04:24:11Z

gflownet/envs/tetris.py

+        if self.flatten:
+            return self.states2proxy(states).flatten(start_dim=1).to(self.float)
+        return self.states2proxy(states).to(self.float)


@alexhernandezgarcia This is a temporary solution to make the CNN policy work on Tetris env. But normally the flattening should happen inside the model but not in the environment (see my other comments)

if you are okay with that, then I can update.

config/env/tetris.yaml

engmubarak48 · 2024-06-28T04:25:14Z

gflownet/envs/tetris.py

@@ -75,6 +75,7 @@ def __init__(
        height: int = 20,
        pieces: List = ["I", "J", "L", "O", "S", "T", "Z"],
        rotations: List = [0, 90, 180, 270],
+        flatten: bool = True,


If we move the flattening from the environment to the policy, then we don't need this.

gflownet/policy/mlp.py

engmubarak48 · 2024-06-28T04:31:31Z

I think any policy could be added now.. i have tested tetris on CNN and everything seems to work well with minimal changes.. I think next thing we could do is have another PR which better documents this and show how one could simply change the Policy etc.

test with python main.py env=tetris proxy=tetris policy=cnn env.flatten=False env.width=4 env.height=4
@alexhernandezgarcia feel free to review..

gflownet/policy/cnn.py

gflownet/policy/base.py

AlexandraVolokhova · 2024-07-03T17:53:12Z

Thank you for the great work! I've added a bunch of suggestions. It seems to me that it would be worth to add a config file for a case when one wants to create a uniform or a random policy, but I don't have a strong opinion about it.

gflownet/policy/base.py

engmubarak48 · 2024-07-09T21:56:00Z

Hi @AlexandraVolokhova just got time, let me address your comments...

Regarding:

It seems to me that it would be worth to add a config file for a case when one wants to create a uniform or a random policy, but I don't have a strong opinion about it.

I think I agree with you, maybe it would have been nice to have a separate config for fixed and uniform policies (I am hoping you are referring to the two functions inside the base policy class). To be honest I don't even still know why we need them.. We could also keep them as it is with the default configuration.

…) in the parse_config

…icy-definition [WIP, Policy] Docstring and refactoring on top of PR 327

…ns workflow

initial commit: split the base policy and the architectures

98e4d7e

engmubarak48 linked an issue Jun 18, 2024 that may be closed by this pull request

Flexible Policy Definition #293

Open

engmubarak48 requested review from michalkoziarski, josephdviviano and alexhernandezgarcia June 18, 2024 19:07

josephdviviano approved these changes Jun 18, 2024

View reviewed changes

gflownet/policy/mlp.py Outdated Show resolved Hide resolved

engmubarak48 added 5 commits June 20, 2024 20:46

ignore git logs

1388756

refactor base policy class and move models into differrent file

7c8a517

handle when config none gracefully

4ecd865

black formatting

d50aeea

further formatting with isort

1508e4b

engmubarak48 added 5 commits June 27, 2024 23:43

formatting black + isort

9cdf18e

added flatten flag and device movement handling

8af7f82

bug fix: use .cpu() before .numpy()

9474150

added cnn policy, and flatten flag should be set to false when using …

83a4904

…cnn policy

black formatting

e254cf2

engmubarak48 commented Jun 28, 2024

View reviewed changes

engmubarak48 marked this pull request as ready for review June 28, 2024 04:31

AlexandraVolokhova reviewed Jul 3, 2024

View reviewed changes

gflownet/policy/cnn.py Outdated Show resolved Hide resolved

AlexandraVolokhova reviewed Jul 3, 2024

View reviewed changes

gflownet/policy/cnn.py Outdated Show resolved Hide resolved

AlexandraVolokhova reviewed Jul 3, 2024

View reviewed changes

gflownet/policy/base.py Outdated Show resolved Hide resolved

AlexandraVolokhova reviewed Jul 3, 2024

View reviewed changes

gflownet/policy/base.py Outdated Show resolved Hide resolved

engmubarak48 added 3 commits July 9, 2024 18:35

smaller cnn config like kernel size etc

d30c3f2

minor refactor on parse_config

2ac9af5

move self.is_model to instantiate and add super().parse_config(config…

7e02200

…) in the parse_config

engmubarak48 removed the request for review from michalkoziarski July 9, 2024 23:42

alexhernandezgarcia added 5 commits July 10, 2024 18:01

Add docstring and typing to __init__ of policy base.

e6a14ef

Use kwargs instead of listing parameters explicitly

57b0b13

Policy MLP: docstring and typing.

178a08e

Get rid of parse_config and include its content in __init__

ace8a28

Combine instantiate and make_* into a single method make_model()

8e6f03d

alexhernandezgarcia mentioned this pull request Jul 10, 2024

[WIP, Policy] Docstring and refactoring on top of PR 327 #335

Merged

alexhernandezgarcia added 3 commits July 10, 2024 20:02

Missing import

774c411

Fix config issue by implementing _get_config()

9520315

Docstring for base argument

910a948

engmubarak48 self-assigned this Aug 21, 2024

josephdviviano and others added 9 commits September 18, 2024 18:11

Merge pull request #335 from alexhernandezgarcia/ahg/293-flexible-pol…

c9ec03f

…icy-definition [WIP, Policy] Docstring and refactoring on top of PR 327

remove env from the cnn policy

9fa9381

init the cnn env's height and width in the policy

2bf438a

add mlp to device

9395e61

formatting

595db80

debug: add to print environment information when running GitHub Actio…

6797bcc

…ns workflow

Add the specific versions of pymatgen and spglib

95258a0

Merge branch 'main' into 293-flexible-policy-definition

61f1e1d

revert version downgrade of pymatgen

b7f33d8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Make adding new Policy Models flexible #327

[WIP] Make adding new Policy Models flexible #327

engmubarak48 commented Jun 18, 2024

josephdviviano left a comment

alexhernandezgarcia commented Jun 25, 2024

engmubarak48 Jun 28, 2024

engmubarak48 Jun 28, 2024

engmubarak48 commented Jun 28, 2024 •

edited

Loading

AlexandraVolokhova commented Jul 3, 2024

engmubarak48 commented Jul 9, 2024

[WIP] Make adding new Policy Models flexible #327

Are you sure you want to change the base?

[WIP] Make adding new Policy Models flexible #327

Conversation

engmubarak48 commented Jun 18, 2024

josephdviviano left a comment

Choose a reason for hiding this comment

alexhernandezgarcia commented Jun 25, 2024

engmubarak48 Jun 28, 2024

Choose a reason for hiding this comment

engmubarak48 Jun 28, 2024

Choose a reason for hiding this comment

engmubarak48 commented Jun 28, 2024 • edited Loading

AlexandraVolokhova commented Jul 3, 2024

engmubarak48 commented Jul 9, 2024

engmubarak48 commented Jun 28, 2024 •

edited

Loading