Activity

Small fix in epistemic neural networks

facebook-github-botpushed 1 commit to main • 6a26fe3…b0f58f4 •

25 days ago

fbcode//pearl

facebook-github-botpushed 1 commit to main • 0702f51…6a26fe3 •

on Jan 23

Announcing Pearl serialization

facebook-github-botpushed 1 commit to main • 01c16fd…0702f51 •

on Jan 23

Test serialization

facebook-github-botpushed 1 commit to main • adcebb2…01c16fd •

on Jan 21

dqn for frozen lake tutorial

facebook-github-botpushed 1 commit to main • 0bf2b6b…adcebb2 •

on Jan 21

Make all Pearl components torch.nn.Modules

facebook-github-botpushed 2 commits to main • 8c3a116…0bf2b6b •

on Jan 17

Implement `compare` for safety modules

facebook-github-botpushed 1 commit to main • 043fdfa…8c3a116 •

on Dec 31, 2024

Implement `compare` for contextual bandits

facebook-github-botpushed 1 commit to main • c29ed5f…043fdfa •

on Dec 31, 2024

Implement `compare` for action representation modules

facebook-github-botpushed 2 commits to main • 04d8969…c29ed5f •

on Dec 31, 2024

Implement `compare` for (neural) linear regression modules

facebook-github-botpushed 4 commits to main • 4b8e774…04d8969 •

on Dec 31, 2024

Refactor TestAgentWithPyTorch so its models can be reused

facebook-github-botpushed 1 commit to main • 6066854…4b8e774 •

on Dec 30, 2024

Unit test for dqn and double dqn

facebook-github-botpushed 1 commit to main • 5c99dd8…6066854 •

on Dec 27, 2024

Publishing the model with exploration scores for CB use-cases

facebook-github-botpushed 1 commit to main • 334a08c…5c99dd8 •

on Dec 23, 2024

add the missing .shape to q value networks

facebook-github-botpushed 2 commits to main • 00ed853…334a08c •

on Dec 19, 2024

Add missing Pyre mode headers] [batch:21/1502] [shard:31/N]

facebook-github-botpushed 1 commit to main • c28dd27…00ed853 •

on Dec 19, 2024

Documentation generation via gh actions (#107)

facebook-github-botpushed 1 commit to main • 5b9138f…c28dd27 •

on Dec 17, 2024

Allow twin critic to take network instances and test SAC in atari games.

facebook-github-botpushed 1 commit to main • a92fdd9…5b9138f •

on Dec 13, 2024

Add CNNActorNetwork

facebook-github-botpushed 1 commit to main • 1b0702e…a92fdd9 •

on Dec 13, 2024

Rename VanillaCNN to CNNValueNetwork and divide state values by 255.

facebook-github-botpushed 1 commit to main • 5d9316b…1b0702e •

on Dec 13, 2024

add autotune to discrete sac

facebook-github-botpushed 1 commit to main • cafb382…5d9316b •

on Dec 13, 2024

multi-head CNN

facebook-github-botpushed 1 commit to main • 1b86382…cafb382 •

on Dec 13, 2024

modify epsilon greedy exploration to add epsilon scheduling

facebook-github-botpushed 1 commit to main • c9576ed…1b86382 •

on Dec 13, 2024

Fix training_epochs meaning in Pearl offline learning

facebook-github-botpushed 1 commit to main • 1f50db2…c9576ed •

on Dec 12, 2024

Multiple minor warnings

facebook-github-botpushed 1 commit to main • 7b25efe…1f50db2 •

on Dec 12, 2024

Fix the error when max_number_of_actions is None and the action_space…

facebook-github-botpushed 1 commit to main • f435791…7b25efe •

on Dec 11, 2024

setup atari experiments

facebook-github-botpushed 4 commits to main • 6d107ec…f435791 •

on Dec 10, 2024

allow passing optimizers to policy learners

facebook-github-botpushed 1 commit to main • 8941dbe…6d107ec •

on Dec 7, 2024

change observations only from float64 to float32 before adding to the…

facebook-github-botpushed 1 commit to main • 3e52c4a…8941dbe •

on Dec 7, 2024

replace _critic_update_count with _training_steps in td3

facebook-github-botpushed 1 commit to main • 3ac6732…3e52c4a •

on Dec 7, 2024

Make csac's autotune more efficient

facebook-github-botpushed 1 commit to main • 7f022a9…3ac6732 •

on Dec 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Small fix in epistemic neural networks

fbcode//pearl

Announcing Pearl serialization

Test serialization

dqn for frozen lake tutorial

Make all Pearl components torch.nn.Modules

Implement `compare` for safety modules

Implement `compare` for contextual bandits

Implement `compare` for action representation modules

Implement `compare` for (neural) linear regression modules

Refactor TestAgentWithPyTorch so its models can be reused

Unit test for dqn and double dqn

Publishing the model with exploration scores for CB use-cases

add the missing .shape to q value networks

Add missing Pyre mode headers] [batch:21/1502] [shard:31/N]

Documentation generation via gh actions (#107)

Allow twin critic to take network instances and test SAC in atari games.

Add CNNActorNetwork

Rename VanillaCNN to CNNValueNetwork and divide state values by 255.

add autotune to discrete sac

multi-head CNN

modify epsilon greedy exploration to add epsilon scheduling

Fix training_epochs meaning in Pearl offline learning

Multiple minor warnings

Fix the error when max_number_of_actions is None and the action_space…

setup atari experiments

allow passing optimizers to policy learners

change observations only from float64 to float32 before adding to the…

replace _critic_update_count with _training_steps in td3

Make csac's autotune more efficient

Small fix in epistemic neural networks

fbcode//pearl

Announcing Pearl serialization

Test serialization

dqn for frozen lake tutorial

Make all Pearl components torch.nn.Modules

Implement compare for safety modules

Implement compare for contextual bandits

Implement compare for action representation modules

Implement compare for (neural) linear regression modules

Refactor TestAgentWithPyTorch so its models can be reused

Unit test for dqn and double dqn

Publishing the model with exploration scores for CB use-cases

add the missing .shape to q value networks

Add missing Pyre mode headers] [batch:21/1502] [shard:31/N]

Documentation generation via gh actions (#107)

Allow twin critic to take network instances and test SAC in atari games.

Add CNNActorNetwork

Rename VanillaCNN to CNNValueNetwork and divide state values by 255.

add autotune to discrete sac

multi-head CNN

modify epsilon greedy exploration to add epsilon scheduling

Fix training_epochs meaning in Pearl offline learning

Multiple minor warnings

Fix the error when max_number_of_actions is None and the action_space…

setup atari experiments

allow passing optimizers to policy learners

change observations only from float64 to float32 before adding to the…

replace _critic_update_count with _training_steps in td3

Make csac's autotune more efficient

Implement `compare` for safety modules

Implement `compare` for contextual bandits

Implement `compare` for action representation modules

Implement `compare` for (neural) linear regression modules