
[Feature] Offline datasets: D4RL #928

Merged
merged 86 commits into from
Mar 16, 2023

Conversation

vmoens
Contributor

@vmoens vmoens commented Feb 20, 2023

Description

Integrates offline RL datasets into torchrl.

>>> from torchrl.data.datasets import D4RLExperienceReplay
>>> data = D4RLExperienceReplay('kitchen-complete-v0', split_trajs=True)
>>> print(data._storage._storage)
TensorDict(
    fields={
        _batch_size: MemmapTensor(shape=torch.Size([20, 1]), device=cpu, dtype=torch.int64, is_shared=False),
        action: MemmapTensor(shape=torch.Size([20, 207, 9]), device=cpu, dtype=torch.float32, is_shared=False),
        done: MemmapTensor(shape=torch.Size([20, 207]), device=cpu, dtype=torch.bool, is_shared=False),
        index: MemmapTensor(shape=torch.Size([20]), device=cpu, dtype=torch.int32, is_shared=False),
        infos: MemmapTensor(shape=torch.Size([20, 207]), device=cpu, dtype=torch.int64, is_shared=False),
        mask: MemmapTensor(shape=torch.Size([20, 207]), device=cpu, dtype=torch.bool, is_shared=False),
        next: TensorDict(
            fields={
                done: MemmapTensor(shape=torch.Size([20, 207]), device=cpu, dtype=torch.bool, is_shared=False),
                observation: MemmapTensor(shape=torch.Size([20, 207, 60]), device=cpu, dtype=torch.float32, is_shared=False),
                reward: MemmapTensor(shape=torch.Size([20, 207]), device=cpu, dtype=torch.float32, is_shared=False)},
            batch_size=torch.Size([20, 207]),
            device=cpu,
            is_shared=False),
        observation: MemmapTensor(shape=torch.Size([20, 207, 60]), device=cpu, dtype=torch.float32, is_shared=False),
        timeouts: MemmapTensor(shape=torch.Size([20, 207]), device=cpu, dtype=torch.bool, is_shared=False),
        traj_ids: MemmapTensor(shape=torch.Size([20, 207]), device=cpu, dtype=torch.int64, is_shared=False)},
    batch_size=torch.Size([20]),
    device=cpu,
    is_shared=False)
>>> print(data.sample(10))  # will sample 10 trajectories since split_trajs is set to True
TensorDict(
    fields={
        action: Tensor(shape=torch.Size([10, 207, 9]), device=cpu, dtype=torch.float32, is_shared=False),
        done: Tensor(shape=torch.Size([10, 207]), device=cpu, dtype=torch.bool, is_shared=False),
        index: Tensor(shape=torch.Size([10, 207]), device=cpu, dtype=torch.int32, is_shared=False),
        infos: Tensor(shape=torch.Size([10, 207]), device=cpu, dtype=torch.int64, is_shared=False),
        mask: Tensor(shape=torch.Size([10, 207]), device=cpu, dtype=torch.bool, is_shared=False),
        next: TensorDict(
            fields={
                done: Tensor(shape=torch.Size([10, 207]), device=cpu, dtype=torch.bool, is_shared=False),
                observation: Tensor(shape=torch.Size([10, 207, 60]), device=cpu, dtype=torch.float32, is_shared=False),
                reward: Tensor(shape=torch.Size([10, 207]), device=cpu, dtype=torch.float32, is_shared=False)},
            batch_size=torch.Size([10, 207]),
            device=cpu,
            is_shared=False),
        observation: Tensor(shape=torch.Size([10, 207, 60]), device=cpu, dtype=torch.float32, is_shared=False),
        timeouts: Tensor(shape=torch.Size([10, 207]), device=cpu, dtype=torch.bool, is_shared=False),
        traj_ids: Tensor(shape=torch.Size([10, 207]), device=cpu, dtype=torch.int64, is_shared=False)},
    batch_size=torch.Size([10, 207]),
    device=cpu,
    is_shared=False)
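In the layout above, `split_trajs=True` reshapes the flat dataset into `[n_trajectories, max_traj_length]` (here `[20, 207]`), padding shorter trajectories and marking valid steps in `mask`. As a minimal stand-alone sketch of that padding scheme (plain Python, not the torchrl implementation; `split_and_pad` is a hypothetical helper for illustration):

```python
def split_and_pad(trajectories, pad_value=0.0):
    """Pad variable-length trajectories to a common length and
    return (padded, mask), mimicking the split_trajs layout."""
    max_len = max(len(t) for t in trajectories)
    padded, mask = [], []
    for traj in trajectories:
        n_pad = max_len - len(traj)
        # Valid steps keep their values; padded steps get pad_value and mask=False.
        padded.append(list(traj) + [pad_value] * n_pad)
        mask.append([True] * len(traj) + [False] * n_pad)
    return padded, mask

# Two trajectories of different lengths, e.g. per-step rewards.
rewards = [[1.0, 0.5], [0.2, 0.3, 0.4]]
padded, mask = split_and_pad(rewards)
# padded -> [[1.0, 0.5, 0.0], [0.2, 0.3, 0.4]]
# mask   -> [[True, True, False], [True, True, True]]
```

Downstream losses would then use `mask` to exclude the padded steps, which is why the sampled batches above carry a `mask` entry of the same `[*, 207]` shape.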

These datasets can be used with transforms:

>>> from torchrl.data.datasets.d4rl import D4RLExperienceReplay
>>> from torchrl.envs import ObservationNorm
>>> data = D4RLExperienceReplay("maze2d-umaze-v1")
>>> # we can append transforms to the dataset
>>> data.append_transform(ObservationNorm(loc=-1, scale=1.0))
>>> data.sample(128)
TensorDict(
    fields={
        action: Tensor(shape=torch.Size([128, 2]), device=cpu, dtype=torch.float32, is_shared=False),
        done: Tensor(shape=torch.Size([128]), device=cpu, dtype=torch.bool, is_shared=False),
        index: Tensor(shape=torch.Size([128]), device=cpu, dtype=torch.int32, is_shared=False),
        next: TensorDict(
            fields={
                done: Tensor(shape=torch.Size([128]), device=cpu, dtype=torch.bool, is_shared=False),
                observation: Tensor(shape=torch.Size([128, 4]), device=cpu, dtype=torch.float32, is_shared=False),
                reward: Tensor(shape=torch.Size([128]), device=cpu, dtype=torch.float32, is_shared=False)},
            batch_size=torch.Size([128]),
            device=cpu,
            is_shared=False),
        observation: Tensor(shape=torch.Size([128, 4]), device=cpu, dtype=torch.float32, is_shared=False)},
    batch_size=torch.Size([128]),
    device=cpu,
    is_shared=False)
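`ObservationNorm` applies an affine map to the observation entries as they are sampled. A minimal sketch of that idea in plain Python (the exact semantics and `standard_normal` flag shown here are an assumption for illustration, not the library's documented contract):

```python
def observation_norm(obs, loc, scale, standard_normal=False):
    """Affine normalization in the spirit of ObservationNorm.

    standard_normal=False: obs * scale + loc (shift-and-scale)
    standard_normal=True:  (obs - loc) / scale (standardization)
    """
    if standard_normal:
        return [(o - loc) / scale for o in obs]
    return [o * scale + loc for o in obs]

print(observation_norm([0.0, 2.0], loc=-1.0, scale=1.0))  # [-1.0, 1.0]
```

Because the transform is appended to the replay buffer rather than to an environment, the normalization is applied lazily at sampling time, so the on-disk memory-mapped storage stays untouched.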

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 20, 2023
@vmoens vmoens added the enhancement New feature or request label Feb 20, 2023
@BY571 BY571 mentioned this pull request Feb 22, 2023
9 tasks
# Conflicts:
#	torchrl/envs/common.py
#	torchrl/envs/libs/vmas.py
#	torchrl/envs/vec_env.py
@vmoens vmoens changed the title [Feature] Offline datasets [Feature] Offline datasets: D4RL Mar 10, 2023
@vmoens vmoens merged commit ce81995 into main Mar 16, 2023
@vmoens vmoens deleted the offline_datasets branch March 16, 2023 20:27