JPEG encoding and decoding if the observation is an image #275

gabrielemaraglino · 2025-02-14T14:03:07Z

Description

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Fixes # (issue), Depends on # (pull request)

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Screenshots

Please attach before and after screenshots of the change if applicable.
To upload images to a PR -- simply drag and drop or copy paste.

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have run pytest -v and no errors are present.
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I solved any possible warnings that pytest -v has generated that are related to my code to the best of my knowledge.
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

younik

Thanks for adding this feature!

Can you add the line .idea in the .gitignore file?
Also, can you add some tests? You can make a fake env that generates images as obs, and check that everything is okay (serialization works and deserialization returns the same obs). Check the dataset_creation tests for the examples

younik · 2025-02-19T10:51:00Z

.gitignore

@@ -157,4 +157,4 @@ cython_debug/
 #  be found at https://github.com/github/gitignore/blob/main/Global/JetBrains.gitignore
 #  and can be added to the global gitignore or merged into this file.  For a more nuclear
 #  option (not recommended) you can uncomment the following to ignore the entire idea folder.
-#.idea/
+.idea


Uhm, maybe you should keep the '/'
The point of this is that the idea files in the code diff should disappear

younik · 2025-02-19T10:52:56Z

tests/test_jpeg_serialization.py

+class TestEnv:
+    def __init__(self):
+        self.observation_space = spaces.Box(low=0, high=255, shape=(4, 84, 84), dtype=np.uint8)
+        self.action_space = spaces.Discrete(2)
+
+    def reset(self):
+        obs = np.random.randint(0, 256, (4, 84, 84), dtype=np.uint8)
+        return obs, {}
+
+    def step(self, action):
+        obs = np.random.randint(0, 256, (4, 84, 84), dtype=np.uint8)
+        reward = 1.0
+        terminated = False
+        truncated = False
+        info = {}
+        return obs, reward, terminated, truncated, info


This should go in common.py in tests

And I would call it something like "ImageObsEnv"

younik · 2025-02-19T10:53:30Z

tests/test_jpeg_serialization.py

+def generate_episode_buffer_from_env(env: TestEnv, length=3) -> EpisodeBuffer:
+    initial_obs , _ = env.reset()
+    buffer = EpisodeBuffer(observations=initial_obs)
+    for i in range(length):
+        action = env.action_space.sample()
+        obs, reward, terminated, truncated, info = env.step(action)
+        step_data = {
+            "observation": obs,
+            "action": action,
+            "reward": reward,
+            "terminated": terminated,
+            "truncated": truncated,
+            "info": info
+        }
+        buffer = buffer.add_step_data(step_data)
+    return buffer


This function is already present in common.py, just use that one

younik · 2025-02-19T11:00:09Z

tests/test_jpeg_serialization.py

+def test_arrow_storage_serialization():
+    env = TestEnv()
+    episode = generate_episode_buffer_from_env(env, length=3)
+    with tempfile.TemporaryDirectory() as tmpdir:
+        tmp_path = pathlib.Path(tmpdir)
+
+        METADATA_FILE_NAME = "metadata.json"
+        default_metadata = {
+            "total_steps": 0,
+            "total_episodes": 0,
+            "data_format": "arrow",
+            "observation_space": str(env.observation_space),
+            "action_space": str(env.action_space)
+        }
+        metadata_path = tmp_path.joinpath(METADATA_FILE_NAME)
+        metadata_path.write_text(json.dumps(default_metadata))
+
+        from minari.dataset._storages.arrow_storage import ArrowStorage
+        storage = ArrowStorage(tmp_path, env.observation_space, env.action_space)
+        storage.update_episodes([episode])
+        loaded_episode = list(storage.get_episodes([0]))
+        loaded_obs = loaded_episode[0]["observations"]
+        np.testing.assert_array_equal(episode.observations, loaded_obs)
+
+def test_hdf5_serialization():
+    env = TestEnv()
+    episode = generate_episode_buffer_from_env(env, length=3)
+    with tempfile.TemporaryDirectory() as tmpdir:
+        tmp_path = pathlib.Path(tmpdir)
+
+        METADATA_FILE_NAME = "metadata.json"
+        default_metadata = {
+            "total_steps": 0,
+            "total_episodes": 0,
+            "data_format": "hdf5",
+            "observation_space": str(env.observation_space),
+            "action_space": str(env.action_space)
+        }
+        metadata_path = tmp_path.joinpath(METADATA_FILE_NAME)
+        metadata_path.write_text(json.dumps(default_metadata))
+
+        from minari.dataset._storages.hdf5_storage import HDF5Storage
+        storage = HDF5Storage._create(tmp_path, env.observation_space, env.action_space)
+        storage.update_episodes([episode])
+        loaded_episodes = list(storage.get_episodes([0]))
+        loaded_obs = loaded_episodes[0]["observations"]
+        np.testing.assert_array_equal(episode.observations, loaded_obs)


These tests can be merged in a single function, and use pytest.mark.parametrize to test both storages.

Also, in the end, I believe that if you add your test env in the list of envs as mentioned above, you can rely on previous written tests that should already check equality, and this file can be removed

younik · 2025-02-19T11:08:24Z

minari/dataset/_storages/arrow_storage.py

-        values = np.pad(values, ((0, pad), (0, 0)))
-        dtype = pa.list_(pa.from_numpy_dtype(space.dtype), list_size=values.shape[1])
-        return pa.FixedSizeListArray.from_arrays(values.reshape(-1), type=dtype)
+        if values.shape == (4, 84, 84) and values.dtype == np.uint8: # check for image observation (4 stacked greyscale images)


we should not constrain ourselves with 84 x 84 images, but we should accept any size.

Same reasoning for the 4 dimension.

I think the discriminant here is to have Box, with at least 2 dim, and with type uint8. Then you can put a logging.warn saying you are considering it as image, and if this is not intended to disable it via a flag "image_observation" which is defaulted to None. Then you can compute the value of the flag during init (and warn just once). I will clarify later in our meeting.

younik · 2025-02-19T11:09:13Z

minari/dataset/_storages/arrow_storage.py

+        if values.type == pa.binary():  # check for binary data (JPEG)
+            jpeg_images = []
+            for jpeg_bytes in values:
+                image = Image.open(io.BytesIO(jpeg_bytes)).convert("L") # decode JPEG and convert to greyscale


we should work with non-grayscale images as well.

gabrielemaraglino added 5 commits January 30, 2025 11:21

Add JPEG-encoding for images

8106db5

Add JPEG-decoding in _decode_space

5bf0cc3

Add JPEG encoding and decoding in pyarrow and hdf5

a4762c8

fixed missing backtick

a348f1a

fixed typo

d843f9f

younik requested changes Feb 14, 2025

View reviewed changes

add JPEG serialization/deserialization tests

2aa2f44

younik requested changes Feb 19, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JPEG encoding and decoding if the observation is an image #275

JPEG encoding and decoding if the observation is an image #275

gabrielemaraglino commented Feb 14, 2025

younik left a comment •

edited

Loading

younik Feb 19, 2025

younik Feb 19, 2025

younik Feb 19, 2025

younik Feb 19, 2025

younik Feb 19, 2025

younik Feb 19, 2025

JPEG encoding and decoding if the observation is an image #275

Are you sure you want to change the base?

JPEG encoding and decoding if the observation is an image #275

Conversation

gabrielemaraglino commented Feb 14, 2025

Description

Type of change

Screenshots

Checklist:

younik left a comment • edited Loading

Choose a reason for hiding this comment

younik Feb 19, 2025

Choose a reason for hiding this comment

younik Feb 19, 2025

Choose a reason for hiding this comment

younik Feb 19, 2025

Choose a reason for hiding this comment

younik Feb 19, 2025

Choose a reason for hiding this comment

younik Feb 19, 2025

Choose a reason for hiding this comment

younik Feb 19, 2025

Choose a reason for hiding this comment

younik left a comment •

edited

Loading