Add color profile to training #22

jonasteuwen · 2023-11-22T12:11:26Z

This PR allows ICC profiles to be used during training.

- The ICC profile is applied in the dataset.
- H5FileImageWriter needs to be able to embed the icc_profile.
  - Add the possibility to embed the icc_profile.

Furthermore, it also fixes #43.

BPdeRooij · 2023-12-19T12:38:38Z

ahcore/writers.py

        # This only works when the mode is 'overflow' and in 'C' order.
        metadata = {
            "mpp": self._mpp,
-            "dtype": str(batch_dtype),


Removing dtype from metadata causes the reader to fail in the _open_file function.

Two further comments:

Applying the ICC profiles during training works (tested with TCGA images and the OpenSlide backend).

When using the CLI functionality of tiling.py, a type error is raised in data.py. Simply silencing the pylint warning with a comment is not enough. I believe adding from __future__ import annotations at the top of the file would solve the problem.

AjeyPaiK · 2023-12-19

I also observe this.

jonasteuwen · 2023-12-24

Should be fixed?

AjeyPaiK · 2024-01-03

Yes, thanks!
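
For readers unfamiliar with the suggested fix, here is a minimal sketch of why from __future__ import annotations can make an annotation-related error disappear. It does not reproduce the actual failure in data.py: the module body and the deliberately invalid annotation `1 | None` are hypothetical stand-ins, chosen only because they raise a TypeError whenever the annotation is evaluated.

```python
# Hypothetical module body: the annotation `1 | None` is invalid on purpose
# and raises TypeError as soon as Python evaluates it.
MODULE_BODY = '''
def read_region(size: 1 | None) -> None:
    return None
'''


def try_import(source: str) -> str:
    """Execute module source and report whether defining its functions failed."""
    namespace: dict = {}
    try:
        exec(compile(source, "<example>", "exec"), namespace)
    except (NameError, TypeError) as exc:
        return f"failed: {type(exc).__name__}"
    return "ok"


# Without the future import, interpreters that evaluate annotations eagerly
# (Python versions before 3.14) fail while defining the function.
without_future = try_import(MODULE_BODY)

# With the future import, annotations are stored as strings and never
# evaluated at definition time, so the module loads.
with_future = try_import("from __future__ import annotations\n" + MODULE_BODY)

print("without future import:", without_future)
print("with future import:", with_future)
```

The import only postpones evaluation. Anything that later resolves the annotations, for example typing.get_type_hints, would still hit the same error.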

EricMarcus-ai

Nice. Some minor comments added.

EricMarcus-ai · 2023-12-19T11:26:16Z

ahcore/writers.py

@@ -33,18 +57,23 @@ def __init__(
        tile_size: tuple[int, int],
        tile_overlap: tuple[int, int],
        num_samples: int,
-        is_binary: bool = False,
+        is_compressed_image: bool = False,


Should this be a bool, i.e., is the behavior completely the same irrespective of the compression type? Or should we make an Enum somewhere with the known compression types?

jonasteuwen · 2023-12-24

It should be independent as the compression type is included in the binary blob.
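
To illustrate the point that the compression type travels with the binary blob, here is a small sketch that recognizes PNG and JPEG data from their leading bytes. The function name sniff_image_format is hypothetical and not part of ahcore. A real reader would more likely hand the bytes to an image library, which performs this detection itself.

```python
# Leading bytes defined by the respective file formats.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
JPEG_START = b"\xff\xd8\xff"  # start-of-image marker followed by the next marker byte


def sniff_image_format(blob: bytes) -> str:
    """Guess the image format of an encoded blob from its first bytes."""
    if blob.startswith(PNG_SIGNATURE):
        return "PNG"
    if blob.startswith(JPEG_START):
        return "JPEG"
    return "UNKNOWN"


# Truncated, hand-written headers are enough to show the idea.
print(sniff_image_format(PNG_SIGNATURE + b"\x00\x00\x00\rIHDR"))
print(sniff_image_format(JPEG_START + b"\xe0\x00\x10JFIF"))
print(sniff_image_format(b"raw pixel data"))
```

Because the format can be recovered from the stored bytes, a single boolean such as is_compressed_image is enough to tell the writer whether it is storing encoded blobs or raw arrays.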

EricMarcus-ai · 2023-12-19T12:10:57Z

ahcore/writers.py

        # This only works when the mode is 'overflow' and in 'C' order.
        metadata = {
            "mpp": self._mpp,
-            "dtype": str(batch_dtype),


Why is this removed? It is being used in readers, for example here:

ahcore/ahcore/readers.py

Line 113 in 0ae5c1b

self._dtype = self._metadata["dtype"]

We could add it with first_batch.dtype if necessary.

jonasteuwen · 2023-12-24

Added it back
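
The failure mode discussed in this thread can be sketched as follows. This is a simplified stand-in, not the ahcore code: build_metadata and read_dtype are hypothetical helpers, and storing the metadata as JSON is an assumption made for the example. Only the direct lookup metadata["dtype"] mirrors the reader line quoted above.

```python
import json
from typing import Optional


def build_metadata(mpp: float, dtype: Optional[str]) -> str:
    """Serialize writer-side metadata; dtype=None simulates the removed key."""
    metadata = {"mpp": mpp}
    if dtype is not None:
        metadata["dtype"] = dtype
    return json.dumps(metadata)


def read_dtype(metadata_json: str) -> str:
    """Mimic a reader that indexes the key directly."""
    metadata = json.loads(metadata_json)
    return metadata["dtype"]  # raises KeyError when the writer omitted the key


# Writer keeps the key: the reader works.
print(read_dtype(build_metadata(0.5, "uint8")))

# Writer drops the key: the reader fails on open.
try:
    read_dtype(build_metadata(0.5, None))
except KeyError as exc:
    print(f"reader failed: missing key {exc}")
```

Keeping the key in the writer, as was done here, is the smallest fix. A reader using metadata.get("dtype") with a fallback would be the alternative if older files without the key had to be supported.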

EricMarcus-ai · 2023-12-19T12:57:25Z

ahcore/writers.py

+        else:
+            _mode = "ARRAY"
+            _format = "RAW"
+            _num_channels = first_batch.shape[-1]


I think there is a typo here, should be first_batch.shape[1], right?

jonasteuwen · 2023-12-24

I think it's correct, as this is the conversion of a PIL image to an array, which is channels-last. If it were a tensor, it would indeed be channels-first:

    import PIL.Image
    import numpy as np
    print(np.asarray(PIL.Image.open("image.jpg")).shape)
    # (1635, 1966, 3)

AjeyPaiK · 2023-12-19T16:08:33Z

@jonasteuwen, this PR also fixes #16. It would be nice to mention that in the PR description.

Made this commit to make ahcore compatible with upstream changes made to dlup.

AjeyPaiK

LGTM

Add color profile to training

1a5566a

jonasteuwen marked this pull request as draft November 22, 2023 12:11

jonasteuwen and others added 4 commits December 8, 2023 11:48

Merge branch 'main' into feature/use-icc-profile

0e30862

Add ICC profile to tiling

9da9fdf

bump dlup version

c2821b9

Add color profile to the writers

1f0f54f

jonasteuwen marked this pull request as ready for review December 8, 2023 22:52

jonasteuwen requested review from VanessaBotha and moerlemans December 8, 2023 22:53

jonasteuwen mentioned this pull request Dec 8, 2023

h5ImageReader needs to read ICC profiles #42

Open

Tiler now supports proper PNG and JPEG output.

742e68f

jonasteuwen mentioned this pull request Dec 10, 2023

H5FileImageWriter metadata is wrong #43

Closed

jonasteuwen added 3 commits December 10, 2023 17:37

Let's be explicit

105e996

Raise error if icc profile is present

e847c1a

Merge reworked callbacks

4bcf11a

jonasteuwen requested review from EricMarcus-ai and BPdeRooij December 15, 2023 16:59

AjeyPaiK reviewed Dec 19, 2023

BPdeRooij reviewed Dec 19, 2023

EricMarcus-ai reviewed Dec 19, 2023

AjeyPaiK and others added 2 commits December 21, 2023 18:24

Compute bounded size during h5 writer callback

68592ac

Made this commit to make ahcore compatible with upstream changes made to dlup.

Fix according to comments

c5887c9

jonasteuwen requested review from EricMarcus-ai and BPdeRooij January 1, 2024 15:05

AjeyPaiK approved these changes Jan 3, 2024

jonasteuwen merged commit 172384a into main Jan 3, 2024
2 checks passed

AjeyPaiK mentioned this pull request Jan 5, 2024

Fix H5FileImageWriter Callback #17

Closed

AjeyPaiK deleted the feature/use-icc-profile branch May 24, 2024 13:05
