update normalization #32

FynnBe · 2020-10-23T10:14:13Z

~~README.md will be updated shortly...~~

FynnBe · 2020-10-23T10:34:42Z

We forgot about the need to specify a list of means/std values for mode 'fixed' in case axes are specified. Other than that everything is exactly as discussed in today's bioimage-io meeting.

oeway · 2020-10-23T11:07:02Z

README.md

 - `data_type` data type (e.g. float32)
 - `data_range` tuple of (minimum, maximum)
+ - `axes` string of axes identifying characters from: btczyx


Do we have a place to give detailed definition for the axes and the meaning? Are we allowed to give a custom letter to it?

I think we should be as restrictive with these letters as possible to keep them meaningful/useful from a consumer software perspective, e.g restrict them to btczyx. We can have a (separate) discussion on the axes keys (use HW, instead of xy, etc...). I'd prefer to delay that for 0.3.1+, for now in a given input the description field could add specific meaning in the model context for humans? I feel an axes_description field would go a bit too far anyway, but again, let's leave that for future discussion if necessary.

I do agree that for now, it is too ambitious to get this kind of specifications. I think it was already discussed at some point but outputs might also be better described with rows and columns. While in NumPy arrays those could be understood as HW, displaying them as tables could need a different kind of description.

oeway

Looks good to me.

esgomezm · 2020-10-23T13:19:49Z

OK for the normalization as well.

PD: In a quick search, I found some nice comments about normalization/standardization/stretching:
https://stackoverflow.com/questions/33610825/normalization-in-image-processing/33611556#33611556
there is also centering like mean/median centering for just fixing the mean/median to 0 value.

FynnBe · 2020-10-26T07:51:45Z

https://stackoverflow.com/questions/33610825/normalization-in-image-processing/33611556#33611556

Data standarization is another way of normalizing the data

So normalizing data is not the same as normalization I suppose... I see the ambiguity. So how about we change the name of the normalization field to preprocessing (What, wasn't that what it's called before?? Yes, it was, we had our reasons then and maybe should have listened to our past us... Anyhow, the big difference would be that this is a subfield of an input now. And it is not attached to any .transformation.yaml files, but instead we agree on a strict set of valid transformation names and their kwargs).

All in favor of renaming our newly introduced normalization to preprocessing say 👍

oeway · 2020-10-26T11:40:50Z

preprocessing looks good, it make sense that we support only a very minimal set of preprocessing ops.

fjug · 2020-10-26T13:22:19Z

README.md

+ - `axes` subset of input `axes` to normalize independently (e.g. 'c')
+ - `mean` mean to normalize with (only applies for `mode` 'fixed'). This may be a (nested) list depending on `axes`, e.g. for `axes` 'c' a list of means for each channel; or for axes: 'cz' a list for each channel c of a nested list of means for each z position of that channel.
+ - `std` standard deviation to normalize with (only applies for `mode` 'fixed'). This may be a (nested) list depending on `axes` analogously to `mean`.
+


We like explicit normalization descriptions in this way, but would like to talk about supported normalization schemes rather sooner than later. Maybe one of the next meetings?

Let's put it on the agenda 👍

FynnBe · 2020-10-30T16:19:04Z

We decided in today's bioimage.io meeting:

the preprocessing key is associated to an input in inputs. More complicated preprocessing schemes will have to be covered by model source code for the time being.
make preprocessing a list of dicts with a name and a kwargs key. Order of application is the order in this list.
name needs to be a valid preprocessing name
kwargs need to contain valid key word arguments for the preprocessing specified by name

valid preprocessings we start with:

zero_mean_unit_variance: with kwargs:
- mode: (fixed/per_dataset/per_sample)
- axes: xy # subset of axes to normalize jointly, batch ('b') is not a valid axis key here!
- mean: [1.1, 2.2, 3.3] # mean if mode == fixed. Here it is a list (because in this example we assume a channel dimension of length c=3)
- std: [0.1, 0.2, 0.3] # standard deviation if mode == fixed analogously to mean

todos:

min_max normalization with fixed min, max (from training)
percentile normalization
clipping

FynnBe · 2020-11-03T14:27:46Z

open todos moved to #37

FynnBe requested a review from oeway October 23, 2020 10:14

FynnBe changed the title ~~[WIP] update normalization~~ update normalization Oct 23, 2020

oeway reviewed Oct 23, 2020

View reviewed changes

oeway approved these changes Oct 23, 2020

View reviewed changes

fjug reviewed Oct 26, 2020

View reviewed changes

FynnBe force-pushed the update_normalization branch from 5338dbf to bff31d8 Compare October 27, 2020 14:09

FynnBe added 3 commits November 3, 2020 15:17

update normalization

326997c

update documentation (also in example)

421a51a

normalization -> preprocessing

d7bc5e1

FynnBe force-pushed the update_normalization branch from bff31d8 to d7bc5e1 Compare November 3, 2020 14:17

FynnBe mentioned this pull request Nov 3, 2020

Expand preprocessing for inputs #37

Closed

3 tasks

update zero_mean_unit_variance docs/example

b887d31

FynnBe merged commit 1d12ee6 into master Nov 3, 2020

FynnBe deleted the update_normalization branch November 3, 2020 14:28

constantinpape mentioned this pull request Nov 13, 2020

Add tables weight-formats and pre-post-processing #43

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update normalization #32

update normalization #32

FynnBe commented Oct 23, 2020 •

edited

Loading

FynnBe commented Oct 23, 2020

oeway Oct 23, 2020

FynnBe Oct 23, 2020

esgomezm Oct 23, 2020

oeway left a comment

esgomezm commented Oct 23, 2020

FynnBe commented Oct 26, 2020

oeway commented Oct 26, 2020

fjug Oct 26, 2020

FynnBe Oct 26, 2020

FynnBe commented Oct 30, 2020 •

edited

Loading

FynnBe commented Nov 3, 2020

update normalization #32

update normalization #32

Conversation

FynnBe commented Oct 23, 2020 • edited Loading

FynnBe commented Oct 23, 2020

oeway Oct 23, 2020

Choose a reason for hiding this comment

FynnBe Oct 23, 2020

Choose a reason for hiding this comment

esgomezm Oct 23, 2020

Choose a reason for hiding this comment

oeway left a comment

Choose a reason for hiding this comment

esgomezm commented Oct 23, 2020

FynnBe commented Oct 26, 2020

oeway commented Oct 26, 2020

fjug Oct 26, 2020

Choose a reason for hiding this comment

FynnBe Oct 26, 2020

Choose a reason for hiding this comment

FynnBe commented Oct 30, 2020 • edited Loading

FynnBe commented Nov 3, 2020

FynnBe commented Oct 23, 2020 •

edited

Loading

FynnBe commented Oct 30, 2020 •

edited

Loading