Add docs and CLI #11

LorenzLamm · 2023-06-06T07:36:44Z

I added some documentation for the tomo_preprocessing module: better docstrings (I tried to follow the numpy docstring style for all functions) and a Readme file for the module. I also used the teamtomo docs structure, which so far has an overview page and a preprocessing page.
(I guess it should all be prettier in the end, with some images and more usage examples.)

I also tried to implement the Typer CLI (thanks for the suggestion, Alister), which seems really nice to use.

By also adding it to the pyproject.toml file, the preprocessing methods can now be accessed using
"tomo_preprocessing NAME_OF_PREPROCESSING".

Let me know if this is going in the right direction. If not, I'm happy to adjust. If yes, I'll extend the documentation and Typer CLIs to the other functionalities as well (i.e. training and prediction).

LorenzLamm · 2023-06-06T07:43:41Z

src/tomo_preprocessing/__init__.py

I needed to add the # noqa: F401 comment, because otherwise one of the pre-commit formatting packages would always remove all lines here (because the imported functions are not used).
Not sure if there is a more elegant way to avoid this.

Hmm I think this gets removed because the __all__ exposing these imports is not implemented. In any case, I think it's fine to use the noqa for now.

https://stackoverflow.com/questions/44834/what-does-all-mean-in-python

LorenzLamm · 2023-06-06T07:45:07Z

src/tomo_preprocessing/amplitude_spectrum_matching/_cli.py

This is the Typer CLI for the amplitude matching.
Again, I had to add a comment, # noqa: B008, because otherwise a pre-commit package (I think ruff) would complain about the "Option" class being called in the function arguments.

no worries. I think it's fine here.
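As background, B008 is the flake8-bugbear rule against function calls in argument defaults, because a default is evaluated once, when the function is defined. The sketch below shows the pitfall with hypothetical helper names (next_id, make_record); it is not code from this PR.

```python
import itertools

_counter = itertools.count()


def next_id() -> int:
    """Return a fresh integer on every call."""
    return next(_counter)


# The call in the default runs once, at definition time, so every call that
# relies on the default gets the same id. This is the pattern B008 flags.
def make_record(name, record_id=next_id()):  # noqa: B008
    return (name, record_id)


# Usual fix: default to None and make the call inside the function body.
def make_record_fixed(name, record_id=None):
    if record_id is None:
        record_id = next_id()
    return (name, record_id)
```

With Typer, putting typer.Option(...) in the default is the intended usage, since the call only builds parameter metadata, so suppressing the warning there is reasonable. Typer also documents an Annotated-based style that avoids the call in the default, if the noqa comments become annoying.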

LorenzLamm · 2023-06-06T07:46:04Z

src/tomo_preprocessing/pixel_size_matching/_cli.py

Typer CLI for pixel size matching

LorenzLamm · 2023-06-06T07:48:33Z

src/tomo_preprocessing/pixel_size_matching/match_pixel_size_seg.py

This script is new: in the previous version, I only had pixel size matching based on Fourier cropping / extension. However, this does not work for the binary segmentations produced by the U-Net, so it was not possible to rescale a segmentation to the original tomogram pixel size.

This function does exactly that: It rescales the segmentation to match the original tomogram using scipy's zoom function.

kevinyamauchi

Hey @LorenzLamm ! Sorry for the delay. I think this looks really nice! I left some minor comments below. @alisterburt and I are going to be out on a bike trip next week, so please feel free to merge yourself once you address the comments.

I am not sure how GitHub Pages is configured for teamtomo, so we may need to update the docs-building CI, but let's do that in a follow-up PR.

Nice work!

kevinyamauchi · 2023-06-11T07:07:25Z

src/tomo_preprocessing/__init__.py

Hmm I think this gets removed because the __all__ exposing these imports is not implemented. In any case, I think it's fine to use the noqa for now.

https://stackoverflow.com/questions/44834/what-does-all-mean-in-python

kevinyamauchi · 2023-06-11T07:09:56Z

docs/Usage/Preprocessing.md

@@ -0,0 +1,94 @@
+# Preprocessing


Super nice!

docs/index.md

kevinyamauchi · 2023-06-11T07:15:51Z

src/tomo_preprocessing/amplitude_spectrum_matching/_cli.py

no worries. I think it's fine here.

kevinyamauchi · 2023-06-11T07:30:23Z

src/tomo_preprocessing/matching_utils/spec_matching_utils.py

+    -----
+    This function uses the FFT algorithms from numpy.fft for performing the Fourier
+    transform, and it assumes that the input tomogram has voxel intensities that
+    can be converted to floating point numbers.


I think it would be helpful to clarify what "voxel intensities that can be converted to floating point numbers" means.

kevinyamauchi · 2023-06-11T07:32:24Z

src/tomo_preprocessing/pixel_size_matching/match_pixel_size.py

+    if (pixel_size_in / pixel_size_out) < 1.0:
+        resized_data = fourier_cropping(data, output_shape, smoothing)
+    else:
+        resized_data = fourier_extend(data, output_shape, smoothing)


Is there a need to cover the case where pixel_size_in == pixel_size_out (i.e., just pass the image)?

Theoretically, nothing should happen in the "fourier_extend" case with the same pixel sizes.
But it also doesn't hurt to add this third case to avoid any issues. I added it.
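A minimal sketch of the three-way dispatch discussed here. The function only returns the name of the branch that would run; choose_resize_op and the use of math.isclose for the equality check are assumptions for illustration, not the code that was merged.

```python
import math


def choose_resize_op(pixel_size_in: float, pixel_size_out: float) -> str:
    """Name the branch that would handle the given pixel sizes."""
    ratio = pixel_size_in / pixel_size_out
    # Same pixel size: pass the data through untouched.
    # isclose avoids relying on exact float equality (an assumption here).
    if math.isclose(ratio, 1.0):
        return "passthrough"
    # Larger output pixels mean fewer voxels, so the spectrum is cropped.
    if ratio < 1.0:
        return "fourier_cropping"
    # Smaller output pixels mean more voxels, so the spectrum is extended.
    return "fourier_extend"
```

For example, choose_resize_op(10.0, 20.0) picks the cropping branch, while choose_resize_op(10.0, 10.0) skips resizing altogether.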

kevinyamauchi · 2023-06-11T07:32:45Z

src/tomo_preprocessing/pixel_size_matching/match_pixel_size_seg.py

kevinyamauchi · 2023-06-11T07:36:14Z

src/tomo_preprocessing/pixel_size_matching/match_pixel_size_seg.py

+        target_dim / original_dim
+        for target_dim, original_dim in zip(output_shape, data.shape)
+    ]
+    resized_data = ndimage.zoom(data, rescale_factors, order=1)


Is the order=1 going to cause an issue? It seems like interpolating a segmentation mask could yield erroneous values (e.g., if the user passes a label image or a mask of dtype int, the interpolation would "create" values at boundaries). It might make more sense to use interpolation order=0. You may also need to set prefilter=False (filtering the mask could also lead to erroneous values).

https://docs.scipy.org/doc/scipy/reference/generated/scipy.ndimage.zoom.html

Ah, good catch! Yes, order=1 will cause issues, I guess. I'm replacing it with order=0 and also setting prefilter=False, although that only has an effect when order > 1. But it also doesn't hurt.
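To make the difference concrete, here is a small sketch comparing the two settings on a toy label image. The array, its label values, and the output shape are made up for illustration; only the ndimage.zoom calls mirror the snippet under review.

```python
import numpy as np
from scipy import ndimage

# Toy label image with two labels, 0 and 10 (hypothetical data).
labels = np.array(
    [[0, 0, 10, 10],
     [0, 0, 10, 10]],
    dtype=np.int32,
)
output_shape = (4, 8)
rescale_factors = [
    target_dim / original_dim
    for target_dim, original_dim in zip(output_shape, labels.shape)
]

# Linear interpolation blends neighbouring labels at the boundary,
# producing values that are not valid labels.
linear = ndimage.zoom(labels, rescale_factors, order=1)

# Nearest-neighbour interpolation only copies existing values.
nearest = ndimage.zoom(labels, rescale_factors, order=0, prefilter=False)

print(np.unique(linear))   # contains values other than 0 and 10
print(np.unique(nearest))  # only the original labels
```

For a strictly binary mask, the order=1 result could still be thresholded afterwards, but order=0 avoids the problem for label images in general.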

kevinyamauchi · 2023-06-11T07:37:44Z

src/tomo_preprocessing/pixel_size_matching/match_pixel_size.py

+        The file path to the input tomogram to be processed.
+    output_path : str
+        The file path where the processed tomogram will be stored.
+    pixel_size_in : float


I think this is fine for now, but are voxels always isotropic (i.e., does this need to support an array input)?

Good question!
I have never encountered any tomograms that had anisotropic voxel sizes. I feel the image acquisition schemes and reconstruction algorithms should always lead to isotropic resolution (neglecting e.g. missing wedge artifacts). But maybe there are also exceptions? I need to look into this.
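If anisotropic voxels ever need to be supported, one option would be to accept either a scalar or a per-axis sequence and normalize it up front. The sketch below is purely hypothetical (per_axis_pixel_size and per_axis_ratios do not exist in this PR) and only shows how the pixel_size_in / pixel_size_out ratio could be computed per axis.

```python
from typing import Sequence, Tuple, Union

PixelSize = Union[float, Sequence[float]]


def per_axis_pixel_size(pixel_size: PixelSize, ndim: int = 3) -> Tuple[float, ...]:
    """Expand a scalar pixel size to one value per axis, or validate a sequence."""
    if isinstance(pixel_size, (int, float)):
        return (float(pixel_size),) * ndim
    sizes = tuple(float(size) for size in pixel_size)
    if len(sizes) != ndim:
        raise ValueError(f"expected {ndim} pixel sizes, got {len(sizes)}")
    return sizes


def per_axis_ratios(
    pixel_size_in: PixelSize, pixel_size_out: PixelSize, ndim: int = 3
) -> Tuple[float, ...]:
    """Input/output pixel size ratio for each axis."""
    sizes_in = per_axis_pixel_size(pixel_size_in, ndim)
    sizes_out = per_axis_pixel_size(pixel_size_out, ndim)
    return tuple(s_in / s_out for s_in, s_out in zip(sizes_in, sizes_out))
```

A scalar input keeps today's isotropic behaviour, so existing calls would not need to change.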

LorenzLamm added 6 commits June 5, 2023 20:01

Add Typer CLI and Documentation

c8625d6

Fix description in CLI

d66dd07

Fix almost zero cutoff flag in CLI

13702fb

Change module name to tomo_preprocessing

c5821d2

Add mkdocs Docs

0d766fb

Merge with original template

d3773e7

LorenzLamm marked this pull request as ready for review June 6, 2023 07:42

kevinyamauchi approved these changes Jun 11, 2023

kevinyamauchi changed the title ~~Add docs~~ Add docs and CLI Jun 11, 2023

LorenzLamm and others added 9 commits June 11, 2023 12:27

Update docs/index.md

d94f184

Disclaimer for MemBrain being under development Co-authored-by: Kevin Yamauchi <kevin.yamauchi@gmail.com>

Update docs/index.md

fe49188

Remove mkdocs description. Co-authored-by: Kevin Yamauchi <kevin.yamauchi@gmail.com>

Update src/tomo_preprocessing/amplitude_spectrum_matching/_cli.py

a642f0c

remove redundant comment Co-authored-by: Kevin Yamauchi <kevin.yamauchi@gmail.com>

Add skimage and simpleitk dependencies

aec6d2c

Merge branch 'add_docs' of https://github.com/teamtomo/membrain-seg i…

aa7073f

…nto add_docs

use skimage conversion to float for tomograms

c08a8da

Add same pixel size case

ae88791

Fix typo

b2ba1b2

Add small description to readme

0174726

LorenzLamm merged commit 75e3eb2 into main Jun 11, 2023
