Dev christoph #17

CDStark · 2021-03-25T19:58:57Z

AL as another option for training

Add functionality to predict.py in save_img_mat() to save the image as a nii file. The filetype gets specified in config with parameter save_predict_img_datatype

the new version of the h5py caused an error so the old version of the package gets specified in requirements.txt

Create the function sort_by_informativeness() and the accompaning function uncertainty_sampling() as a first step of the active learning implementation. The idea is for sort_by_informativeness() to sort the training-patches by a certain value that represents it's potential to benefit the model in training.

Correct/finish the function uncertainty_sampling() in active_learning.py that calculates an uncertainty value for a given prediction-Tensor. It now first calculates a value for every pixel and then averages over the entire image to get a value for the image.

Establish the first part of loading data from tf record files for subsequent prediction necessary for active learning. The process is heavily inspired by pipeline.py and predict.py

To avoid redundancy in code, redesign the implementation of active learning in train.py. Build of training pipeline and fitting of the model are now in the same for loop (it doesn't matter if active learning is on or of)

Rename some variables to be better readable. Turn off random shiftig of patches in patching function. Add prediction statement (doesn't work at the moment)

Fix prediction error in active_learning.py by casting the indice list to Float32. Add part that calculates an uncertainty value for every image.

In active_learning.py add code that selects the patches with the highest uncertainty value for training and return these to train.py

For some networks the indices-list should be regularized before prediction. Therefore add this process to prediction in active_learning.py. The process is inspired/copied form predict.py predict_image(). To keep the code more readable put this in a extra function predict()

Introduce the class PatchPool that contains patching relevant parameters and also keeps track of all patches their properties and whether or not they were already used in a prediction or should be used in the next one. For this implement a second class Patch that represents a single patch with relevant information. The PatchPool class contains methods get_unused_patches_indices() that outputs all patches of an image that haven't been used in training the network. Implement this method in query_training_patches() to predict only the next possible candidates for every iteration. The method select_patches is supposed to pick the n best patches for training(not tested yet)

Rebuild the PatchPool class. Use a dictionary which includes a dictionary for every image that contains the patches. This way acsess to specific patches saved in pool is much easier and efficient. To this end implement a method get_pos_key() that creates a key for a given patch index. Also rewrite the methods calculate_values(), get_unused_patches_indices() and select_patches() accordingly. Implement the methods in the code of query_training_patches() !The PatchPool generally works however the indices are initialized before the actual image is patched, if the indexes change during patching an error occurs, this will have to be changed in the following commit

Don't initialize the patches in PatchPool self.pool on initialisation of the class but instead the first time the values of the batches in self.pool get updated. Modify get_unused_patches to return the ideal list of indices if this initialisation hasn't occured yet. To keep Track of this introduce self.patches_set_up, a list that keeps track of for which image the patches have already been initialized.

Create method that returns the patches selected for training for a specific image number and edits the pool accordingly

Only create PatchPool Object if active learning is activated, pass the object to pipeline if al activated. Passing of the object replaces parameter active_learning as marker in pipeline, that al is wished.

Make the modified function get_patches_data and it's usage in pipeline compatible again. Change order of arguments in pipeline, edit return statement in the function. Ajust query_training_patches() accordingly

To only get the information on how the patches indices will turn out after patching modify the function. The idea was to use this functionality to build the patch pool for active learning. For now however another technique is used.

Get changes from dev_christoph branch (commit f0ca9f8) to test branch. Includes mainly options to save the used patches and their origin for later analysis

Move parameters for determining the used number of patches to config (as temporary parameters) Also settings for first Exp9-1

Change the filename in which mosaic plots are saved so that it includes the name of the experiment (according to config) so that the predictions of different experiments can be distinguished

Also Settings for Plotting of Exp9 slices (Predict_slices_Exp9)

(Predict_slices_Exp9)

Small corrections to the results of merging the test branch with the developement branch Result is the momentary version of the AL pipeline at the end of my bachelors thesis.

Compare with the last master commit that was merged into this branch and match them as much as possible, again removing unnecessary changes

CDStark · 2021-03-25T20:35:10Z

Notes:

Also contains changes to argparse in main.py - will likely slightly change the interaction with the code in command line
(e.g. adding flag --train will result in args.train=True, not specifying --train will result in args.train=False)
My IDE (Pycharm) detects two errors in predict.py line 183 (indice_list_model unknown) and line 513 (read_mat unknown) in the curent master version (which I also merged into this commit
In pipeline.py I use the max_patch_num parameter in get_fixed_patches_index() and random_shift_patch in get_patches_data()

thomaskuestner

add new provided config parameters to config check for backward compatibility

med_io/pipeline.py

New config parameters active_learning and max_patch_num might not be present in some config files, therefore these checks are needed

thomaskuestner · 2021-03-30T11:06:53Z

@all-contributors please add @CDStark for code, maintenance

allcontributors · 2021-03-30T11:07:01Z

@thomaskuestner

I've put up a pull request to add @CDStark! 🎉

Christoph Staerk added 30 commits October 9, 2020 15:43

specified requirements versions and added new config file

27c5f3f

fixed paths to prediction nifti yaml file and some typos

d15c9ab

fix argparse to not require additional argument for boolean type options

293de38

update paths to Nifti Data and Model

4fa0777

Create new Config Files for TULIP 1_5T and 3T; change plot to axial

567c952

Enable save predict image as nii file

2fb8087

Add functionality to predict.py in save_img_mat() to save the image as a nii file. The filetype gets specified in config with parameter save_predict_img_datatype

Edit TULIP config files to save result as nii

6dbff05

Bugfix - specifying h5py version

8cff6b0

the new version of the h5py caused an error so the old version of the package gets specified in requirements.txt

First draft for Active learning in train.py

31ca1d6

Data loading for active learning

a523773

Establish the first part of loading data from tf record files for subsequent prediction necessary for active learning. The process is heavily inspired by pipeline.py and predict.py

Redesign AL implementation in train.py

066d337

To avoid redundancy in code, redesign the implementation of active learning in train.py. Build of training pipeline and fitting of the model are now in the same for loop (it doesn't matter if active learning is on or of)

Determine patch indices in active_learning.py

390821b

Patch data in active_learning.py

b55366b

Rename variables + introduce predict in al

38f02e7

Rename some variables to be better readable. Turn off random shiftig of patches in patching function. Add prediction statement (doesn't work at the moment)

Predict bugfix and calculate uncertainty in al

fd50a35

Fix prediction error in active_learning.py by casting the indice list to Float32. Add part that calculates an uncertainty value for every image.

Selection of most useful batches

d5bb768

In active_learning.py add code that selects the patches with the highest uncertainty value for training and return these to train.py

Fix get_pos_key for border patches not on lattice

d20586f

Bugfix patch selection in PatchPool

8385b0d

Create get_patches_to_train for PatchPool

7b924e0

Create method that returns the patches selected for training for a specific image number and edits the pool accordingly

Rebuild active learning implementation in train.py

44a204e

Only create PatchPool Object if active learning is activated, pass the object to pipeline if al activated. Passing of the object replaces parameter active_learning as marker in pipeline, that al is wished.

Edit pipeline and modified fnc get_patches_data()

0a77b13

Make the modified function get_patches_data and it's usage in pipeline compatible again. Change order of arguments in pipeline, edit return statement in the function. Ajust query_training_patches() accordingly

Fix get_patches_to_train to output indexes

628631c

Small changes and revert changes to pad_and_patch

c9931e5

Enable return indices only in get_patches_data

2ed5d4c

To only get the information on how the patches indices will turn out after patching modify the function. The idea was to use this functionality to build the patch pool for active learning. For now however another technique is used.

Christoph Staerk added 22 commits February 28, 2021 19:37

Partially merge changes from dev_christoph

4aa3e65

Get changes from dev_christoph branch (commit f0ca9f8) to test branch. Includes mainly options to save the used patches and their origin for later analysis

Changes for Exp9

06dbccd

Move parameters for determining the used number of patches to config (as temporary parameters) Also settings for first Exp9-1

Settings Exp9-2

4da7d6a

Settings Exp9-3

8dadbfa

Settings Exp9-4

6bdd207

Settings Exp9-5

10cc047

Settings Exp9-6

ca1a4f7

Settings Exp9-7

ccc74eb

Settings Exp9-8

0263574

Settings Exp9-9

a8d6dc7

Delete "unused" hdf5 patches data+Settings Exp8-2

647b48d

Settings Exp8-3

f1087c6

Save Predictions with Exp name + Settings Predct.

47b05ba

Change the filename in which mosaic plots are saved so that it includes the name of the experiment (according to config) so that the predictions of different experiments can be distinguished

Save plot by slice with exp_name in filename

dbc5a2d

Also Settings for Plotting of Exp9 slices (Predict_slices_Exp9)

Analyse saved patches info - accumulate data

c523987

patches info - add prints + fix bin key error

83319cd

Settings For Plotting of Exp9 slices with new img

34a3d47

(Predict_slices_Exp9)

Merge branch 'test-0121' into dev_christoph

b3baf47

End version AL - without temporary test code

aaa7793

Small corrections to the results of merging the test branch with the developement branch Result is the momentary version of the AL pipeline at the end of my bachelors thesis.

Remove parts before merge with master

2188853

Compare with master (last merge) - assimilate

7a852f6

Compare with the last master commit that was merged into this branch and match them as much as possible, again removing unnecessary changes

Merge branch 'master' into dev_christoph

2772634

thomaskuestner requested changes Mar 26, 2021

View reviewed changes

med_io/pipeline.py Show resolved Hide resolved

med_io/pipeline.py Show resolved Hide resolved

med_io/pipeline.py Show resolved Hide resolved

Add config checks for backwards compatability

de9f836

New config parameters active_learning and max_patch_num might not be present in some config files, therefore these checks are needed

thomaskuestner approved these changes Mar 30, 2021

View reviewed changes

thomaskuestner merged commit 5b36be3 into master Mar 30, 2021

allcontributors bot mentioned this pull request Mar 30, 2021

docs: add CDStark as a contributor #20

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dev christoph #17

Dev christoph #17

CDStark commented Mar 25, 2021

CDStark commented Mar 25, 2021

thomaskuestner left a comment

thomaskuestner commented Mar 30, 2021

allcontributors bot commented Mar 30, 2021

Dev christoph #17

Dev christoph #17

Conversation

CDStark commented Mar 25, 2021

CDStark commented Mar 25, 2021

thomaskuestner left a comment

Choose a reason for hiding this comment

thomaskuestner commented Mar 30, 2021

allcontributors bot commented Mar 30, 2021