Tests & Refactor (incl. dependencies, CICD workflow, Documentation workflow) & Doc. #14
Conversation
…esults. Replace with pyg knn in 1/2 places
Refactor to extract knns as functions instead of methods
No regression on large las IoU
Remove torch-points-kernels dependency
Simplify syntax
Always use host shared memory option
Simplify syntax
@@ -86,13 +89,13 @@ def forward(self, batch):
        """

        input = torch.cat([batch.pos, batch.x], axis=1)
-       chunks = torch.split(input, len(batch.pos) // batch.batch_size)
+       chunks = torch.split(input, len(batch.pos) // batch.num_batches)
I have the feeling that here we change the shape of the tensor, doing the inverse of the manipulation performed with the np.concatenate() calls in transform.py L337. Is that the case, and is this transformation/un-transformation necessary?
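For context on the question above: splitting the batched tensor into equal per-sample chunks is indeed the inverse of the concatenation done at batching time, provided every sample contributes the same number of points. A torch-free sketch of the round-trip (helper names are illustrative, not the project's API):

```python
# Illustrative round-trip: concatenate per-sample point lists into one
# flat batch, then split the batch back into equal per-sample chunks.
def concat_samples(samples):
    batch = []
    for sample in samples:
        batch.extend(sample)
    return batch

def split_batch(batch, num_samples):
    # Mirrors torch.split(input, len(batch) // num_samples):
    # only exact when all samples have the same number of points.
    chunk = len(batch) // num_samples
    return [batch[i * chunk:(i + 1) * chunk] for i in range(num_samples)]

samples = [[1, 2, 3], [4, 5, 6]]
batch = concat_samples(samples)
assert split_batch(batch, len(samples)) == samples  # round-trip holds
```

The round-trip breaks if samples have unequal point counts, which is why the integer division by the number of batches assumes fixed-size subsampling upstream.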
        import comet_ml
except:
    # It is safer to import comet before all other imports.
    import comet_ml  # noqa
Flake8 complains here because comet_ml is not used in this file; does the library still need to be imported anyway to retrieve what it needs?
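For reference, the pattern under discussion usually looks like the guarded import below: Comet's documentation recommends importing comet_ml before other ML libraries so it can instrument them, and `# noqa` tells flake8 that the apparently unused import (F401) is intentional. A minimal sketch, assuming comet_ml may or may not be installed:

```python
try:
    # Comet asks to be imported before torch & friends so it can hook
    # into them; the import is kept even though it looks unused here.
    import comet_ml  # noqa: F401
except ImportError:
    # Tracking is optional: fall back gracefully when comet_ml is absent.
    comet_ml = None
```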
    which contains LAS files with a classification).

finetune:
    Finetunes a checkpointed neural network on a prepared dataset, which muste be specified
small typo: "must" without the final e
kwargs_to_override = copy.deepcopy(model.hparams)
NEURAL_NET_ARCHITECTURE_CONFIG_GROUP = "neural_net"
I have already made suggestions about these constants; here is another one: you could also simply put them at the top of the file (having them in the middle of a function stresses me out).
@@ -142,14 +157,19 @@ def train(config: DictConfig) -> Optional[float]:

    if "finetune" in task_name:
        log.info("Starting finetuning pretrained model on new data!")
-       # here rebuild model but overwrite everything except module related params
+       # Instantiates the Model but overwrites everything with current config,
+       # except module related params (nnet architecture)
        kwargs_to_override = copy.deepcopy(model.hparams)
I don't think you need to make a copy of the dictionary, since right after this you rebuild a new dictionary anyway (with a filter on the key names).
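The remark above can be demonstrated in isolation: a dict comprehension already builds a brand-new dict, so the deepcopy before the filtering step is redundant (note that neither approach clones the values themselves unless asked to). A minimal sketch with made-up hyperparameter names:

```python
# Illustrative hparams dict; keys and values are assumptions for the demo.
source = {"lr": 0.01, "neural_net_depth": 4, "batch_size": 8}

# The comprehension creates a fresh dict object while filtering keys,
# so no prior deepcopy of `source` is required.
filtered = {k: v for k, v in source.items() if "neural_net" not in k}

assert filtered == {"lr": 0.01, "batch_size": 8}
assert filtered is not source  # a new dict: mutating it leaves source intact
```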
kwargs_to_override = {
    key: value
    for key, value in kwargs_to_override.items()
-   if "neural_net" not in key
+   if NEURAL_NET_ARCHITECTURE_CONFIG_GROUP not in key
}
model = Model.load_from_checkpoint(config.model.ckpt_path, **kwargs_to_override)
From what I understand, the whole dictionary procedure above exists to drop the keys containing "neural_net", so that "neural_net" is not passed as a parameter to load_from_checkpoint. I suppose that is because the model architecture is already defined and cannot be changed. However, I could not find where this "neural_net" appears in the configs — is it still used? If not, we might as well remove all this "clutter". If it is still used, the documentation should clearly state what it is for and that this naming must be kept.
# Hydra changes CWD, and therefore absolute paths are preferred
abs_path_to_toy_LAS = osp.abspath(LAS_SUBSET_FOR_TOY_DATASET)
command = [
    "run.py",
We do agree that the tests involving a "run" are functional tests, not unit tests? (not that this is bad, just a clarification)
@pytest.mark.slow()
def test_RandLaNet_overfitting(isolated_toy_dataset_tmpdir, tmpdir):
    """Check ability to overfit with RandLa-Net.
I do not understand the point of testing overfitting.
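For context on why such a test can be useful: successfully overfitting a tiny dataset is a standard sanity check that the training loop, gradients, and optimizer are wired correctly — if the loss cannot be driven near zero on a handful of samples, something is broken. A torch-free sketch of the idea on a toy linear model (all names are illustrative, not the project's API):

```python
# Toy "overfitting test": fit y = w*x + b on three points by gradient
# descent and check the final mean squared error is effectively zero.
def overfit_tiny(xs, ys, lr=0.1, steps=200):
    w, b = 0.0, 0.0
    for _ in range(steps):
        # Gradients of the mean squared error for the model y = w*x + b.
        gw = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / len(xs)
        gb = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / len(xs)
        w -= lr * gw
        b -= lr * gb
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# The data lie exactly on y = 2x + 1, so the loop must drive loss to ~0.
final_loss = overfit_tiny([0.0, 1.0, 2.0], [1.0, 3.0, 5.0])
assert final_loss < 1e-3
```

The RandLa-Net/PointNet overfitting tests in this PR apply the same principle at model scale: they assert trainability, not generalization.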
assert dim in a1.dtype.fields.keys()


def check_las_does_not_contains_dims(las_path, dims_to_check=[]):
Rather than testing that the LAS does not contain certain fields, perhaps check that it contains all the desired fields and only those (more exhaustive). Of course, the complete list of fields must be known, but I believe that is the case.
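The suggestion above could be sketched as a set comparison over dimension names (the helper name and signature below are assumptions, not the project's API):

```python
# Hypothetical exhaustive check: assert the LAS point dimensions match the
# expected set exactly, reporting both missing and unexpected names.
def check_las_contains_exactly_dims(actual_dims, expected_dims):
    actual, expected = set(actual_dims), set(expected_dims)
    missing = expected - actual
    unexpected = actual - expected
    assert not missing, f"missing dims: {sorted(missing)}"
    assert not unexpected, f"unexpected dims: {sorted(unexpected)}"

# Passes: same names, order irrelevant.
check_las_contains_exactly_dims(["X", "Y", "Z"], ["Z", "Y", "X"])
```

In the real test, `actual_dims` would come from the loaded LAS array's dtype fields, as in the assertion above.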
A test suite that covers typical use cases: training and prediction from the CLI, successive train+test, a dry run on RandLaNet, and overfitting tests with RandLaNet and PointNet to ensure that the models are trainable.

The torch-points-kernels dependency is removed and replaced with pyg, which adds some complexity to the code but simplifies installation of the virtual environment. The resulting code is backward compatible with previous models and fully tested for regressions (IoU is unchanged on a 15 km² test set). Corrections to the Dockerfile are also implemented; in particular, CUDA images were broken by a CUDA update and needed to be adjusted.

Workflows make good use of caching functionalities, both from Docker and from the GitHub environment.

Requirements files are simplified, and dependencies are installed without redundant command lines. The torchmetrics version is pinned, because pytorch-lightning would otherwise use a newer, non-backward-compatible version.