
[WIP] 486 Add example model package #487

Closed · wants to merge 22 commits from 486-add-model-package

Conversation


@Nic-Ma Nic-Ma commented Dec 16, 2021

Fixes #486 .

Description

This PR implements a draft example of the model package for discussion, leveraging @ericspod's great work on saving data: Project-MONAI/MONAI#3138

  1. Basic principles
    (1) Structured sharable data: define metadata and components in structured files such as JSON or YAML, with a predefined schema to verify the JSON configs, following https://json-schema.org/. We can host the common base schema in public web storage in the future.
    (2) TorchScript package: export the model weights, metadata, component configs, etc. into a TorchScript file.
    (3) Hybrid programming: for 90% of cases, the JSON configs alone should be enough. For advanced cases, the developer of a model package decides which parts of the workflow should be shared for others to reconstruct their own logic, defines those parts in JSON or YAML config, and implements the remaining customized logic in a Python program. For example, this PR defines the transforms, dataset, dataloader, inferer, etc. in the inference.json config and implements a task-specific inference logic with a native PyTorch program in inference.py; other teams can easily leverage the config file to reconstruct the necessary components and implement their own specific inference logic.
    (4) Data sharing: the Python program can leverage the ConfigParser to get constructed instances or raw config items from a config file with lazy instantiation; configs can refer to / extend other items in the same file or even in other files.

  2. Structure of the package
    (1) commands: entry points for execution, e.g. export.sh exports the network weights, metadata, config, etc. to a TorchScript model, and inference.sh loads the TorchScript model, constructs instances, and executes inference. We should enhance this into something like monairun to easily support all platforms and environments.
    (2) configs: structured configs for shareable components or information, in JSON or YAML, e.g. metadata.json records the necessary meta information of the model package, and inference.json defines the args and components for inference usage.
    (3) docs: necessary documents, images, etc., e.g. README.md, license.txt, tensorboard.png, depending on the model context.
    (4) models: pretrained model weights and the exported TorchScript model file.
    (5) [Optional] scripts: customized Python programs, if any, e.g. inference.py leverages the structured components to construct a special inference program.
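The lazy-instantiation idea in principle (4) can be sketched with a toy resolver. The `_target_` key and the `instantiate` helper below are illustrative stand-ins, not the actual ConfigParser API:

```python
import importlib
import json

def instantiate(cfg: dict):
    """Build an object from {"_target_": "module.Class", ...kwargs}.

    Toy sketch only: the real ConfigParser supports much richer
    referencing, overriding, and lazy evaluation than this.
    """
    module_name, _, cls_name = cfg["_target_"].rpartition(".")
    cls = getattr(importlib.import_module(module_name), cls_name)
    kwargs = {k: v for k, v in cfg.items() if k != "_target_"}
    return cls(**kwargs)

# A config entry read from JSON names the class and its arguments:
config_text = '{"_target_": "datetime.timedelta", "days": 2, "hours": 12}'
td = instantiate(json.loads(config_text))
print(td.total_seconds())  # 216000.0
```

The point of the pattern is that the JSON stays declarative and shareable, while the Python side decides when (and whether) each component is actually constructed.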

Status

Work in progress

Checks

  • Notebook runs automatically ./runner [-p <regex_pattern>]

Signed-off-by: Nic Ma <nma@nvidia.com>
Nic-Ma force-pushed the 486-add-model-package branch from 24c93db to fefa39a on December 16, 2021 02:43
Signed-off-by: Nic Ma <nma@nvidia.com>

Nic-Ma commented Dec 16, 2021

I haven't implemented any config-parsing related logic in it so far; it is marked as TODO: in the code.

Thanks.


Nic-Ma commented Dec 17, 2021

Interesting comments from @ericspod during the online meeting:

  1. Just a note that .ts is meant to be TorchScript, but to web developers it'll look like a TypeScript file; we may want to choose something else if anyone finds that confusing.
  2. Versioning the package would help; if we used git repos to store things, the commit hash becomes part of the version implicitly.

Thanks.


Nic-Ma commented Dec 20, 2021

Hi @ericspod @wyli ,

Maybe I misunderstood your suggestion: how can we get the commit hash if we want to save it in the metadata for this same commit...?

Thanks in advance.

ericspod (Member) commented:

> Hi @ericspod @wyli ,
>
> Maybe I misunderstood your suggestion, how can we get the commit hash if we want to save the commit hash in meta data for this commit...?
>
> Thanks in advance.

What I was thinking was, from the perspective of a user, the commit hash could be used to determine model versions. It wouldn't be included in the package but would be used to tell apart model versions that claim to be the same version, i.e. if the user forgets to increment the version.
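For context, a sketch of how a tool could pick up this implicit version; the helper names here are hypothetical, not part of the PR:

```python
import re
import subprocess

# Full git SHA-1 commit hashes are 40 lowercase hex characters.
HASH_RE = re.compile(r"^[0-9a-f]{40}$")

def is_commit_hash(s: str) -> bool:
    """Check whether a string looks like a full git commit hash."""
    return bool(HASH_RE.match(s))

def current_commit_hash(repo_dir: str = ".") -> str:
    """Return the HEAD commit of the repo at repo_dir (requires git).

    Hypothetical helper: the commit hash identifies the package source
    revision without being stored inside the package itself.
    """
    return subprocess.check_output(
        ["git", "rev-parse", "HEAD"], cwd=repo_dir, text=True
    ).strip()

print(is_commit_hash("24c93db"))  # False (short hashes are abbreviations)
```

Abbreviated hashes like those shown in the force-push events above would need `git rev-parse` to expand them before comparison.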


Nic-Ma commented Dec 21, 2021

> Hi @ericspod @wyli ,
> Maybe I misunderstood your suggestion, how can we get the commit hash if we want to save the commit hash in meta data for this commit...?
> Thanks in advance.

> What I was thinking was, from the perspective of a user, the commit hash could be used to determine model versions. It wouldn't be included in the package but would be used to tell model versions apart that claim to be the same version, ie. if the user forgets to increment the version.

Thanks for your explanation, makes sense to me!


Nic-Ma commented Jan 17, 2022

Slightly adjusted the folder structure; thanks for the reference to the MONAI Application Package:
https://github.com/Project-MONAI/monai-deploy/blob/main/guidelines/monai-application-package.md#table-of-important-paths

Thanks.


Nic-Ma commented Jan 17, 2022

Hi @wyli @ericspod ,

Following several internal discussions, I updated the example model package to show new ideas:

  1. inference.py shows how to get a config item, modify it, and then instantiate it, and also how to get a constructed instance directly; these are the two ways of JSON / Python hybrid programming.
  2. inference_v2.py shows how to write a new config based on a base config and override some entries.
  3. trtinfer.json and trtinfer.py show how to refer to config items in another JSON file or the same file, simulating TensorRT and DALI inference logic.

Could you please help take a look again?

Thanks in advance.
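The base-config-plus-override pattern described above can be sketched with a toy resolver. The `@key` reference syntax here is illustrative only; the real config syntax, including cross-file references as in trtinfer.json, is richer:

```python
import json

def resolve(cfg: dict) -> dict:
    """Resolve toy '@key' string references to other top-level entries.

    Sketch only: real configs can also refer to items in other files
    and extend nested structures.
    """
    def _res(v):
        if isinstance(v, str) and v.startswith("@"):
            return _res(cfg[v[1:]])
        if isinstance(v, dict):
            return {k: _res(x) for k, x in v.items()}
        if isinstance(v, list):
            return [_res(x) for x in v]
        return v
    return {k: _res(v) for k, v in cfg.items()}

# A base config defines spacing once; another entry references it.
base = json.loads('{"spacing": [1.0, 1.0, 1.2], "resample": "@spacing"}')
# An override (as inference_v2 might supply) replaces the base value,
# and the reference picks up the new value automatically.
override = {"spacing": [1.5, 1.5, 2.0]}
merged = resolve({**base, **override})
print(merged["resample"])  # [1.5, 1.5, 2.0]
```

Because references are resolved after merging, an override of one entry propagates to every entry that refers to it.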

Signed-off-by: Nic Ma <nma@nvidia.com>
Nic-Ma force-pushed the 486-add-model-package branch from 18801c6 to 1b01dad on January 17, 2022 10:16
Signed-off-by: Nic Ma <nma@nvidia.com>

Nic-Ma commented Jan 18, 2022

Hi @ericspod @wyli ,

I added the tensorboard.png and mlflow.png into the model package for training curves.

Thanks.

Signed-off-by: Nic Ma <nma@nvidia.com>
aihsani left a comment

Not sure why we need shell scripts in the model package... they're not easily portable across operating systems, and not descriptive about what they each achieve. Wouldn't it make more sense to remove the shell scripts and implement functionality in MONAI that can interpret/generate the JSON artifacts via a CLI, based on the framework being used?


Nic-Ma commented Jan 21, 2022

Hi @aihsani ,

Thanks for your suggestion!
I think the shell scripts are there to simplify the common use cases and args, especially for researchers tuning args when training an MMAR; for example, they may have several different configs and just switch the config file in the shell.
For example, the inference.sh in this example is:

python ../scripts/inference.py \
    --config ../configs/inference.json \
    --meta ../configs/metadata.json

And you are right, we should definitely not put complicated logic in the shell script; the main logic should live in the Python or JSON files, and you can call the Python program inference.py directly.

Thanks.


Nic-Ma commented Jan 25, 2022

I added a schema example to check the metadata format and expected values.
It's based on the JSON Schema standard: http://json-schema.org/.
I didn't define a schema for all the items, just included the key required items as an example.
We can define the MONAI MMAR schema and put it into public web storage in the future.

Thanks.
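In practice the `jsonschema` package would validate metadata.json against such a schema; a stdlib-only toy check of the two most common keywords illustrates what the schema enforces. The metadata field names below are illustrative, not the PR's exact schema:

```python
import json

# Map JSON-schema type names to Python types for a toy check.
TYPES = {"object": dict, "string": str, "array": list, "number": (int, float)}

def check(instance, schema) -> bool:
    """Toy validation of 'type', 'required', and 'properties' keywords.

    Sketch only: the jsonschema package implements the full standard.
    """
    if "type" in schema and not isinstance(instance, TYPES[schema["type"]]):
        return False
    for key in schema.get("required", []):
        if key not in instance:
            return False
    for key, sub in schema.get("properties", {}).items():
        if key in instance and not check(instance[key], sub):
            return False
    return True

schema = json.loads("""{
  "type": "object",
  "required": ["version", "monai_version", "network_data_format"],
  "properties": {"version": {"type": "string"}}
}""")

meta = {"version": "0.1.0", "monai_version": "0.8.0",
        "network_data_format": {}}
print(check(meta, schema))            # True
print(check({"version": 1}, schema))  # False: wrong type, missing keys
```

Publishing the schema separately (as suggested above) lets any consumer validate a metadata.json before trusting the package contents.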

Signed-off-by: Nic Ma <nma@nvidia.com>

Nic-Ma commented Jan 25, 2022

Hi @wyli ,

I changed the override feature into a separate MMAR example, per our online discussion, to make it clearer.

Thanks.

Signed-off-by: Nic Ma <nma@nvidia.com>

Nic-Ma commented Jan 27, 2022

Hi @ericspod @wyli @rijobro @aihsani ,

I have updated this MMAR example according to our online discussion yesterday, and moved the Python scripts and schema to MONAI core to support 90% of cases:
https://github.com/Project-MONAI/MONAI/tree/506b9089637d7dd9ed3496dde9deee8c669f269d/monai/apps/mmars

Thanks.

ericspod (Member) commented:

To recap what we had discussed about actual file structure, let's say we have a directory with at least these files:

models/model.ts
models/model.pt
configs/metadata.json

We can consider this directory the model package itself. The metadata.json would contain the information currently provided in this PR, plus the arguments to instantiate the network, rather than that being in the inference config. If the model isn't compatible with TorchScript then the model.ts file will be absent, and this information will be needed to recreate the network before loading the stored weights.

This directory can then be packaged into a zip file whose structure can be assumed to contain the models and configs directories. If the model is compatible with TorchScript then a new .ts file can be made with the metadata stored internally and other files saved as extra files using save_net_with_metadata. Users can then load this TorchScript model without worrying about the metadata or anything else MONAI-specific if they just want to get things started quickly. Either way we'd have a zip file that's easy to distribute and which other software can expect as input.
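The zip-packaging half of this idea can be sketched with the stdlib; the `package_model` helper and file contents here are illustrative, and the TorchScript route would instead embed metadata via torch.jit.save's `_extra_files` / the save_net_with_metadata helper mentioned above:

```python
import json
import tempfile
import zipfile
from pathlib import Path

def package_model(root: Path, out_zip: Path) -> None:
    """Zip a model-package directory, preserving the assumed
    models/ + configs/ layout so consuming software can rely on it."""
    with zipfile.ZipFile(out_zip, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(root.rglob("*")):
            if path.is_file():
                zf.write(path, path.relative_to(root).as_posix())

# Build a throwaway package directory (the weight bytes are fake)
# and round-trip the metadata through the zip.
work = Path(tempfile.mkdtemp())
root = work / "pkg"
(root / "models").mkdir(parents=True)
(root / "configs").mkdir()
(root / "models" / "model.pt").write_bytes(b"fake-weights")
(root / "configs" / "metadata.json").write_text(
    json.dumps({"version": "0.1.0"})
)

out = work / "pkg.zip"
package_model(root, out)

with zipfile.ZipFile(out) as zf:
    names = zf.namelist()
    meta = json.loads(zf.read("configs/metadata.json"))
print(names, meta["version"])
```

Because the archive paths are fixed relative to the package root, any consumer can locate configs/metadata.json without MONAI installed.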


wyli commented Apr 27, 2022

wyli closed this on April 27, 2022
wyli deleted the 486-add-model-package branch on June 14, 2022 05:40
Successfully merging this pull request may close these issues.

Add example model package
4 participants