[FEA] Save input and output schema when `.save` methods are called on models #669

oliverholworthy · 2022-08-22T10:48:54Z

🚀 Feature request

Provide a consistent .save interface for all models.

This .save method should save the model artifact along with the schema to a directory provided.

Motivation

Model artifacts alone are not enough to unambiguously infer the correct input schema for a model.
Saving the input schema along with models will enable serving code in systems to figure out the correct inputs required for models from an artifact without requiring the user to provide the schema at serving time.

Part of: NVIDIA-Merlin/Merlin#489

Proposed interface

Create a runtime check-able protocol that specifies the common methods expected on a model object.

This .save method on a model will write out the model artifact(s) along with the input schema to a directory provided.

Files saved in merlin metadata directory.

input_schema.json and output_schema.json). Serialized Merlin Schema using the TensorflowMetadata format currently provided in merlin.core
model.json
- module and class names to support reconstructing the same model

Saved model directory structure

model artifacts to be saved in top-level of directory
merlin metadata to be saved in merlin_metadata subdirectory.

e.g. a model.save("my_merln_model") on a tensorflow backend should result in the following directory structure:


my_merlin_model
├── merlin_metadata
│   ├── input_schema.json
│   ├── output_schema.json
│   └── model.json
├── assets
├── keras_metadata.pb
├── saved_model.pb
└── variables
    ├── variables.data-00000-of-00001
    └── variables.index

Sub-Tasks

The text was updated successfully, but these errors were encountered:

EvenOldridge · 2022-09-29T21:55:54Z

@marcromeyn @oliverholworthy what's the status of this? This is blocking the creation of example notebooks for end to end using systems.

rnyak · 2022-10-19T15:41:34Z

partially addressed by #680

oliverholworthy · 2022-10-19T15:57:33Z

This PR (#680) handles the first two tasks.

Adding the protocol
and updating save method of tensorflow models.

There is a bit more to think about beyond this. Adding output schema for tensorflow models. And figuring out how to enforce that a schema is created. Since the Merlin Models API is flexible enough at the moment that we can't always guarantee that we have a schema available. At least not in a way that provides any more information than the saved model signature is able to (since we could infer the schema from the saved model like we do in Merlin Systems currently).

oliverholworthy added the status/needs-triage label Aug 22, 2022

karlhigley added this to the Merlin 22.09 milestone Aug 24, 2022

karlhigley added the enhancement New feature or request label Aug 24, 2022

oliverholworthy mentioned this issue Aug 25, 2022

[Task] Create a protocol for common methods across model frameworks #676

Closed

oliverholworthy self-assigned this Aug 25, 2022

oliverholworthy mentioned this issue Aug 25, 2022

[Task] Implement save method on Tensorflow implementation of models. #677

Closed

viswa-nvidia modified the milestones: Merlin 22.09, Merlin 22.10 Sep 26, 2022

oliverholworthy changed the title ~~[FEA] Save input schema when .save methods are called on models~~ [FEA] Save input and output schema when .save methods are called on models Oct 5, 2022

oliverholworthy modified the milestones: Merlin 22.10, Merlin 22.11 Oct 10, 2022

oliverholworthy mentioned this issue Nov 15, 2022

[RMP] Tensorflow support for session based recommendations integration in Merlin NVIDIA-Merlin/Merlin#433

Closed

37 tasks

oliverholworthy mentioned this issue Dec 19, 2022

[Task] Enable pickle serialzation of Models with Merlin Schema Saved alongside #928

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Save input and output schema when `.save` methods are called on models #669

[FEA] Save input and output schema when `.save` methods are called on models #669

oliverholworthy commented Aug 22, 2022 •

edited

Loading

EvenOldridge commented Sep 29, 2022

rnyak commented Oct 19, 2022 •

edited

Loading

oliverholworthy commented Oct 19, 2022 •

edited

Loading

[FEA] Save input and output schema when .save methods are called on models #669

[FEA] Save input and output schema when .save methods are called on models #669

Comments

oliverholworthy commented Aug 22, 2022 • edited Loading

🚀 Feature request

Motivation

Proposed interface

Saved model directory structure

Sub-Tasks

EvenOldridge commented Sep 29, 2022

rnyak commented Oct 19, 2022 • edited Loading

oliverholworthy commented Oct 19, 2022 • edited Loading

[FEA] Save input and output schema when `.save` methods are called on models #669

[FEA] Save input and output schema when `.save` methods are called on models #669

oliverholworthy commented Aug 22, 2022 •

edited

Loading

rnyak commented Oct 19, 2022 •

edited

Loading

oliverholworthy commented Oct 19, 2022 •

edited

Loading