[Feature Request]: Support Streaming model updates for the RunInference transform #24042

AnandInguva · 2022-11-08T22:45:23Z

What would you like to happen?

For streaming pipelines in Python, add support for updating models.

Allow streaming pipelines to update the model(s) in use without requiring pipeline lifecycle events (Update / Drain).

Issue Priority

Priority: 2

Issue Component

Component: run-inference

AnandInguva · 2022-11-08T22:45:34Z

.take-issue

AnandInguva · 2023-01-27T17:05:14Z

Part 1: Add an API that accepts a singleton side input PCollection.
Part 2: Show a use case of the API. Example: the WatchFilePattern transform, with an example.
Part 3: Add a DLQ to catch errors.

Also, update the Beam website documentation with:

## Side Inputs to Update Models
From Beam 2.45.0, the RunInference PTransform will accept a side input of `ModelMetadata`, which is a `NamedTuple` containing the `model_id` and `model_name`.
  * `model_id`: Used to load the model. It can be a URI or a path to the model.
  * `model_name`: Unique identifier attached to the metrics. Keep it short relative to the `model_id` so that it can be attached to the metrics to identify which model was used to calculate them.
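
As a rough illustration of the shape described above, the following stand-in mirrors the two fields. It is a local sketch, not the Beam class itself. The `short_name_from_id` helper and the bucket path are hypothetical and only show one way to keep `model_name` short relative to `model_id`.

```python
import posixpath
from typing import NamedTuple


class ModelMetadata(NamedTuple):
    """Local stand-in mirroring the two fields described above."""
    model_id: str    # URI or path used to load the model
    model_name: str  # short identifier attached to metrics


def short_name_from_id(model_id: str) -> str:
    """Hypothetical helper: derive a short, metrics-friendly name from a path."""
    base = posixpath.basename(model_id.rstrip('/'))
    stem, _ = posixpath.splitext(base)
    return stem


# Hypothetical model location, used only for illustration.
MODEL_URI = 'gs://my-bucket/models/resnet_v2.pt'

metadata = ModelMetadata(
    model_id=MODEL_URI,
    model_name=short_name_from_id(MODEL_URI))
```

Here `metadata.model_name` is the file stem of the URI, which stays short even when the full path is long.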

**Note**: The side input PCollection must be compatible with the `AsSingleton` view, or the pipeline will fail with an error.

**Note**: If the main PCollection emits inputs before the side input has received any, the main PCollection is buffered until the side input is updated. This can happen with globally windowed side inputs that use data-driven triggers such as `AfterCount` or `AfterProcessingTime`. To avoid this, emit the default/initial model ID (the one passed to the respective `ModelHandler`) as the side input until an update arrives.
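
The advice in the note above can be sketched in plain Python as follows. This is not Beam code: `ModelMetadata` is a local stand-in, and `with_default_first` and the model paths are hypothetical. It assumes the source of model updates can be treated as an iterable. The idea is only that the default model is emitted first, so a value exists before any real update arrives.

```python
from typing import Iterable, Iterator, NamedTuple


class ModelMetadata(NamedTuple):
    """Local stand-in for the side input element described above."""
    model_id: str
    model_name: str


def with_default_first(
        default: ModelMetadata,
        updates: Iterable[ModelMetadata]) -> Iterator[ModelMetadata]:
    """Yield the default model first, then each update that changes the model.

    Consecutive duplicates are skipped so that an unchanged model does not
    trigger a reload.
    """
    yield default
    previous = default
    for update in updates:
        if update != previous:
            yield update
            previous = update


# Hypothetical model locations, used only for illustration.
default_model = ModelMetadata('/models/v1.pt', 'v1')
v2 = ModelMetadata('/models/v2.pt', 'v2')

emitted = list(with_default_first(default_model, [default_model, v2, v2]))
```

In this sketch `emitted` holds the default model followed by a single `v2` entry. In a real pipeline the equivalent step would feed the side input PCollection.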

Some other ideas as extension of this feature:

Add a read/write lock around the model update and RunInference parts.
An idea from Danny: https://github.com/apache/beam/pull/25200/files#r1089107034
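
A minimal sketch of the read/write lock idea, assuming inference calls act as readers and a model swap acts as the single writer. `ReadWriteLock` and `SwappableModel` are hypothetical names for illustration, not part of Beam, and this simple version does not protect writers from starvation under a constant stream of readers.

```python
import threading
from contextlib import contextmanager


class ReadWriteLock:
    """Minimal readers-writer lock: many concurrent readers or one writer."""

    def __init__(self):
        self._cond = threading.Condition()
        self._readers = 0
        self._writing = False

    @contextmanager
    def read(self):
        with self._cond:
            while self._writing:
                self._cond.wait()
            self._readers += 1
        try:
            yield
        finally:
            with self._cond:
                self._readers -= 1
                if self._readers == 0:
                    self._cond.notify_all()

    @contextmanager
    def write(self):
        with self._cond:
            while self._writing or self._readers:
                self._cond.wait()
            self._writing = True
        try:
            yield
        finally:
            with self._cond:
                self._writing = False
                self._cond.notify_all()


class SwappableModel:
    """Holds a callable model that can be replaced while inference runs."""

    def __init__(self, model):
        self._lock = ReadWriteLock()
        self._model = model

    def run_inference(self, batch):
        # Readers: several batches may run against the same model at once.
        with self._lock.read():
            return [self._model(x) for x in batch]

    def update(self, new_model):
        # Writer: waits for in-flight batches, then swaps the model.
        with self._lock.write():
            self._model = new_model
```

With this shape, an update never replaces the model in the middle of a batch, and batches keep running concurrently whenever no update is pending.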

AnandInguva · 2023-04-01T18:40:30Z

.close-issue

AnandInguva added awaiting triage new feature labels Nov 8, 2022

github-actions bot removed the awaiting triage label Nov 8, 2022

github-actions bot assigned AnandInguva Nov 8, 2022

github-actions bot added ml P2 python run-inference labels Nov 8, 2022

AnandInguva mentioned this issue Jan 27, 2023

Add sideinputs to the RunInference Transform #25200

Merged

3 tasks

AnandInguva mentioned this issue Feb 9, 2023

Add WatchFilePattern #25393

Merged

3 tasks

AnandInguva closed this as completed Apr 1, 2023

github-actions bot added this to the 2.47.0 Release milestone Apr 1, 2023

damccorm added the done & done Issue has been reviewed after it was closed for verification, followups, etc. label Apr 4, 2023
