
Introduce unified batching #199

Merged: 10 commits merged into main on Dec 20, 2023
Conversation

@PawelPeczek-Roboflow (Collaborator) commented Dec 19, 2023

Description

This PR makes it possible to run inference against batches for all types of models, including models that statically define a batch size of 1 and cases where the inference payload is larger than the maximum batch size (defined via an environment variable).

  • When there is only one element to infer, everything works as before.
  • When the model input defines a static batch size and the payload contains a list, the list is sliced into chunks of at most the batch size and passed through the standard inference path; the partial results are then merged by the new merge_inference_results() method (see the sketch after this list).
  • When a maximum batch size is defined and the payload contains a list, the maximum batch size is used to slice the input list into consecutive inference calls.
  • This works smoothly with bs=1 and bs=auto. For other static batch sizes, if padding is actually needed, model classes should pad the input in their preprocess() method, as is done in object_detection_base.py; that is not implemented in this PR and should be added per the needs of a specific model (it may never be required).
  • I plan to create dummy exported ONNX models for all of the core models we support and build a suite of tests covering single-image and batch inference in different configurations (a sketch of the intended test style follows the usage example below). Those tests will be kept separate from the regression tests: I intend to load the models into a temporary cache, skip API keys, and run inference directly in Python rather than through the service, which should make them faster and more verbose than the regression tests, which are sometimes flaky.
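
Below is a minimal sketch of the slicing-and-merging strategy described above. It is illustrative only: the helper names (infer_in_chunks, run_single_batch) and the exact signature of merge_inference_results() are assumptions, not the implementation in this PR.

```python
from typing import Any, Callable, List


def merge_inference_results(results: List[List[Any]]) -> List[Any]:
    # Illustrative merge: concatenate the per-chunk result lists in order,
    # so the output lines up with the original input order.
    merged: List[Any] = []
    for chunk_result in results:
        merged.extend(chunk_result)
    return merged


def infer_in_chunks(
    images: List[Any],
    max_batch_size: int,
    run_single_batch: Callable[[List[Any]], List[Any]],
) -> List[Any]:
    # Slice the payload into chunks no larger than max_batch_size,
    # run the standard inference on each chunk, then merge the partial results.
    partial_results = []
    for start in range(0, len(images), max_batch_size):
        partial_results.append(run_single_batch(images[start:start + max_batch_size]))
    return merge_inference_results(partial_results)
```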

Example:

from inference.models import YOLOv8Classification, YOLOv8InstanceSegmentation, YOLOv8ObjectDetection

# IMAGE is any single image in a format accepted by infer(), e.g. a numpy array loaded by the caller

OBJECT_DETECTION_MODEL = YOLOv8ObjectDetection(model_id="coin-counting/64")
result = OBJECT_DETECTION_MODEL.infer([IMAGE] * 6)

CLASSIFICATION_MODEL = YOLOv8Classification(model_id="vehicle-classification-eapcd/2")
result = CLASSIFICATION_MODEL.infer([IMAGE] * 6)

SEGMENTATION_MODEL = YOLOv8InstanceSegmentation(model_id="asl-poly-instance-seg/53")
result = SEGMENTATION_MODEL.infer([IMAGE] * 6)

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

How has this change been tested? Please provide a test case or example of how you tested the change.

  • tested locally
  • added new integration tests
  • CI still green

Any specific deployment considerations

For example, documentation changes, usability, usage/costs, secrets, etc.

Docs

  • Docs updated? What were the changes:

@PawelPeczek-Roboflow marked this pull request as ready for review on December 20, 2023 at 14:27.
@paulguerrie (Contributor) left a comment:

Looks good to me!

@paulguerrie merged commit 5f70939 into main on Dec 20, 2023 (4 checks passed)
@paulguerrie deleted the feature/create_unified_batching branch on December 20, 2023 at 16:01