
Add support for server-side batching for the TensorFlow Predictor #1193

Merged
58 commits merged into master from feature/server-side-batching on Aug 12, 2020

Conversation

@RobertLucian (Member) commented on Jul 1, 2020

Closes #1060.


checklist:

  • run make test and make lint
  • test manually (i.e. build/push all images, restart operator, and re-deploy APIs)
  • update examples
  • update docs and add any new files to summary.md (view in gitbook after merging)

# Conflicts:
#	pkg/operator/operator/k8s_specs.go
#	pkg/types/spec/validations.go
@RobertLucian requested a review from @deliahu on July 25, 2020 at 23:24
@RobertLucian (Member, Author) commented on Aug 7, 2020

@deliahu I added an example deployment of the ResNet50 model with server-side batching on Inferentia. Here is the link to the compiled model (with a fixed batch size of 5):
https://www.dropbox.com/s/yra52y2gqi8fm7f/rn50_fp16_compiled_b5_nc1.zip?dl=0

This means that the model path in the examples/tensorflow/image-classifier-resnet50/cortex_inf_server_side_batching.yaml config file has to be changed.
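For reference, a server-side batching API configuration for that example might look roughly like the sketch below. The field names (`server_side_batching`, `max_batch_size`, `batch_interval`) and the bucket path are assumptions for illustration, not copied from this PR; the only value taken from this thread is the fixed batch size of 5 that the compiled model expects.

```yaml
# Illustrative sketch of a cortex_inf_server_side_batching.yaml entry
# (hypothetical paths and values; see the linked compiled model above)
- name: image-classifier-resnet50
  predictor:
    type: tensorflow
    path: predictor.py
    # must point at the model re-compiled for Inferentia with a fixed batch size of 5
    model_path: s3://your-bucket/rn50_fp16_compiled_b5_nc1/
    server_side_batching:
      max_batch_size: 5     # must match the batch size the model was compiled with
      batch_interval: 0.1s  # how long to wait for requests to accumulate into a batch
  compute:
    inf: 1
    cpu: 1
```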

@RobertLucian merged commit 1930e63 into master on Aug 12, 2020
@RobertLucian deleted the feature/server-side-batching branch on August 12, 2020 at 20:19
Labels: enhancement (New feature or request), example (Create or improve an example)
Projects: None yet
Development

Successfully merging this pull request may close these issues.

Add support for server-side batch processing on Tensorflow/ONNX Predictors
2 participants