DL models as serverless functions #1767

nmanovic · 2020-06-19T21:39:28Z

Fix #796, fix #792, fix #743, fix #297, fix #296, fix #197, fix #196, fix #896, fix #910, fix #1028, fix #1832, fix #1846, fix #1551

Motivation and context

Before this PR, CVAT kept all "automatic annotation" features inside a single cvat container, together with CUDA, OpenVINO, and extra Python packages. In addition, each "DL model" (such as Mask-RCNN, DEXTR, Faster RCNN) was implemented as a Django application with its own REST API, which made it very difficult to support new models.

This PR solves most of these issues. All "automatic annotation" features are now serverless functions. The current implementation uses the nuclio framework (https://github.com/nuclio/nuclio) to deploy and invoke them. Each serverless function runs in a separate Docker container that is accessible over HTTP. lambda_manager is a Django app that provides a convenient REST API for working with serverless functions:

It is possible to call a function directly using POST /api/v1/lambda/functions/<name> or send a request POST /api/v1/lambda/requests.
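
A minimal sketch of the two ways to invoke a function, using only the Python standard library. It only builds the requests and does not send them. The endpoint paths come from the description above; the host `http://localhost:8080`, the helper name `build_call`, and the payload field names (`task`, `frame`, `function`) are assumptions made for illustration, not the documented schema.

```python
import json
from urllib import request

API_PREFIX = "/api/v1/lambda"  # path prefix taken from the PR description


def build_call(base_url, payload, func_name=None):
    """Build (without sending) a POST request to the lambda_manager API.

    With func_name set, target POST /functions/<name> (direct call);
    otherwise target POST /requests.
    """
    if func_name is not None:
        url = f"{base_url}{API_PREFIX}/functions/{func_name}"
    else:
        url = f"{base_url}{API_PREFIX}/requests"
    body = json.dumps(payload).encode("utf-8")
    return request.Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


# Hypothetical payloads: the field names are assumptions, not the real schema.
direct = build_call("http://localhost:8080", {"task": 1, "frame": 0},
                    func_name="public.dextr")
queued = build_call("http://localhost:8080",
                    {"function": "public.dextr", "task": 1})

print(direct.get_method(), direct.full_url)
print(queued.get_method(), queued.full_url)
```

Actually sending either request would be `request.urlopen(direct)`, which needs a running CVAT server and valid credentials.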

How has this been tested?

It was tested manually.

Checklist

I submit my changes into the develop branch
I have added description of my changes into CHANGELOG file
I have updated the documentation accordingly
I have added tests to cover my changes
I have linked related issues (read github docs)
I have increased versions of npm packages if it is necessary (cvat-canvas, cvat-core, cvat-data and cvat-ui)

License

I submit my code changes under the same MIT License that covers the project.
Feel free to contact the maintainers if that's a concern.
I have updated the license header for each file (see an example below)

# Copyright (C) 2020 Intel Corporation
#
# SPDX-License-Identifier: MIT

Permissions (only users with task.change permissions can run DL models)
Mask RCNN via TensorFlow
ReID serverless function
Code duplication (python3, model_loader.py) for OpenVINO functions
Semantic segmentation for ADAS function
Text detection function

Next PR:

Images by URL in serverless functions
Tracker serverless function
Optimize serverless function invocation (measure the overhead of the nuclio serverless platform and of submitting images as JSON strings, indirect call using dashboard)
Fix swagger documentation for lambda_manager REST API
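
As a rough illustration of the "images as JSON strings" overhead mentioned in the list above, the sketch below measures how much larger a JSON body is than the raw bytes it carries. It assumes the image is base64-encoded before being embedded in JSON; the field name `image` and the helper name are hypothetical. It measures payload size only, not nuclio's own invocation overhead.

```python
import base64
import json
import os


def json_payload_overhead(raw: bytes) -> float:
    """Return the size of a JSON body carrying `raw` as base64, relative to len(raw)."""
    encoded = base64.b64encode(raw).decode("ascii")
    body = json.dumps({"image": encoded})  # "image" is a hypothetical field name
    return len(body.encode("utf-8")) / len(raw)


fake_image = os.urandom(300_000)  # random bytes standing in for an encoded frame
ratio = json_payload_overhead(fake_image)
print(f"JSON body is {ratio:.2f}x the raw size")
```

Base64 maps every 3 bytes to 4 characters, so the body is about a third larger than the raw image before any HTTP or platform overhead is counted.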

bsekachev · 2020-07-29T11:19:35Z

Excluding comments above, the PR looks good to me.

azhavoro

LGTM

nmanovic · 2020-07-29T15:32:42Z

I had two tasks based on the same video:

The first consists of 1 job
The second consists of 3 jobs (segment size 50) with ZOrder enabled
Stop frame in both cases is the same: 100

When I run openvino.omz.semantic-segmentation-adas-0001 on the first task, it works correctly
When I run the same model on the second task, the progress goes to 100% and then the process fails with: ZeroDivisionError: float division by zero

Can share the video with you

UPD. Checked Faster RCNN on the multi-job task and it works well

Found a typo in dataset_manager (probably an old one). Fixed. Another problem was a polygon with many points that all lie on the same line, so its area was 0.
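
To illustrate the degenerate-polygon case, here is a small sketch (not the actual dataset_manager code) that computes a polygon's area with the shoelace formula. Collinear points give an area of exactly 0, and dividing by that value raises the ZeroDivisionError reported above. The function names and the flat `[x0, y0, x1, y1, ...]` point layout are assumptions.

```python
def polygon_area(points):
    """Shoelace formula; `points` is a flat list [x0, y0, x1, y1, ...]."""
    xs, ys = points[0::2], points[1::2]
    n = len(xs)
    twice_area = sum(
        xs[i] * ys[(i + 1) % n] - xs[(i + 1) % n] * ys[i]
        for i in range(n)
    )
    return abs(twice_area) / 2.0


def safe_ratio(value, area):
    """Hypothetical guard: skip the division when the polygon is degenerate."""
    return value / area if area > 0 else 0.0


line_polygon = [0, 0, 1, 1, 2, 2, 3, 3]   # many points, all on one line
square = [0, 0, 2, 0, 2, 2, 0, 2]

print(polygon_area(line_polygon))  # degenerate polygon
print(polygon_area(square))
print(safe_ratio(1.0, polygon_area(line_polygon)))
```

Without such a guard, `1.0 / polygon_area(line_polygon)` fails with "float division by zero".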

bsekachev · 2020-07-29T15:52:17Z

Found a typo in dataset_manager (probably an old one). Fixed. Another problem was a polygon with many points that all lie on the same line, so its area was 0.

Confirm. It has been fixed.

rushtehrani · 2020-08-04T02:17:42Z

I ran into an issue where I had to pass --platform local to nuctl, for example:

./nuctl deploy --project-name cvat \
  --path serverless/openvino/dextr/nuclio \
  --volume `pwd`/serverless/openvino/common:/opt/nuclio/common \
  --platform local

Otherwise, I would get this error:

Error - the server could not find the requested resource (post nuclioprojects.nuclio.io)
    /nuclio/pkg/platform/kube/platform.go:393

This may only be an issue with the latest nuctl release, but I can create a PR and update the docs accordingly if it makes sense.

rushtehrani · 2020-08-04T02:25:25Z

Also, I'm not seeing a deployed model, even though the model is showing in the API response:

It's also showing up in Nuclio's dashboard:

nmanovic · 2020-08-04T06:56:47Z

@rushtehrani, dextr isn't shown in the CVAT models list. I don't think that is right, and we probably need to fix it in the future. For now it is filtered out explicitly.

nmanovic · 2020-08-04T06:58:06Z

@rushtehrani, I got the same advice about --platform local from the nuclio maintainers. I will definitely fix that ASAP. Thanks!

rushtehrani · 2020-08-05T01:00:10Z

dextr isn't shown in the CVAT models list. I don't think that is right, and we probably need to fix it in the future. For now it is filtered out explicitly.

Got it. If that's the case, would it make sense to use another example command here that will show up in CVAT?

Nikita Manovich added 30 commits April 21, 2020 16:16

Initial experiments with nuclio

583deca

Update nuclio prototype

f9c58e6

Improve nuclio prototype for dextr.

2920c5b

Merge remote-tracking branch 'origin/develop' into nm/serverless

0fa76fb

Dummy lambda manager

2a76d71

OpenFaaS prototype (dextr.bin and dextr.xml are empty).

13978ec

Moved openfaas prototype.

0cd4127

Merge remote-tracking branch 'origin/develop' into nm/serverless

638ae73

Add comments

d089f49

Add serializers and HLD for lambda_manager

3481721

Merge remote-tracking branch 'origin/develop' into nm/serverless

096eb2d

Initial version of Mask RCNN (without debugging)

144f7be

Merge remote-tracking branch 'origin/develop' into nm/serverless

e85f4b7

Initial version for faster_rcnn_inception_v2_coco

e0f3aea

Fix faster_rcnn_inception_v2_coco

af5dda4

Implemented mask_rcnn_inception_resnet_v2_atrous_coco

6d061fc

Implemented yolo detector as a lambda function

8dbfeb9

Merge remote-tracking branch 'origin/develop' into nm/serverless

b9c7a86

Removed dextr app.

bb92b4a

Merge remote-tracking branch 'origin/develop' into nm/serverless

f9cfdf8

Added types for each function (detector and interactor)

fd06913

Merge remote-tracking branch 'origin/develop' into nm/serverless

985aa6d

Initial version of lambda_manager.

d79646a

Implement a couple of methods for lambda:
GET /api/v1/lambda/functions
GET /api/v1/lambda/functions/public.dextr

4ec9da9

Merge remote-tracking branch 'origin/develop' into nm/serverless

55aeb80

Merge remote-tracking branch 'origin/develop' into nm/serverless

5930d07

First working version of dextr serverless function

666e71c

First version of dextr which works in UI.

c160941

Modify omz.public.faster_rcnn_inception_v2_coco
- image decoding
- restart policy always for the function

86f9ab2

Improve omz.public.mask_rcnn_inception_resnet_v2_atrous_coco

b7c4768

Nikita Manovich added 2 commits July 29, 2020 15:29

Merge remote-tracking branch 'origin/develop' into nm/serverless

cc081bd

Removed reid route in installation.md

ced2746

azhavoro previously approved these changes Jul 29, 2020

Fix a command to get lena image in CONTRIBUTION guide.

4210645

nmanovic dismissed azhavoro’s stale review via 4210645 July 29, 2020 13:55

Fix typo and crash in case a polygon is a line.

6dcb354

nmanovic merged commit e7585b8 into develop Jul 29, 2020

nmanovic deleted the nm/serverless branch July 29, 2020 15:56

nmanovic mentioned this pull request Jul 29, 2020

Automatic annotation with Tensorflow Mask RCNN doesn't work #877

Closed

nmanovic linked an issue Jul 29, 2020 that may be closed by this pull request

Support for openVINO 2020 is missing #1179

Closed

This was referenced Jul 29, 2020

Support for openVINO 2020 is missing #1179

Closed

IEPlugin removed from upcoming OpenVINO versions #1830

Closed

This was linked to issues Jul 29, 2020

IEPlugin removed from upcoming OpenVINO versions #1830

Closed

Cannot login after adding OpenVINO toolkit support. #1122

Closed

Preprocessing step in automatic annotation pipeline #1004

Closed

nmanovic mentioned this pull request Jul 29, 2020

Preprocessing step in automatic annotation pipeline #1004

Closed

ActiveChooN mentioned this pull request Aug 3, 2020

Adding Kuberenetes templates and deployment guide #1962

Merged

8 tasks

aleksandrmelnikov mentioned this pull request Nov 17, 2020

Explore CVAT serverless implementation, and design our knative implementation onepanelio/onepanel#735

Open
