
GPU-Support: Mask-RCNN + Minor GPU fixes #2714

Merged
merged 24 commits into from
Feb 16, 2021

Conversation

@jahaniam (Contributor) commented Jan 25, 2021

Summary:

  • Added GPU support for Mask-RCNN
  • More robust GPU execution: limited GPU memory to 33.3% of total GPU memory per worker, with one worker per function for GPU
  • Improved Mask-RCNN deployment speed by removing extra packages
  • Bumped nuclio to 1.5.16
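
The summary mentions a per-worker GPU memory cap but does not show the mechanism here. In TensorFlow 1.x this kind of cap is typically set with `per_process_gpu_memory_fraction`; the following is a minimal sketch of the idea, not necessarily this PR's exact code:

```python
import tensorflow as tf

# Cap each worker process at roughly a third of total GPU memory,
# so multiple function workers can share a single card (TF 1.x-style API).
gpu_options = tf.compat.v1.GPUOptions(per_process_gpu_memory_fraction=1 / 3)
config = tf.compat.v1.ConfigProto(gpu_options=gpu_options)
session = tf.compat.v1.Session(config=config)
```

Combined with `maxWorkers: 1` in the function trigger, each deployed function then claims at most about a third of the GPU.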

Here are quick results from my laptop with an NVIDIA GTX 1060M:

  • Mask-RCNN GPU: ~1.2 s / image
  • Mask-RCNN CPU: ~7.5 s / image

Related issues: #2635, #2489

Bumped nuclio to 1.5.16: based on nuclio/nuclio#2058, processorMountMode is deprecated and replaced with mountMode; therefore, #2578 needs to be revisited.

Checklist

License

  • I submit my code changes under the same MIT License that covers the project.
    Feel free to contact the maintainers if that's a concern.
  • I have updated the license header for each file (see an example below)
```python
# Copyright (C) 2021 Intel Corporation
#
# SPDX-License-Identifier: MIT
```

@coveralls commented Jan 25, 2021

Coverage Status

Coverage increased (+0.02%) to 69.774% when pulling cef255e on jahaniam:develop into 897267c on openvinotoolkit:develop.

@azhavoro (Contributor) commented:

@jahaniam Thanks for your contribution!

@jahaniam (Contributor, Author) replied:

> @jahaniam Thanks for your contribution!

Thank you and your team.

@nmanovic (Contributor) commented:

@jahaniam, could you please help us fix the Codacy issues in the PR?

vat/apps/documentation/installation_automatic_annotation.md

  • [maximum-line-length] Line must be at most 120 characters:
    "Also you will need to add `--resource-limit nvidia.com/gpu=1 --triggers '{"myHttpTrigger": {"maxWorkers": 1}}'` to the nuclio deployment command. You can increase the maxWorker if you have enough GPU memory."
  • [maximum-line-length] Line must be at most 120 characters:
    "- Since the model is loaded during deployment, the number of GPU deployed functions will be limited to your GPU memory."
  • [list-item-content-indent] Don't use mixed indentation for children, remove 2 spaces (same line as above)
  • [list-item-indent] Incorrect list-item indent: add 2 spaces (same line as above)

serverless/tensorflow/faster_rcnn_inception_v2_coco/nuclio/model_loader.py

  • Trailing whitespace
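
The long line Codacy flags above documents the GPU deployment flags. Put together with `nuctl`, the full command might look like the following sketch; the project name and function path are illustrative assumptions, not taken from this PR:

```shell
# Deploy a serverless function with a GPU resource limit and a single worker.
# --project-name and --path are assumptions; adjust for your checkout.
nuctl deploy --project-name cvat \
  --path serverless/tensorflow/matterport/mask_rcnn/nuclio \
  --platform local \
  --resource-limit nvidia.com/gpu=1 \
  --triggers '{"myHttpTrigger": {"maxWorkers": 1}}'
```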

```diff
 build:
   image: cvat/tf.matterport.mask_rcnn
-  baseImage: tensorflow/tensorflow:2.1.0-py3
+  baseImage: tensorflow/tensorflow:1.13.1-py3
```
A reviewer (Contributor) commented:

The latest version of TensorFlow 1.x is 1.15.5, and I see the note "Note that this is the last patch release for the TensorFlow 1.x series."
What is the reason to move from 2.1 to 1.13.1?

@jahaniam (Contributor, Author) replied:

Before, you were installing `RUN pip install tensorflow==1.13.1`; a `RUN` instruction can't be overridden via nuctl for GPU. When I tried tensorflow 2.1 GPU I got errors, and besides, the Mask-RCNN code is written for TensorFlow 1.x, so it is better to use a 1.x Docker image.
There was no reason not to go with 1.15.5; we can probably switch to that version. I just needed a version that works, and I used 1.13.1 because it was already being installed on line 119.

@nmanovic (Contributor) commented:

@jahaniam, the patch looks great! Let's clarify a couple of points and merge.

@jahaniam (Contributor, Author) left a comment:

> @jahaniam, could you please help us fix the Codacy issues in the PR? […]

Do you know how I can re-run the check? I made some changes and want to make sure they are OK before committing.

@jahaniam (Contributor, Author) commented Feb 1, 2021:

All comments are addressed.

The only remaining item is Codacy complaining about two spaces for the list item (my indentation is correct). I am not able to fix that; if you can, please go ahead.

@jahaniam jahaniam requested a review from nmanovic February 1, 2021 02:26
@nmanovic (Contributor) commented:

Hi @jahaniam, I finally merged my old patch with the IOG serverless function (#2578). Could you please adjust your PR?

@jahaniam (Contributor, Author) replied:

> Hi @jahaniam, I finally merged my old patch with the IOG serverless function (#2578). Could you please adjust your PR?

Awesome. I was looking forward to that merge for a while. I'll look into it this weekend.

@jahaniam (Contributor, Author) commented:

> Hi @jahaniam, I finally merged my old patch with the IOG serverless function (#2578). Could you please adjust your PR?

Fixed the conflicts and tested; it is working as expected. Please review, @nmanovic @azhavoro.
@nmanovic (Contributor) commented:

@jahaniam, thanks for the great contribution! Really appreciate your time and efforts.

@nmanovic nmanovic merged commit 59c3b28 into cvat-ai:develop Feb 16, 2021
kenu pushed a commit to kenu/cvat that referenced this pull request Feb 23, 2021
* fixed cpu mask rcnn+preparation for gpu
* fix-limit gpu memory to 30% of total memory per worker

Co-authored-by: Nikita Manovich <nikita.manovich@intel.com>
@valavanisleonidas commented:

Hello,

@jahaniam, you reported these times:

  • Mask-RCNN GPU: ~1.2 s / image
  • Mask-RCNN CPU: ~7.5 s / image

Is this the time for the model to predict annotations for a single image?

@jahaniam (Contributor, Author) commented Mar 9, 2021:

> Is this the time for the model to predict annotations for a single image?

Yes.
