
Add yolov5-youtube example #1201

Merged
merged 13 commits into from
Jul 7, 2020

Conversation

dsuess
Contributor

@dsuess dsuess commented Jul 3, 2020

addresses #1130

This is work in progress. I opened the PR to discuss some details.

Here's what this example does:

  • you send a request with a youtube URL
  • the predictor runs a YoloV5 model in ONNX format over it
  • the overlaid video is returned to the user

This demonstrates both how to load any YOLO model from the ultralytics repo and how to process video with relative ease.

I am using ffmpeg-python (instead of the more standard opencv/scikit-video) since it allows you to do large parts of the pre-processing (resizing & padding) in a separate process. The reason I am using youtube-dl (instead of passing in a video directly) is that we don't have to provide an example video ourselves.
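The resize-and-pad ("letterbox") preprocessing mentioned above can be sketched in plain Python (a minimal illustration; the helper name and 640-pixel target are hypothetical, not taken from this PR):

```python
# Hypothetical helper illustrating the resize-and-pad ("letterbox")
# preprocessing that YOLO-style models expect: scale the frame so the
# longer side fits the target square, then pad the shorter side.
def letterbox_dims(width, height, target=640):
    """Return the resized (w, h) and the (left, top) padding offsets."""
    scale = target / max(width, height)
    new_w, new_h = round(width * scale), round(height * scale)
    pad_w, pad_h = target - new_w, target - new_h
    # split the padding evenly between the two sides
    return (new_w, new_h), (pad_w // 2, pad_h // 2)

print(letterbox_dims(1280, 720))  # 720p frame -> ((640, 360), (0, 140))
```

In the example itself this arithmetic is delegated to ffmpeg's scale and pad filters, which run in a separate ffmpeg process.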

Would this be a good example? Any suggestions for improvements? How do you want to store the ONNX file?

Here's what still needs to be done:

  • document how to export the ONNX model from the ultralytics repo. There's a minor change that makes our life much easier that I'll need to document.
  • improve the overlay to show class names
  • document & clean up the python code (e.g. delete intermediate files)
  • update the documentation on the website
  • a bit of performance optimization
  • async the video-download?

checklist:

  • run make test and make lint
  • test manually (i.e. build/push all images, restart operator, and re-deploy APIs)
  • update examples
  • update docs and add any new files to summary.md (view in gitbook after merging)
  • cherry-pick into release branches if applicable
  • alert the dev team if the dev environment changed

@deliahu
Member

deliahu commented Jul 3, 2020

@dsuess this is really awesome!! I like how simple the API is: send a youtube link, and get back an annotated video!

I am using ffmpeg-python (instead of the more standard opencv/scikkit-video) since it allows you to do large parts of the pre-processing (resizing & padding) in a separate process.

Does this help improve latency or just throughput? Are you running the API with processes_per_replica=1 and threads_per_process=1? (note that processes_per_replica refers just to the uvicorn processes; ffmpeg will spawn additional processes). I'd be curious to understand how all of the threading/parallelism interplays, but also, there's no need to spend a lot of time on it, since it's just an example and not running in production :)

The reason I am using youtube-dl (instead of passing in a video directly) is that we don't have to provide an example video ourselves.

Yes, I agree, this makes for a nice example!

Would this be a good example? Any suggestions for improvements?

Yes, I think this is a great example! I can't think of any suggestions for improvements at the moment.

How do you want to store the ONNX file?

I will add it to our cortex-examples bucket; feel free to send me the model when it's ready (via google drive, dropbox, etc), and I can upload it there.

async the video-download?

This is also an interesting one, similar to the discussion about ffmpeg. My intuition says that downloading the video async could increase throughput (and not latency) assuming processes_per_replica>1 and/or threads_per_process>1. Like before, there's no need to spend a lot of time on this for the example's sake.

Thanks again for adding this!

@dsuess
Contributor Author

dsuess commented Jul 3, 2020

Does this help improve latency or just throughput? Are you running the API with processes_per_replica=1 and threads_per_process=1? (note that processes_per_replica refers just to the uvicorn processes; ffmpeg will spawn additional processes). I'd be curious to understand how all of the threading/parallelism interplays, but also, there's no need to spend a lot of time on it, since it's just an example and not running in production :)

To be honest, the main reason for using ffmpeg-python was that I didn't want to implement the resize-padding for the hundredth time 😄

I think it improves both (especially when you run on a GPU) since it allows you to run model inference and data preprocessing (decoding, resizing) in parallel. But I haven't done any tests and there are definitely better solutions if you actually need to optimize either performance metric.
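The decode-while-inferring overlap described above can be sketched with a bounded queue (a stand-in illustration using stdlib threading; `time.sleep` fakes both the ffmpeg decoding and the ONNX model, so the names and timings are assumptions, not the example's actual code):

```python
# Sketch of the producer/consumer overlap: one thread "decodes" frames
# while the main thread runs "inference" on them concurrently.
import queue
import threading
import time

frames = queue.Queue(maxsize=4)  # bounded queue applies back-pressure

def decode(n_frames):
    for i in range(n_frames):
        time.sleep(0.01)          # pretend to decode/resize a frame
        frames.put(i)
    frames.put(None)              # sentinel: no more frames

threading.Thread(target=decode, args=(10,), daemon=True).start()

results = []
while (frame := frames.get()) is not None:
    time.sleep(0.01)              # pretend to run the model
    results.append(frame * 2)

print(len(results))  # 10
```

With ffmpeg-python the producer side is a real separate process rather than a Python thread, so the overlap is not limited by the GIL.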

async the video-download?

This is also an interesting one, similar to the discussion about ffmpeg. My intuition says that downloading the video async could increase throughput (and not latency) assuming processes_per_replica>1 and/or threads_per_process>1. Like before, there's no need to spend a lot of time on this for the example's sake.

You're right on this one, this would only benefit throughput. And just increasing either processes_per_replica or threads_per_process will probably be the better solution anyway.
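The tuning knobs discussed above would sit in the API's cortex.yaml along these lines (a hedged sketch: the two field names come from this discussion, but the surrounding structure and values are illustrative and may differ by Cortex version):

```yaml
# Illustrative cortex.yaml fragment tuned for throughput
- name: yolov5-youtube
  predictor:
    type: python
    path: predictor.py
  processes_per_replica: 2   # uvicorn worker processes per replica
  threads_per_process: 2     # request threads per worker process
```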

@dsuess
Contributor Author

dsuess commented Jul 4, 2020

OK, I've sent through the ONNX file and fixed the things that needed fixing. So if someone with the necessary permissions can a) upload the ONNX file and change the path in the config and b) fix the failing test, we're good to go IMO.

@RobertLucian
Member

@dsuess Thank you so much for adding this!

I tried pushing to your branch, but apparently, I don't have write permissions. I think this might be because Allow edits from maintainers hasn't been checked. This option should be somewhere on the right-hand side of the PR, I think.

Also, I'd want to test this over the weekend. For that, do you have a public link to your ONNX model I could use?

@dsuess
Contributor Author

dsuess commented Jul 4, 2020

The box is ticked as far as I can tell, so not sure what's going on.

For the ONNX file: https://drive.google.com/file/d/1p0nbSHUFpZhp6RxR2scNFaMk0ANHu_1e/view?usp=sharing

I've also included the steps for creating that ONNX file from the original repo, so you could also try reproducing those steps.

@RobertLucian
Member

@dsuess thanks for the link. As for the write access, that was a false alarm (an old peculiarity of my setup). It now works!

Member

@deliahu deliahu left a comment


This is a really great example, thanks again!

@deliahu
Member

deliahu commented Jul 4, 2020

@RobertLucian @dsuess I've uploaded the model to s3://cortex-examples/onnx/yolov5-youtube/yolov5s.onnx for now, and I can move it depending on what we decide regarding the API name

@dsuess dsuess changed the title WIP: Implement first prototype for yolov5 example Implement first prototype for yolov5 example Jul 5, 2020
Member

@RobertLucian RobertLucian left a comment


This looks good to me! I took the liberty of modifying a few things here and there:

  1. Read the output video file as bytes and deleted the file before returning the bytes as the response's payload.
  2. Fixed a bug with ffmpeg: the latest version of it from conda, released on the 6th of July, led to this error: ffmpeg: relocation error: /opt/conda/envs/env/bin/../lib/./libgnutls.so.30: symbol mpn_add_1 version HOGWEED_4 not defined in file libhogweed.so.4 with link time reference. Reverting it to its previous version, 4.2.3, made it work again - which is most likely the version @dsuess used.
  3. Used a context manager when instantiating a FrameWriter object.
  4. Moved functions to the utils module.
  5. Decreased the line thickness by a factor of 4.
  6. Created a GIF using the sample YT video.
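The ffmpeg pin from item 2 would look something like this in the example's conda package list (a sketch; the file name and entry format are assumptions about how Cortex picks up conda dependencies, not taken from this PR):

```
# conda-packages.txt (illustrative)
ffmpeg=4.2.3   # pinned: the 2020-07-06 conda build broke the libgnutls/libhogweed linkage
```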

@deliahu deliahu changed the title Implement first prototype for yolov5 example Add yolov5-youtube example Jul 6, 2020
Member

@deliahu deliahu left a comment


Everything looks and works great, thanks again for adding this example!

@dsuess is it ready to merge from your perspective? If so, we can go ahead and merge it now.

@dsuess
Contributor Author

dsuess commented Jul 6, 2020

Yes, all good from my side

@deliahu deliahu merged commit 5986efc into cortexlabs:master Jul 7, 2020