Getting Started with Computer Vision

For this tutorial, you'll need a Raspberry Pi with a web cam and a screen. After completing these instructions, you can point your Pi's web cam at a variety of objects, and the deployed software will print a label for whatever object appears on camera. For example, we pointed our Pi's cam at a cup of coffee on our desk and saw this on the Pi's screen:

This tutorial shows you how to import a pre-trained image recognition model, compile it for the Raspberry Pi, and write the code that runs the image recognizer and prints the labels.

In this tutorial we will load a pre-trained Convolutional Neural Net (CNN). We will then connect your video camera using OpenCV and do some almost real-time image recognition of dogs, birds and all kinds of fun stuff. We will then get that working on a Raspberry Pi using the ELL model compiler.

Prerequisites

We will use Python 3.6 for this tutorial. We highly recommend using the miniconda or full anaconda python environment because it comes with many handy tools like curl which we will use later on.
You will also need a simple web cam or a pi cam. If you don't have one handy, we will show you how to load static images or .mp4 videos and process those instead.
To read images from the camera, you will need Open CV. There are download instructions on the opencv website, but we also created a simple Open CV setup page for you.
You also need NumPy since ELL processes images as NumPy arrays. NumPy can install using conda as follows:
```
conda install numpy
```
You will need to build ELL and the ELL Python Language Bindings. (You will need to build ELL after you install Python, so that the CMake step picks up on the fact that you have Python 3.6 installed.)

Downloading a pre-trained model

Now you can choose to use two different routes for pre-trained models, and so the tutorial forks at this point, please pick one of the following:

Microsoft Cognitive Toolkit
Darknet

Other

A list of other useful models, from both CNTK and Darknet, can be found in the PretrainedModels section.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Getting Started with Computer Vision

Prerequisites

Downloading a pre-trained model

Other

Files

README.md

Latest commit

History

README.md

File metadata and controls

Getting Started with Computer Vision

Prerequisites

Downloading a pre-trained model

Other