Introduction

In the project this week, we will focus on model deployment for the news classification model that we trained in week 1, and evaluated in week 2.

We will build a simple web application backend using FastAPI and Uvicorn that wraps our trained model, and exposes an endpoint to make predictions on live traffic that is sent to the endpoint.
We will learn how to wrap our application in a Linux container using Docker. This is a standard practice when we want to deploy our application in a cloud environment such as AWS or Google Cloud.
We will test our web application and Docker container by sending it live traffic (on our local machine) and logging the model predictions.
We will write some integration tests to test our Docker container is running fine before deployment
[advanced & optional] we will prepare to deploy as a serverless function using AWS Lambda, getting it working locally

[Step 1] Download & Install Prerequisites:

Download the instructions & starter code:

You can download the starter code from: https://corise-mlops.s3.us-west-2.amazonaws.com/project3/starter.zip. After downloading and unzipping the week 3 starter code, your directory structure should look like this:

project
│   README.md
│   Dockerfile
│   requirements.txt
|   test_app.py
|   __init__.py
└───app
│   │   server.py
│   │   __init__.py
|
└───data
    │   news_classifier.joblib
    |   logs.out

Go to the project directory from the command line. This will be the home directory for your project. All command line commands that follow are from this directory.

Create a local Python virtual environment, install Python dependencies

Before we can get started with writing any code for the project, we recommend creating a Python virtual environment. If you haven't created virtual environments in Python before, you can refer to this documentation.

It needs the following steps (the commands shown below work for MacOS/Unix operating systems. Please refer to the documentation above for Windows):

Insall pip:

$ python3 -m pip install --user --upgrade pip

Install virtualenv:

$ python3 -m pip install --user virtualenv

Create virtual environment:

$ python3 -m venv mlopsproject

Activate the virtual environment:

$ source mlopsproject/bin/activate

Install the required python dependencies:

$ python3 -m pip install --user -r requirements.txt

Install Docker

Download and install Docker. You can follow the steps in this document.

If you are new to Docker, we suggest spending some time to get familiar with the Docker command line and dashboard. Docker's getting started page is a good resource.

[Step 2] Create a FastAPI web application to serve model predictions

Before getting started on the web application changes, make sure you can run the starter web server code:

$ cd app
$ uvicorn server:app

You should see an output like:

INFO:     Started server process [5749]
INFO:     Waiting for application startup.
2022-07-21 20:46:59.898 | INFO     | server:startup_event:109 - Setup completed
INFO:     Application startup complete.
INFO:     Uvicorn running on http://127.0.0.1:8000 (Press CTRL+C to quit)

When you go to http://127.0.0.1:8000 from a web browser, you should see this text output: {"Hello": "World"}:

We are now ready to get started on writing the code! All the required code changes for this project are in app/server.py. Comments in this file will help you understand the changes we need to make to create our web application to make model predictions. Once the code changes are done, you can start the web server again using the command from the above step.
Test with an example request:

Option 1: Using the web browser.

Visit http://127.0.0.1:8000/docs. You will see a /predict endpoint:

You can click on "Try it now" which will let you modify the input request. Click on "Execute" to see the model prediction response from the web server:

Option 2: Using the command line.

You can construct the POST request and send it to the web server from another tab in the command line as follows:

$ curl -X 'POST' \
  'http://127.0.0.1:8000/predict' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "source": "<value>",
  "url": "<value>",
  "title": "<value>",
  "description": "<value>"
}'

[Step 3] Containerize the application using Docker

Build the Docker Image

$ docker build --platform linux/amd64 -t news-classifier .

Start the container:

$ docker run -p 80:80 news-classifier-w3

Test the Docker container with an example request:

Option 1: Using the web browser: Visit http://0.0.0.0/docs and follow the same guidelines as above.

Option 2: Using the command line:

$ curl -X 'POST' \
  'http://0.0.0.0/predict' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "source": "<value>",
  "url": "<value>",
  "title": "<value>",
  "description": "<value>"
}'

[Step 4] Local testing & examining logs

Find out the container id of the running container:

$ docker ps

This will return a response like the following:

CONTAINER ID   IMAGE                COMMAND                  CREATED         STATUS         PORTS                NAMES
3a45c7f7661c   news-classifier-w3   "uvicorn server:app …"   4 minutes ago   Up 4 minutes   0.0.0.0:80->80/tcp   happy_burnell

SSH into the container using the container id from above:

$ docker exec -it <container id> /bin/sh

Tail the logs:

$ tail -f ../data/logs.out

Now when you send any request to the web server (from the browser, or another tab in the command line), you can see the log output coming through in logs.out. Test the web server with these requests and make sure you can see the outputs in logs.out:

{
  "source": "BBC Technology",
  "url": "http://news.bbc.co.uk/go/click/rss/0.91/public/-/2/hi/business/4144939.stm",
  "title": "System gremlins resolved at HSBC",
  "description": "Computer glitches which led to chaos for HSBC customers on Monday are fixed, the High Street bank confirms."
}

{
  "source": "Yahoo World",
  "url": "http://us.rd.yahoo.com/dailynews/rss/world/*http://story.news.yahoo.com/news?tmpl=story2u=/nm/20050104/bs_nm/markets_stocks_us_europe_dc",
  "title": "Wall Street Set to Open Firmer (Reuters)",
  "description": "Reuters - Wall Street was set to start higher on\Tuesday to recoup some of the prior session's losses, though high-profile retailer Amazon.com  may come under\pressure after a broker downgrade."
}

{
  "source": "New York Times",
  "url": "",
  "title": "Weis chooses not to make pickoff",
  "description": "Bill Belichick won't have to worry about Charlie Weis raiding his coaching staff for Notre Dame. But we'll have to see whether new Miami Dolphins coach Nick Saban has an eye on any of his former assistants."
}

{
  "source": "Boston Globe",
  "url": "http://www.boston.com/business/articles/2005/01/04/mike_wallace_subpoenaed?rss_id=BostonGlobe--BusinessNews",
  "title": "Mike Wallace subpoenaed",
  "description": "Richard Scrushy once sat down to talk with 60 Minutes correspondent Mike Wallace about allegations that Scrushy started a huge fraud while chief executive of rehabilitation giant HealthSouth Corp. Now, Scrushy wants Wallace to do the talking."
}

{
  "source": "Reuters World",
  "url": "http://www.reuters.com/newsArticle.jhtml?type=worldNewsstoryID=7228962",
  "title": "Peru Arrests Siege Leader, to Storm Police Post",
  "description": "LIMA, Peru (Reuters) - Peruvian authorities arrested a former army major who led a three-day uprising in a southern  Andean town and will storm the police station where some of his  200 supporters remain unless they surrender soon, Prime  Minister Carlos Ferrero said on Tuesday."
}

{
  "source": "The Washington Post",
  "url": "http://www.washingtonpost.com/wp-dyn/articles/A46063-2005Jan3.html?nav=rss_sports",
  "title": "Ruffin Fills Key Role",
  "description": "With power forward Etan Thomas having missed the entire season, reserve forward Michael Ruffin has done well in taking his place."
}

[Step 5][Optional] Testing with Pytest

This part is optional. We've built our web application, and containerized it with Docker. But imagine a team of ML engineers and scientists that needs to maintain, improve and scale this service over time. It would be nice to write some tests to ensure we don't regress!

Pytest is a popular testing framework for Python. If you haven't used it before, take a look at this page to get started and familiarize yourself with this library.
How do we test FastAPI applications with Pytest? Glad you asked, here's two resources to help you get started: (i) Introduction to testing FastAPI (ii) Testing FastAPI with startup and shutdown events
Head over to test_app.py to get started. As you develop the tests using prompts in this file, you can run pytest to run the tests.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Introduction

[Step 1] Download & Install Prerequisites:

Download the instructions & starter code:

Create a local Python virtual environment, install Python dependencies

Install Docker

[Step 2] Create a FastAPI web application to serve model predictions

[Step 3] Containerize the application using Docker

[Step 4] Local testing & examining logs

[Step 5][Optional] Testing with Pytest

Files

README.md

Latest commit

History

README.md

File metadata and controls

Introduction

[Step 1] Download & Install Prerequisites:

Download the instructions & starter code:

Create a local Python virtual environment, install Python dependencies

Install Docker

[Step 2] Create a FastAPI web application to serve model predictions

[Step 3] Containerize the application using Docker

[Step 4] Local testing & examining logs

[Step 5][Optional] Testing with Pytest