Skip to content

Production Potassium http model server for english-centric news snippet generation with a BLOOMZ model.

Notifications You must be signed in to change notification settings

snipaid-nlg/bloomz-model-server-v2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BLOOMZ Potassium Model Server

This is a Potassium HTTP server for a BLOOMZ model with our english-centric approach for german news snippet generation.

Quickstart

Note: This model requires a GPU with ~ 8GB memory for generation!

Curious to get your hand on englsh-centric BLOOMZ news snippet generation?

You can check it out with docker:

  1. Run docker build -t bloomz-english-centric . && docker run -it bloomz-english-centric to build and run the docker container.

Or you can check it out manually:

  1. Run pip3 install -r requirements.txt to download dependencies.
  2. Run python3 app.py to start the server.
  3. You should see:
------  
Starting server 🍌  
Running init()  
...  
Serving on http://localhost:8000  
------
  1. Now open up a different terminal and hit the server with a simple cURL POST request
curl -X POST \
-H "Content-Type: application/json" \
-d '{prompt": "Generate a title for the following news article.", "document": "<Insert-the-fulltext-of-a-news-article-here>"}' \
http://localhost:8000/
  1. Boom! 🎉 You just ran an inference on the model on your local machine!
{
    "output": "Here is the text the model generated."
}

🍌

Test and deploy with Banana

Testing

Note: For this you need the banana-cli installed. Run pip3 install banana-cli to install it.

  1. Fork this repo and clone it to your local device.
  2. Start a local dev server with banana dev.

Deployment

  1. Log into Banana.
  2. Select your fork of the repo to build and deploy!

About

Production Potassium http model server for english-centric news snippet generation with a BLOOMZ model.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published