This repository contains a Jupyter notebook (`FastAPI-Llama-HuggingfaceHub-Collab.ipynb`) that demonstrates how to set up and run a FastAPI server with Llama 2 model integration using Google Colab's free T4 GPU.
Key features:
- Sets up a FastAPI server with Llama 2 model integration
- Uses Google Colab's free GPU for model inference
- Creates a public URL for the API using ngrok
- Provides an example of how to make API requests to the server
The notebook includes the following main sections (illustrative sketches of these steps follow the list):
- Installation of dependencies
- Setting up ngrok for creating a public URL
- Creating the FastAPI application
- Starting the FastAPI server
- Using ngrok to create a public URL for the server
- Testing the API with example requests
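For orientation, the sketches below show roughly what these cells look like. The package names, model IDs, ports, and endpoint paths are assumptions for illustration, not code copied from the notebook. Installation is typically a single pip cell:

```python
# Colab cell: install the server, tunneling, and model libraries.
# This exact package list is an assumption; check the notebook's first cell.
!pip install fastapi uvicorn pyngrok transformers accelerate
```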
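The core of the application is a FastAPI app that loads a Llama 2 checkpoint from the Hugging Face Hub and serves it behind an endpoint. A minimal sketch follows; the checkpoint, endpoint path, and request schema are hypothetical, and note that the Llama 2 weights on the Hub are gated, so you need approved access and a Hugging Face login first.

```python
import torch
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import pipeline

# Assumes you are logged in (e.g. via huggingface_hub.login) with access
# to the gated Llama 2 weights; the 7B chat checkpoint is a hypothetical choice.
generator = pipeline(
    "text-generation",
    model="meta-llama/Llama-2-7b-chat-hf",
    torch_dtype=torch.float16,  # half precision so the model fits on a T4
    device_map="auto",          # place the model on the Colab GPU
)

app = FastAPI()

class Prompt(BaseModel):
    text: str
    max_new_tokens: int = 128

@app.post("/generate")
def generate(prompt: Prompt):
    # Run inference; return_full_text=False drops the echoed prompt
    # so the response contains only the generated continuation.
    result = generator(
        prompt.text,
        max_new_tokens=prompt.max_new_tokens,
        return_full_text=False,
    )
    return {"response": result[0]["generated_text"]}
```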
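Because Colab already runs an asyncio event loop, a common pattern is to apply nest_asyncio before starting uvicorn and to open the tunnel with pyngrok. Again, a sketch under assumptions (port 8000, pyngrok rather than the ngrok CLI):

```python
import nest_asyncio
import uvicorn
from pyngrok import ngrok

nest_asyncio.apply()  # let uvicorn run inside Colab's existing event loop

ngrok.set_auth_token("YOUR_NGROK_AUTHTOKEN")  # token from your ngrok dashboard
tunnel = ngrok.connect(8000)                  # public URL forwarding to localhost:8000
print("Public URL:", tunnel.public_url)

uvicorn.run(app, host="0.0.0.0", port=8000)   # blocks until interrupted
```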
- Open the `FastAPI-Llama-HuggingfaceHub-Collab.ipynb` notebook in Google Colab
- Follow the instructions in the notebook to set up and run the server
- Use the provided ngrok URL to make API requests to the Llama 2 model (see the example request below)
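As an illustration of such a request, assuming the hypothetical `/generate` endpoint sketched above (substitute the public URL that was printed when the tunnel opened):

```python
import requests

# Replace with the public URL reported by ngrok.
url = "https://your-subdomain.ngrok-free.app/generate"
payload = {"text": "Explain FastAPI in one sentence.", "max_new_tokens": 64}

response = requests.post(url, json=payload, timeout=120)  # generation can be slow on a T4
response.raise_for_status()
print(response.json()["response"])
```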
Requirements:
- Google Colab account (for free GPU access)
- ngrok account (free tier is sufficient)
When you're finished, shut down the server and the ngrok process to free up resources. For more detailed instructions and code explanations, refer to the comments within the notebook.
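If the tunnel was opened with pyngrok as sketched above, teardown is two calls (`tunnel` is the object returned by `ngrok.connect`):

```python
from pyngrok import ngrok

ngrok.disconnect(tunnel.public_url)  # close this specific tunnel
ngrok.kill()                         # stop the ngrok process entirely
```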