Simple, helpful tools for batch async calls to OpenAI APIs.
Sometimes I need to make hundreds of thousands of independent calls to the OpenAI APIs, mostly the chat and embeddings endpoints. The Python openai library contains all the tools to make this possible, but setting things up for each project, test, or experiment can be cumbersome and may involve a good amount of boilerplate code.
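For a sense of what that boilerplate looks like, here is a minimal sketch of the general pattern this library wraps: fanning a list of prompts out over the async client with bounded concurrency. The function names and the concurrency limit below are illustrative, not part of this library's API.

import asyncio

from openai import AsyncOpenAI

client = AsyncOpenAI()  # reads OPENAI_API_KEY from the environment

async def ask_all(prompts, model='gpt-3.5-turbo', max_concurrent=100):
    # Cap in-flight requests so large batches do not trip rate limits.
    sem = asyncio.Semaphore(max_concurrent)

    async def ask_one(prompt):
        async with sem:
            response = await client.chat.completions.create(
                model=model, messages=[{'role': 'user', 'content': prompt}])
            return response.choices[0].message.content

    return await asyncio.gather(*(ask_one(p) for p in prompts))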
Also, I sometimes feel the need to work in a Jupyter notebook, and other times I prefer a Python script. There are small differences between the two scenarios when making async calls, because Jupyter notebooks already run an event loop. I prefer to have a single way to make such calls, no matter which environment I am working in, without having to worry about these details.
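Concretely, asyncio.run() works in a script but raises inside a notebook, where a loop is already running. A common workaround looks like the sketch below; this is the general pattern, and whether this library uses nest_asyncio internally is an assumption on my part.

import asyncio

def run_async(coro):
    try:
        loop = asyncio.get_running_loop()  # raises RuntimeError in a plain script
    except RuntimeError:
        return asyncio.run(coro)  # script: no loop running yet, so start one
    # Notebook: a loop is already running; nest_asyncio lets us re-enter it.
    import nest_asyncio
    nest_asyncio.apply()
    return loop.run_until_complete(coro)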
Thus I made this library, mostly for myself and to help me in my work, projects, and experiments. Perhaps others might find it useful too. Feel free to open issues, propose changes, or contribute to the codebase.
For now I am keeping this simple and will not push a package to PyPI, since it is a small "library" and essentially a wrapper around the OpenAI Python library. You can install it directly from GitHub:
pip install git+https://github.com/JovanVeljanoski/openai-batch-utils
Alternatively, and especially for potential contributors, you can clone the repository and install it locally. First fork the repository to your GitHub account, then:
git clone https://github.com/<your-username>/openai-batch-utils.git
cd openai-batch-utils
pip install -e ".[dev]"
Generally, looking at the tests is a fine way of understanding how to use the library. Here are some examples straight from the tests:
from openai_batch_utils import OpenAIChat
import pandas as pd
prompt = ['what is the capital of netherlands',
          'what is the capital of france',
          'what is the capital of germany',
          'what is the capital of italy',
          'what is the capital of spain']

system = """Reply in JSON format. Obey the following schema:
{'city': `Capital`, 'country': `Country`}"""
chat = OpenAIChat()
# Here a small `batch_size` is used for demonstration.
# In practice, values between 500 and 3000 tend to work well, depending on the use case.
result = chat.openai_chat(prompt=prompt,
                          system_prompt=system,
                          batch_size=3,
                          model='gpt-3.5-turbo',
                          response_format={'type': 'json_object'},
                          sleep_interval=5,
                          verbose=True)
result = pd.DataFrame(result)
print(result)
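Depending on how openai_chat returns completions, the elements of result may be raw JSON strings rather than dicts; the defensive parse below is a sketch, and json.loads here reflects an assumption about the payload shape, not a documented part of the library.

import json

# Parse any raw JSON strings before building the dataframe
# (dicts pass through unchanged).
rows = [json.loads(r) if isinstance(r, str) else r for r in result]
df = pd.DataFrame(rows)
print(df)  # expected columns: city, country (one row per prompt)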
from openai_batch_utils import OpenAIEmbed
prompt = ['what is the capital of netherlands',
          'what is the capital of france',
          'what is the capital of germany',
          'what is the capital of italy',
          'what is the capital of spain']
embed = OpenAIEmbed()
# Here a small `batch_size` is used for demonstration.
# In practice, values between 500 and 3000 tend to work well, depending on the use case.
result = embed.openai_embed(input=prompt, model='text-embedding-3-small', batch_size=3)
print(result)
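To sanity-check the embeddings, one can compute pairwise cosine similarities between the inputs. This sketch assumes result is a list of embedding vectors, one per input text, which is what the underlying embeddings API returns:

import numpy as np

emb = np.asarray(result, dtype=float)              # one row per input text
emb /= np.linalg.norm(emb, axis=1, keepdims=True)  # unit-normalise each row
similarity = emb @ emb.T                           # pairwise cosine similarities
print(similarity.round(2))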
Be kind, respectful, and helpful, and we will get along just fine! I am open to ideas, suggestions, and improvements.
MIT License