description
CLI tool to build and deploy chatbots

Chatbots

Introduction

A command line tool to build and deploy a knowledge-based chatbot using PostgresML and the OpenAI API.

There are two stages in building a knowledge-based chatbot:

  • Build a knowledge base by ingesting documents, chunking them, generating embeddings, and indexing those embeddings for fast queries
  • Answer user queries by retrieving relevant documents and generating responses with the OpenAI API

This tool automates both stages and provides a command line interface to build and deploy a knowledge-based chatbot.
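The two stages can be illustrated with a minimal, self-contained sketch. This is not how pgml-chat is implemented: the word-count `embed` function is a toy stand-in for a real embedding model, the in-memory list stands in for an index inside PostgresML, and all function names are hypothetical. The final step of sending the retrieved text to the OpenAI API is omitted.

```python
import math
import re
from collections import Counter


def chunk(text, max_words=40):
    """Split text into chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text):
    """Toy embedding: lowercase word counts (a stand-in for a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def build_index(documents, max_words=40):
    """Stage 1: chunk every document and store (chunk, embedding) pairs."""
    return [(c, embed(c)) for doc in documents for c in chunk(doc, max_words)]


def retrieve(index, query, top_k=1):
    """Stage 2 (retrieval half): return the top_k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [text for text, _ in ranked[:top_k]]


docs = [
    "PostgresML lets you train and deploy models inside Postgres.",
    "Slack bots need a bot token and an app token.",
]
index = build_index(docs)
print(retrieve(index, "How do I deploy models in Postgres?"))
```

In a real deployment the retrieved chunks would be placed into a prompt and sent to a language model, which is the part the chat stage handles.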

Prerequisites

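Before you begin, make sure you have the following:

  • A PostgresML database and its connection URL
  • An OpenAI API key
  • Python with pip, to install pgml-chat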

Getting started

  1. Create a virtual environment and install pgml-chat using pip:
pip install pgml-chat

The pgml-chat command will then be available on your PATH.

  2. Download the .env.template file from the PostgresML GitHub repository.
wget https://raw.githubusercontent.com/postgresml/postgresml/master/pgml-apps/pgml-chat/.env.template
  3. Copy the template file to .env
  4. Update the environment variables with your OpenAI API key and PostgresML database credentials.
OPENAI_API_KEY=<OPENAI_API_KEY>
DATABASE_URL=<POSTGRES_DATABASE_URL starts with postgres://>
MODEL=hkunlp/instructor-xl
MODEL_PARAMS={"instruction": "Represent the Wikipedia document for retrieval: "}
QUERY_PARAMS={"instruction": "Represent the Wikipedia question for retrieving supporting documents: "}
SYSTEM_PROMPT="You are an assistant to answer questions about an open source software named PostgresML. Your name is PgBot. You are based out of San Francisco, California."
BASE_PROMPT="Given relevant parts of a document and a question, create a final answer.\ 
                Include a SQL query in the answer wherever possible. \
                Use the following portion of a long document to see if any of the text is relevant to answer the question.\
                \nReturn any relevant text verbatim.\n{context}\nQuestion: {question}\n \
                If the context is empty then ask for clarification and suggest user to send an email to team@postgresml.org or join PostgresML [Discord](https://discord.gg/DmyJP3qJ7U)."
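Two details of the settings above are easy to get wrong: MODEL_PARAMS and QUERY_PARAMS must be valid JSON, and BASE_PROMPT must keep its `{context}` and `{question}` placeholders. The sketch below checks both and shows how the placeholders might be filled. It assumes the placeholders are substituted in the style of Python's `str.format`, which the source does not confirm. The helper names `check_env` and `fill_prompt` are hypothetical, and BASE_PROMPT is shortened for brevity.

```python
import json

# Values copied from the .env example above (BASE_PROMPT shortened for brevity).
env = {
    "MODEL_PARAMS": '{"instruction": "Represent the Wikipedia document for retrieval: "}',
    "QUERY_PARAMS": '{"instruction": "Represent the Wikipedia question for retrieving supporting documents: "}',
    "BASE_PROMPT": "Return any relevant text verbatim.\n{context}\nQuestion: {question}\n",
}


def check_env(values):
    """Return a list of problems found in the given settings (empty if none)."""
    problems = []
    for key in ("MODEL_PARAMS", "QUERY_PARAMS"):
        try:
            if not isinstance(json.loads(values[key]), dict):
                problems.append(f"{key} must be a JSON object")
        except (KeyError, json.JSONDecodeError) as exc:
            problems.append(f"{key}: {exc!r}")
    for placeholder in ("{context}", "{question}"):
        if placeholder not in values.get("BASE_PROMPT", ""):
            problems.append(f"BASE_PROMPT is missing {placeholder}")
    return problems


def fill_prompt(template, context, question):
    """Substitute the two placeholders with str.format (an assumption about the tool)."""
    return template.format(context=context, question=question)


print(check_env(env))
print(fill_prompt(env["BASE_PROMPT"], "PostgresML runs inside Postgres.", "What is PostgresML?"))
```

If you customize BASE_PROMPT, avoid stray curly braces other than the two placeholders, since a format-style substitution would treat them as fields.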

Usage

You can get help on the command line interface by running:

(pgml-bot-builder-py3.9) pgml-chat % pgml-chat --help
usage: pgml-chat [-h] --collection_name COLLECTION_NAME [--root_dir ROOT_DIR] [--stage {ingest,chat}] [--chat_interface {cli,slack,discord}]

PostgresML Chatbot Builder

optional arguments:
  -h, --help            show this help message and exit
  --collection_name COLLECTION_NAME
                        Name of the collection (schema) to store the data in PostgresML database (default: None)
  --root_dir ROOT_DIR   Input folder to scan for markdown files. Required for ingest stage. Not required for chat stage (default: None)
  --stage {ingest,chat}
                        Stage to run (default: chat)
  --chat_interface {cli,slack,discord}
                        Chat interface to use (default: cli)
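To make the option rules concrete, here is a sketch that rebuilds the interface from the help text with Python's `argparse`. It is a reconstruction for illustration, not the tool's actual source. The explicit check that `--root_dir` is present for the ingest stage is an assumption based on the help text's wording.

```python
import argparse


def build_parser():
    """Rebuild the options listed in the help text (not the tool's actual source)."""
    parser = argparse.ArgumentParser(
        prog="pgml-chat",
        description="PostgresML Chatbot Builder",
        formatter_class=argparse.ArgumentDefaultsHelpFormatter,
    )
    parser.add_argument("--collection_name", required=True,
                        help="Name of the collection (schema) to store the data in PostgresML database")
    parser.add_argument("--root_dir",
                        help="Input folder to scan for markdown files. Required for ingest stage")
    parser.add_argument("--stage", choices=["ingest", "chat"], default="chat",
                        help="Stage to run")
    parser.add_argument("--chat_interface", choices=["cli", "slack", "discord"], default="cli",
                        help="Chat interface to use")
    return parser


def parse(argv):
    """Parse argv and enforce that the ingest stage has an input folder."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if args.stage == "ingest" and not args.root_dir:
        parser.error("--root_dir is required when --stage is ingest")
    return args


args = parse(["--collection_name", "docs"])
print(args.stage, args.chat_interface)  # prints: chat cli
```

With only `--collection_name` given, the defaults select the chat stage on the CLI interface, which matches the defaults shown in the help output.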

Ingest

In this step, we ingest documents, chunk them, generate embeddings, and index those embeddings for fast queries.

LOG_LEVEL=DEBUG pgml-chat --root_dir <directory> --collection_name <collection_name> --stage ingest

You will see output logging the pipeline's progress.
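Before ingesting, it can help to preview which files the input folder contains. The sketch below lists Markdown files under a folder. The recursive search and the `*.md` pattern are assumptions, since the source only says the folder is scanned for markdown files, and `markdown_files` is a hypothetical helper rather than part of pgml-chat.

```python
import tempfile
from pathlib import Path


def markdown_files(root_dir):
    """Return all .md files under root_dir, sorted, searching subfolders too."""
    root = Path(root_dir)
    if not root.is_dir():
        raise NotADirectoryError(f"{root_dir} is not a directory")
    return sorted(p for p in root.rglob("*.md") if p.is_file())


# Demonstrate on a throwaway folder so the example runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "guides").mkdir()
    (Path(tmp) / "index.md").write_text("# Home")
    (Path(tmp) / "guides" / "setup.md").write_text("# Setup")
    (Path(tmp) / "notes.txt").write_text("not markdown")
    found = [p.relative_to(tmp).as_posix() for p in markdown_files(tmp)]
    print(found)
```

To preview your own documents, call `markdown_files` with the same path you pass to `--root_dir`.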

Chat

You can interact with the bot using the command line interface, Slack, or Discord.

Command Line Interface

In this step, we chat with the bot at the command line, which is the default chat interface. You can set the log level to ERROR to suppress most log output.

LOG_LEVEL=ERROR pgml-chat --collection_name <collection_name> --stage chat --chat_interface cli

You should be able to interact with the bot as shown below. Press Ctrl-C to exit.

User (Ctrl-C to exit): Who are you?
PgBot: I am PgBot, an AI assistant here to answer your questions about PostgresML, an open source software. How can I assist you today?
User (Ctrl-C to exit): What is PostgresML?
Found relevant documentation.... 
PgBot: PostgresML is an open source software that allows you to unlock the full potential of your data and drive more sophisticated insights and decision-making processes. It provides a dashboard with analytical views of the training data and model performance, as well as integrated notebooks for rapid iteration. PostgresML is primarily written in Rust using Rocket as a lightweight web framework and SQLx to interact with the database.

If you have any further questions or need more information, please feel free to send an email to team@postgresml.org or join the PostgresML Discord community at https://discord.gg/DmyJP3qJ7U.

Slack

Setup

You need SLACK_BOT_TOKEN and SLACK_APP_TOKEN to run the chatbot on Slack. You can get these tokens by creating a Slack app. Follow the instructions here to create a Slack app. Include the following environment variables in your .env file:

SLACK_BOT_TOKEN=<SLACK_BOT_TOKEN>
SLACK_APP_TOKEN=<SLACK_APP_TOKEN>

In this step, we start chatting with the chatbot on Slack. You can increase the log level to ERROR to suppress the logs.

LOG_LEVEL=ERROR pgml-chat --collection_name <collection_name> --stage chat --chat_interface slack

If you have set up the Slack app correctly, you should see the following output:

⚡️ Bolt app is running!

Once the Slack app is running, you can interact with the chatbot on Slack. In this example, the bot is named PgBot. The app responds only to direct messages sent to the bot.

Discord

Setup

You need DISCORD_BOT_TOKEN to run the chatbot on Discord. You can get this token by creating a Discord app. Follow the instructions here to create a Discord app. Include the following environment variable in your .env file:

DISCORD_BOT_TOKEN=<DISCORD_BOT_TOKEN>

In this step, we start chatting with the chatbot on Discord. You can increase the log level to ERROR to suppress the logs.

pgml-chat --collection_name <collection_name> --stage chat --chat_interface discord

If you have set up the Discord app correctly, you should see the following output:

2023-08-02 16:09:57 INFO     discord.client logging in using static token

Once the Discord app is running, you can interact with the chatbot on Discord. In this example, the bot is named pgchat. The app responds only to direct messages sent to the bot.
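Because each chat interface needs different tokens, a small pre-flight check can catch a missing variable before the bot starts. The sketch below maps each interface to the variables named in the Slack and Discord sections. `REQUIRED_TOKENS` and `missing_tokens` are hypothetical names, not part of pgml-chat, and the token value is a placeholder.

```python
import os

# Tokens each chat interface needs, as listed in the Slack and Discord sections.
REQUIRED_TOKENS = {
    "cli": [],
    "slack": ["SLACK_BOT_TOKEN", "SLACK_APP_TOKEN"],
    "discord": ["DISCORD_BOT_TOKEN"],
}


def missing_tokens(chat_interface, environ=None):
    """Return the names of required variables that are unset or empty."""
    environ = os.environ if environ is None else environ
    return [name for name in REQUIRED_TOKENS[chat_interface] if not environ.get(name)]


# Example: only one of the two Slack tokens is set.
print(missing_tokens("slack", {"SLACK_BOT_TOKEN": "example-token"}))
```

Passing a dictionary makes the check easy to try out. Calling `missing_tokens("discord")` with no second argument reads the real process environment instead.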