PoC Embedding Applications

Background

I built this application in a Jupyter notebook to learn how to build applications using GPT and an embedding database, an approach known as Retrieval-Augmented Generation (RAG).

What this application can do

This is a Q&A application that answers questions about the contents of a specific web blog.
A user asks a question about the blog contents, and the app answers it.

How to build this application

Data Preparation
1. Scrape the web pages
  We need to gather data from the web blog; the Q&A application refers to this data when answering a query.
  ** Please be careful about the copyright of the web pages.
  The file is here
2. (Optional) Format the text using ChatGPT.
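The data preparation step can be sketched roughly as follows. This is a minimal illustration, not the repository's actual scraping script: it assumes the page HTML has already been downloaded, uses only the Python standard library, and the names `TextExtractor`, `html_to_text`, `write_pages_csv`, and the `url`/`text` column names are hypothetical.

```python
import csv
import io
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def html_to_text(html):
    """Return the visible text of an HTML string, joined with single spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def write_pages_csv(pages, out):
    """Write (url, html) pairs to `out` as CSV rows of url and extracted text."""
    writer = csv.writer(out)
    writer.writerow(["url", "text"])  # hypothetical column names
    for url, html in pages:
        writer.writerow([url, html_to_text(html)])


if __name__ == "__main__":
    sample = "<html><body><h1>Title</h1><p>Hello <b>world</b></p></body></html>"
    buffer = io.StringIO()
    write_pages_csv([("https://example.com/post", sample)], buffer)
    print(buffer.getvalue())
```

In a real run, `out` would be an open CSV file and each page's HTML would come from an HTTP request, subject to the copyright note above.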
Build Application
The script is here
1. Import the raw data from the CSV file prepared in step 1 of Data Preparation.
2. Generate embeddings (vector data) of the raw data with the OpenAI API.
3. Insert the embeddings (vector data) into Pinecone.
  (The application is now ready to answer queries from a user.)
4. A user inputs a query.
5. Generate the embedding (vector data) of the query with the OpenAI API.
6. Query Pinecone with the embedding (vector data) of the query.
7. Call the OpenAI API with a prompt that includes the retrieved raw data as context.
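The retrieval flow in the steps above can be sketched without any network calls. This is an illustration under stated assumptions, not the notebook's code: `InMemoryIndex` is a hypothetical stand-in for Pinecone, `toy_embed` is a toy word-count embedding standing in for the OpenAI embeddings API, and the `complete` argument stands in for the OpenAI completion call. The prompt wording is also an assumption.

```python
import math

# Toy vocabulary for the stand-in embedding (assumption, for illustration only).
VOCAB = ["pinecone", "vector", "openai", "embedding", "blog"]


def toy_embed(text):
    """Stand-in for an embeddings API: count vocabulary words in the text."""
    words = text.lower().split()
    return [float(words.count(w)) for w in VOCAB]


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class InMemoryIndex:
    """Hypothetical stand-in for the vector database (steps 3 and 6)."""

    def __init__(self):
        self.records = []

    def upsert(self, record_id, vector, text):
        self.records.append((record_id, vector, text))

    def query(self, vector, top_k=3):
        scored = [
            (cosine_similarity(vector, v), rid, text)
            for rid, v, text in self.records
        ]
        scored.sort(key=lambda item: item[0], reverse=True)
        return scored[:top_k]


def build_prompt(question, contexts):
    """Step 7: put the retrieved raw text into the prompt as context."""
    context_block = "\n---\n".join(contexts)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context_block}\n\n"
        f"Question: {question}\nAnswer:"
    )


def answer_query(question, index, embed, complete, top_k=3):
    """Steps 4 to 7: embed the query, retrieve matches, call the model."""
    query_vector = embed(question)
    matches = index.query(query_vector, top_k=top_k)
    prompt = build_prompt(question, [text for _, _, text in matches])
    return complete(prompt)


if __name__ == "__main__":
    index = InMemoryIndex()
    docs = ["pinecone is a vector database", "openai provides an embedding api"]
    for i, doc in enumerate(docs):
        index.upsert(str(i), toy_embed(doc), doc)  # steps 2 and 3
    # The echo function below just returns the prompt instead of calling a model.
    print(answer_query("what is pinecone", index, toy_embed, lambda p: p, top_k=1))
```

Passing `embed` and `complete` as arguments keeps the flow testable offline; in the actual application they would wrap the OpenAI API, and the index would be a Pinecone index.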

Solution Architecture

Application Architecture

graph LR;
    Local-->|API| Pinecone;
    Local-->|API| OpenAI;

Reference

OpenAI
OpenAI
OpenAI Tutorials
OpenAI Models
OpenAI API Reference
OpenAI tokenizer
OpenAI Cookbook Question_answering_using_embeddings.ipynb
OpenAI Cookbook vector_databases pinecone

Vector Database Pinecone
Pinecone
What is a Vector Database & How Does it Work? Use Cases + Examples
The Missing WHERE Clause in Vector Search
Pinecone Docs

kurakurakuda/sandbox_rag_app
