✍🏻 ParollaGPT

Learn Corsican language using Large Language Model through a discussion with a personalized assistant.

Philosophy

We want to propose a 1 to 1 tutoring dialogue that will be based on large corpus of all kinds of books about the given language, how to conjugate verbs and more. We are planning to add more languages, but we started with Corsican language since it's the repository author country. We are targetting languages that have bad coverage in LLM datasets.

Demo

DM on twitter to request a demo token.

https://twitter.com/zenocode_org

✅ Running locally

Install dependencies: pip install -r requirements.txt
Run ingest.sh to ingest LangChain docs data into the vectorstore (only needs to be done once).
1. You can use other Document Loaders to load your own data into the vectorstore.
Run the app: make start
1. To enable tracing, make sure langchain-server is running locally and pass tracing=True to get_chain in main.py. You can find more documentation here.
Open localhost:9000 in your browser.

🚀 Important Links

Deployed version (to be updated soon):

📚 Technical description

There are two components: ingestion and question-answering.

Ingestion has the following steps:

Pull html from documentation site
Load html with LangChain's ReadTheDocs Loader
Split documents with LangChain's TextSplitter
Create a vectorstore of embeddings, using LangChain's vectorstore wrapper (with OpenAI's embeddings and FAISS vectorstore).

Question-Answering has the following steps, all handled by ChatVectorDBChain:

Given the chat history and new student input, determine student missing knowledge and propose an exercise to fill the knowledge gap.
Given that exercise proposition, look up relevant documents from the vectorstore.
Pass the student input and relevant documents to a LLM to generate a final answer.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.vscode		.vscode
archive		archive
documents		documents
templates		templates
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
callback.py		callback.py
courses.py		courses.py
environment.yml		environment.yml
ingest.py		ingest.py
ingest.sh		ingest.sh
log.ini		log.ini
main.py		main.py
prompt.py		prompt.py
query_data.py		query_data.py
requirements.txt		requirements.txt
schemas.py		schemas.py
tutor_conversation.py		tutor_conversation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

✍🏻 ParollaGPT

Philosophy

Demo

✅ Running locally

🚀 Important Links

📚 Technical description

About

Releases 3

Packages

Languages

License

zenocode-org/parolla-chat

Folders and files

Latest commit

History

Repository files navigation

✍🏻 ParollaGPT

Philosophy

Demo

✅ Running locally

🚀 Important Links

📚 Technical description

About

Resources

License

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages