This repository is dedicated to the art of summarizing Long text using the power of Langchain in conjunction with Google Vertex AI.
The primary goal here is to acquire a deep understanding of utilizing Langchain for processing exceptionally long text, well beyond the typical context window of conventional Language Models. Within this repository, you will find an exploration of Langchain's applications, along with a recommended pipeline for effectively summarizing lengthy textual content.
-
build environment:
pip -r requirement.txt
-
cd to the source directory:
cd src
-
run streamlit app:
streamlit run app.py
This main idea of the pipeline is from: https://towardsdatascience.com/summarize-podcast-transcripts-and-long-texts-better-with-nlp-and-ai-e04c89d3b2cb
All the credits should go to the original author: https://iztham.medium.com/