This repository has been archived by the owner on Oct 17, 2024. It is now read-only.
New release adds the "text_files" stream type, that uses textract to read contents of almost any type of file. Intended for RAG type ingest pipelines, typically together with the meltano mapper map-gpt-embeddings
.