VERSION 0.0.3 #7

Merged
merged 5 commits into from
May 29, 2024

Contributor

@sanjaychelliah sanjaychelliah commented May 29, 2024

Added

  • Data Ingestion Pipelines: `from clarifai_datautils.text import Pipeline` (#5)
  • Data Ingestion Pipeline: PDF and Text file support, `from clarifai_datautils.text import TextPartition, PDFPartition` (#5)
  • Data Ingestion Pipeline: Cleaners and Extractors (#5)
  • Data Ingestion Pipeline: Clarifai Dataloader support, `pipeline.run(files = filepath, loader = True)` (#5)
  • Data Ingestion Pipeline: Llama Index Metadata Extractor (#5)
  • Data Ingestion Pipeline: Unit tests (#5)
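
For readers unfamiliar with the partition, clean, extract flow that the items above describe, here is a minimal stand-in sketch in plain Python. It does not use or reproduce `clarifai_datautils`. Every name in it (`Chunk`, `partition_by_paragraph`, `clean_extra_whitespace`, `extract_emails`, `run_pipeline`) is hypothetical and chosen only for illustration, and the real library's classes and signatures may differ.

```python
# Illustrative stand-in only: these names are hypothetical and are NOT
# the clarifai_datautils API.
import re
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class Chunk:
    """One piece of partitioned text plus any extracted metadata."""
    text: str
    metadata: Dict[str, Any] = field(default_factory=dict)


def partition_by_paragraph(raw: str) -> List[Chunk]:
    """Partition step: split raw text into chunks on blank lines."""
    return [Chunk(p) for p in re.split(r"\n\s*\n", raw) if p.strip()]


def clean_extra_whitespace(chunk: Chunk) -> Chunk:
    """Cleaner step: collapse runs of whitespace into single spaces."""
    return Chunk(" ".join(chunk.text.split()), dict(chunk.metadata))


def extract_emails(chunk: Chunk) -> Chunk:
    """Extractor step: record email-like strings in the chunk metadata."""
    metadata = dict(chunk.metadata)
    metadata["emails"] = re.findall(r"[\w.+-]+@[\w-]+\.[\w.-]+", chunk.text)
    return Chunk(chunk.text, metadata)


def run_pipeline(
    raw: str,
    partitioner: Callable[[str], List[Chunk]],
    transformations: List[Callable[[Chunk], Chunk]],
) -> List[Chunk]:
    """Partition once, then apply each transformation to every chunk in order."""
    chunks = partitioner(raw)
    for transform in transformations:
        chunks = [transform(c) for c in chunks]
    return chunks


if __name__ == "__main__":
    sample = "Hello   world.\n\nContact:  a@b.com  now."
    for chunk in run_pipeline(
        sample, partition_by_paragraph, [clean_extra_whitespace, extract_emails]
    ):
        print(chunk.text, chunk.metadata)
```

Keeping each cleaner and extractor as a function from chunk to chunk means steps can be reordered or unit-tested in isolation, which is one plausible reason to structure an ingestion pipeline this way.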

Note

  • Tests currently fail on Windows because of an issue in the python-magic package. Once that issue is fixed, Windows will be added to the tests.

@sanjaychelliah sanjaychelliah requested a review from sainivedh May 29, 2024 13:24
@sanjaychelliah sanjaychelliah merged commit e94ff71 into main May 29, 2024
6 checks passed
2 participants