Support Indexing Images via OCR (#823) #41

Merged
merged 1 commit into from
Jul 2, 2024

Owner

@stevennt stevennt commented Jul 2, 2024

  • Added support for uploading .jpeg, .jpg, and .png files to Khoj from the Web and Desktop apps
  • Updated the indexer to generate raw text and entries using RapidOCR
  • Details
    • added support for indexing images via OCR
    • fixed pyproject.toml
    • Update src/khoj/processor/content/images/image_to_entries.py
      Co-authored-by: Debanjum <debanjum@gmail.com>
    • Update src/khoj/processor/content/images/image_to_entries.py
      Co-authored-by: Debanjum <debanjum@gmail.com>
    • removed redundant try except blocks
    • updated desktop js file to support image formats
    • added tests for jpg and png
    • Fix processing for image to entries files
    • Update unit tests with working image indexer
    • Change png test from version verification to OpenCV verification
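
For readers who want a feel for the indexing step described above, here is a minimal sketch of turning uploaded images into text entries with an OCR engine. It is not the code in `image_to_entries.py`. The names `is_supported_image`, `image_to_text`, `images_to_entries`, and `load_rapidocr_engine` are hypothetical, as is the entry shape (`file`, `raw`). The sketch assumes the calling convention of the `rapidocr_onnxruntime` package, where calling the engine on an image returns a `(result, elapsed)` pair and `result` is either `None` or a list of `[box, text, score]` items.

```python
from pathlib import Path
from typing import Callable, Dict, List, Optional, Sequence, Tuple

# Extensions named in this PR's description.
IMAGE_EXTENSIONS = {".jpeg", ".jpg", ".png"}

# Assumption: an OCR engine is any callable that takes image bytes and returns
# (result, elapsed), where result is None or a list of [box, text, score].
OcrEngine = Callable[[bytes], Tuple[Optional[Sequence[Sequence]], object]]


def is_supported_image(filename: str) -> bool:
    """Return True if the file extension is one of the supported image types."""
    return Path(filename).suffix.lower() in IMAGE_EXTENSIONS


def image_to_text(image_bytes: bytes, engine: OcrEngine) -> str:
    """Run OCR on one image and join the recognized lines into raw text."""
    result, _elapsed = engine(image_bytes)
    if not result:
        # No text detected (the engine returned None or an empty list).
        return ""
    return "\n".join(item[1] for item in result)


def images_to_entries(files: Dict[str, bytes], engine: OcrEngine) -> List[dict]:
    """Build one hypothetical entry per supported image that contains text."""
    entries = []
    for name, data in files.items():
        if not is_supported_image(name):
            continue  # skip anything that is not .jpeg, .jpg, or .png
        text = image_to_text(data, engine)
        if text.strip():
            entries.append({"file": name, "raw": text})
    return entries


def load_rapidocr_engine() -> OcrEngine:
    """Create a RapidOCR engine; imported lazily so the sketch runs without it."""
    from rapidocr_onnxruntime import RapidOCR  # requires the optional dependency

    return RapidOCR()
```

Passing the engine in as a parameter keeps the entry-building logic testable with a stub; with the dependency installed, `images_to_entries(files, load_rapidocr_engine())` would be the intended call.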

---------

Co-authored-by: Debanjum <debanjum@gmail.com>
Co-authored-by: sabaimran <narmiabas@gmail.com>
@stevennt stevennt merged commit 588c1ca into stevennt:khoj.master Jul 2, 2024