Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ability to select areas from images and apply filters while using the OCR #465

Closed
3 tasks done
asereze opened this issue Oct 31, 2024 · 2 comments
Closed
3 tasks done
Labels
enhancement New feature or request

Comments

@asereze
Copy link
Contributor

asereze commented Oct 31, 2024

Describe your suggested feature

When selecting an image to extract text from, it would be really useful to be able to see it inside the app and select a specific area of it. Right now, the entire image gets parsed, and that can create some issues when there are different blocks of text that could get erroneously mixed (for example, in a comic book page).
It would be even better to be able to apply some filters to the selected text before analysing it.

I don't know if it could be useful, but there are already some open source projects for Android that could be used as references:

- Android OCR (OCR based on Tesseract 5)
https://f-droid.org/packages/io.github.subhamtyagi.ocr/

https://github.com/SubhamTyagi/android-ocr

This one allows the user to import an image, rotate it and flip it before the analysis, and to apply some automatic pre-processing filters as well.

Screenshot_20241029-223249_OCR
Screenshot_20241029-223300_OCR
Screenshot_20241029-223316_OCR

- Text Fairy
https://apt.izzysoft.de/fdroid/index/apk/com.renard.ocr?repo=archive

https://github.com/renard314/textfairy

With this one you can select an area in a polygonal shape (especially useful for inclined photos, but not just that) and select multiple columns in the order that you prefer.

Screenshot_20241029-223708_Text Fairy
Screenshot_20241029-223728_Text Fairy
Screenshot_20241031-223015_Text Fairy
Screenshot_20241029-223747_Text Fairy
Screenshot_20241029-223843_Text Fairy

- Kaku (Japanese OCR Dictionary)
https://f-droid.org/packages/ca.fuwafuwa.kaku/

https://github.com/0xbad1d3a5/Kaku

Although this one works differently (it is displayed above the other apps), it allows you to select different areas one after the other and, by long tapping and dragging left or right, you can instantly change the contrast of the text within the selected area before parsing it. An automatic black and white filter for the same area is also available.

Screenshot_20241031-224120_Firefox
Screenshot_20241029-224620_Galleria
Screenshot_20241029-224635_Galleria

Other details

No response

Acknowledgements

  • I have searched the existing issues and this is a new ticket, NOT a duplicate or related to another open issue.
  • I have written a short but informative title.
  • I will fill out all of the requested information in this form.
@asereze asereze added the enhancement New feature or request label Oct 31, 2024
@asereze
Copy link
Contributor Author

asereze commented Nov 1, 2024

@drogga I don't know why sharing it from the app doesn't work, the last FOSS version (4.1.0, the one that I used to make the screenshots) is available for download from the F-Droid Archive repository. Unfortunately yes, it's from 2021, the new releases' code is not available.

@asereze
Copy link
Contributor Author

asereze commented Nov 1, 2024

Ok, thank you!

@Bnyro Bnyro closed this as completed in 1f80c62 Nov 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant