-
Notifications
You must be signed in to change notification settings - Fork 230
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
A better page number-crop algorithm with NumPy. (Manga) #709
Conversation
Wow this looks incredible, might take me a while to fully review it though. For reference, my perspective is mostly using a Kindle Scribe 10" which doesn't need to zoom or crop at all. But when I use smaller devices like an android Hisense a9 eink phone cropping is essential. Have you looked into any android manga reading software? I bet they might like a feature like this, and maybe they have something similar. And what platform did you develop on? I use macOS (arm64/apple silicon). And do you you have a general idea of the performance difference and how much increases the binary size? Aka ’python setup.py build_binary’ |
I hope it would actually work for others, I tested it on 30+ images from 5-6 different Mangas. I want to try it on some more Mangas before I'll continue.
Very shallowly. I tried to search on Google for some existing manga or comic crop algorithms, even some general manga-readers apps, But couldn't find something meaningful (aside of the standard cropping). I usually read on my PC or my [used to have] first gen Kindle Paperwhite 6" (as you can understand my constraint here). Recently, my Kindle's screen shattered and I've ordered a new e-reader with a 7.8" screen. I've yet to receive it so in the meantime I thought I would at least try to optimize the Mangas that I plan to read/load when I get it. Hence, I tried to write this algorithm.
I use an old Windows 10 laptop (intel x86-64)
TBH, I forgot to measure the binary, so I ran it just now to see. Regarding performance, to my surprise, it didn't suffer much of a hit, I did multiple types of benchmarks.
By the way, |
This comment was marked as resolved.
This comment was marked as resolved.
While I agree, I thought we wanted to optimize output file size (as numpy increases it by 30%). Do you want me to add the NumPy version back? Unrelated - I've detected that the algorithm sometimes fails on thin fonts and single-digit page numbers. Also, there is an issue with using higher power levels that causes the page number to become distorted and the algorithm fails to detect it (currently I'm optimizing power=1 to work the best in most cases). |
That's great! Excuse me, is the Cropping mode value Cropping Power: 2 in KCC's current version of the cropping mode that corresponds to the Power3 you're demonstrating? |
Hi! No, the power levels [0...3] that I'm using are different entirely from the current power levels of [0...2] that exist currently in KCC v6.1.0. I've noted that in the issues at the bottom of OP.
|
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
I have used the better optimization algorithm you provided and it is really great! Thank you very much! |
I did it. Thank you |
@zhaohengkang feel free to upload some comparison photos. |
320d056
to
38ff5e3
Compare
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
I see. Then NumPy version it is. Regarding the algorithm - there are still issues with thin fonts of page numbers and instances of a single-digit page num that the algorithm fails to detect with higher power levels (but less of a problem with lower power levels). As Zhao demonstrated, the algorithm works fine in most cases and I think it is ready for general use, even if more tweaks and improvements are needed for some edge cases or maybe some scenarios that I haven't tested yet. What about the issue I have with the inconsistency with power levels of the current "crop margins" versus the new "crop margin + page num"? @zhaohengkang Very nice comparison images you compiled there! seems like you worked hard sharing these results! |
@neyney10 Sorry, because when I turned it on, the default was power=2, and precisely this power met my requirements, so I did not use the lower power to test. |
Sorry, totally forgot you asked a question here. Is it easy to modify the existing modify "crop margins" algorithm? And is it unlikely to cause issues for users? If so, do it. Otherwise, just put a note in the tooltip that the power is used differently. If you really want to offer the old behavior too, you can add a LEGACY checkbox. |
Hi, thank you for providing such an amazing algorithm! However, I encountered an issue while using it: when I select the Cropping mode (i.e., when it is "checked"), an error often occurs. A screenshot of the error is attached. It seems that the issue might be related to the algorithm's inability to handle blank images within comic files. To be honest, I’m not very familiar with the technical aspects, so I’ve been testing things out bit by bit. Initially, I suspected that the problem was due to the format conversion software I was using, so I tried several different ones, but the error persisted. Then, I thought it might be related to the PDF format, so I converted it to a folder containing image files and tested again, but the issue remained. Eventually, I discovered that the problem occurs when there are completely blank images within the comic files. I further tested and found that the error only happens when the Cropping mode is checked. When it’s set to "indeterminate" or when cropping is not used, everything works fine. Additionally, using the original cropping algorithm version of KCC does not have this issue. |
Thank you for the heads up, it seems like u did a nice investigation. It could be that I overlooked an edge-case of blank images. I'll confirm it and see what's going on. |
1. Replaced both crop margins and crop margins & page num with newer algorithm. 2. Crop max power level increased to 3.0 3. Adds NumPy as a new dependency.
This comment was marked as resolved.
This comment was marked as resolved.
For the update:
|
This comment has been minimized.
This comment has been minimized.
This comment was marked as resolved.
This comment was marked as resolved.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
Alright, I'm ready to merge as a tentative 7.0.0 release, because the numpy dependency is a big dependency change. Get ready for more user feedback since this is now in the actual releases tab instead of being linked in the release notes. Is this alright as the first line in the release notes?
code quality is great, I only have some minor nitpicks about defining functions before you use them and the fact that the margin crop algorithm should be defined first to make reviewing easier. Maybe the page# crop algo should call the margin crop algo to reduce duplicated code/comments. People in the discord have reported good results with the new algo. I have merged this for now, but I'll keep looking into it, I wonder if any python or numpy built ins can do what the helper functions are doing? Since kcc has no unit tests, always unsure if we miss a corner case. And armv7 now takes forever to build due to numpy source build... |
Ye, it's okay.
I guess we can somehow reduce duplicated code by reusing a more open function of the margin cropping algo.
During the development of the algorithm, I've created a small test system. It applies the algorithm on dozens of images and checks if it cropped them okay and matches the expected boxes. I've used numpy in the helper functions... I don't know if there is are better fit functions that encompass more of the logic contained in the helper functions... But I agree, I'm worried of corner cases as well.
Oh. Why is that? Is it because NumPy package isnt optimized for arm? Maybe NumPy v2+ is more optimized? |
Lots of packages including numpy don't have precompiled armv7 wheels. don't worry about it, just a curious observation. armv7 is typically just raspberry pi. |
Summary
Motivation
E-reader screen real estate is an important resource.
More available screen size means more details can be better seen, especially text.
Text is one of the most important elements that need to be clearly readable on e-readers,
which mostly are smaller devices where the need to zoom is unwanted.
By cropping the page number on the bottom of the page, 2%-5% of the page height can be regained
that allows us to upscale the image even more.
The Current Algorithm Fails
While it is possible to adjust the 'power' of the cropping, I can only gain some success when the power is at least 4 (using the power user EXE), where the GUI itself only allows up to 2.
The current algorithm fails in all of the cases that I have at hand.
A Better Algorithm
I wrote a Python function using PIL, NumPy (openCV might perform faster, but I regressed to PIL as it exists as a dependency already and including OpenCV is heavy, package weight-wise).
OUTPUT EXAMPLES OF THE NEW ALGORITHM
(I extended the power level limit to 3, from the previous limit of 2)
Page Number Cropping
Showcasing Its Ability To Retain Manga Content
It does not crop the page number if it detects other things that might be cropped along with the page number.
Issues