-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Evaluate PDF parsing libraries #127
Comments
have tried both with camelot and tabula-py with the game reports. Both went well in my cases. Can you explain the "problem" cases? Is it related to correct_data.py ? Why is needing java an issue and what are the corner cases you mention? |
@filgit the "problem" with java is that it's just an otherwise unnecessary technology/dependency in the system. Line 51 in 00ee8a8
Line 8 in 00ee8a8
for the "corner cases" I don't remember what exactly they were, but I have a long list of "erroneous reports", performance might be another reason, but that should be measured first to really count as an argument. but this issue didn't really have priority, else it wouldn't celebrate birthday soon 😅 |
okay, see. With camelot you will have ghostscript as an additional dependency. |
currently using https://github.com/chezou/tabula-py/tree/master
problems:
Alternatives:
The text was updated successfully, but these errors were encountered: