This repository has been archived by the owner on Jun 15, 2023. It is now read-only.
-
-
Notifications
You must be signed in to change notification settings - Fork 113
URLs truncated at line endings #21
Comments
Hi, I'm not sure if you are still working on this code. But on the chance that you are, I wanted to let you know that I also experience the same issue in pdfx v 1.3.1 that bitsgalore reported above. |
I would love to see a solution to this issue. It is one of two problems that is stopping me from using pdfx for my academic research. |
I see the same issue; reported good or 404 URLs are truncated at 20 characters when using the command format: |
Same issue here. Lots of URLs are ignored or treated as invalid because they cover multiple lines in a PDF (especially when the lines are narrow). Please fix - this is a critical issue preventing me from using pdfx! |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
First of all: great tool! I did however come across a problem with URLs that span more than one line. I've attached a PDF that reproduces the problem here:
testpdfx.pdf
Command:
The URL in the footnote is extracted as::
Whereas this should be:
I used pdfx version 1.3.1 on Linux Mint.
The text was updated successfully, but these errors were encountered: