Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Node.js Server got stuck when parsing specific PDF while it is working for other PDFs #321

Closed
pedrodev033 opened this issue Nov 15, 2023 · 2 comments

Comments

@pedrodev033
Copy link

when trying to call this function this.PDFParser.parseBuffer(PDF)
server got stuck. even setTimeout is not working and not getting api call. overall server freeze and I had to restart server.
Appreciate the help.

@modesty
Copy link
Owner

modesty commented Dec 27, 2023

can you upload the PDF file?

@abhithkba
Copy link

abhithkba commented Apr 11, 2024

same issue.
here is the pdf file
21583473018.pdf

the error is not getting caught by try catch or pdfParser.on("pdfParser_dataError", errData => console.error(errData.parserError) );

modesty added a commit that referenced this issue May 6, 2024
…on will only be warned in log, won't stop the text and form parsing
modesty added a commit that referenced this issue May 6, 2024
modesty added a commit that referenced this issue May 11, 2024
…on will only be warned in log, won't stop the text and form parsing
modesty added a commit that referenced this issue May 11, 2024
modesty added a commit that referenced this issue May 24, 2024
…tion handlings (#346)

* feat: add eslint, fix issue #321, add test PDF for #321:image exception will only be warned in log, won't stop the text and form parsing

* fix: issue #318, fixed together with #321

* fix: issue #343, resolve pkInfo at build time

* fix: issue #255 and #277, keep parsing if img data is not resolved

* maint: add ts dependency and config, remove root pdf2json.js, use ./bin/pdf2json.js instead

* maint: replace lib/p2jcom*.js with src/cli/p2jcli*.ts, compied cli is moved to /bin/cli

* maint: add compiled bundle to bin/cli and dist

* maint: remove unnecessary build artifacts

* build: add prepare scripts

* test: add more page content verification in test

* test: add more cli tests after jest

* fix: auto create output dir in CLI to make more tests run without setting up directory structure

* fix: #262: correct the condition check, also add i262_4hXzVq.pdf to test/misc
@modesty modesty closed this as completed May 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants