Fix various corrupt PDF files (issue 9252, issue 9418) #9827

…getNumber` (PR 8359 follow-up) With the current code line-breaks are accepted not just after an operator, but after a decimal point as well. When looking at this again, the latter case seems prone to cause false positives and might also interfere with subsequent patches. Hence this is code is adjusted to actually do what the original commit message says, and nothing more.

This is consistent with the behaviour in Adobe Reader.

…rators in `XRef.indexObjects` (PR 9288 follow-up)

…ef.parse`

… to recover when possible Note that the `Catalog` constructor, and some of its methods, are already enforcing that the 'Root' dictionary is valid/well-formed. However, by doing additional validation already in `XRef.parse` there's a slightly larger chance that corrupt PDF files could be successfully parsed/rendered.

…indexObjects` (issue 9418) This patch avoids choosing a (possible) 'trailer' dictionary that `XRef.parse` and/or the `Catalog` constructor/methods will reject anyway. Since `XRef.indexObjects` is already parsing the entire PDF file, the extra dictionary look-ups added here shouldn't matter much. Besides, this is a fallback code-path that only applies to corrupt PDF files anyway.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix various corrupt PDF files (issue 9252, issue 9418) #9827

Fix various corrupt PDF files (issue 9252, issue 9418) #9827

Commits on Jun 20, 2018