Major performance issues for a simple PDF file #6961

timvandermeij · 2016-02-08T14:53:08Z

I have created a simple PDF file using Scribus 1.5.0svn. The file is large for a two-page PDF, so I can only suspect that Scribus is doing something really inefficient when exporting it. Nevertheless, the PDF file below renders almost instantly (within 0.5 seconds) in both Adobe Acrobat Reader DC and Foxit Reader, whereas PDF.js takes 27 seconds to render only the first page. I have no idea why PDF.js takes such an excessive amount of time, but we need to do better here, given that other viewers do not have any problems with this file. Are there perhaps inefficient patterns in this file that the optimizer could remove? I notice that the PDF file contains an excessive number of resources in the Resources dictionary of each page (XObject, Font, Pattern and ExtGState).

Below is the PDF file. I made this myself, so anyone is free to use this as a test case in a PR that addresses this issue:
test.pdf

Rob--W · 2016-02-08T17:01:19Z

The CPU profiler shows that most of the time is spent in PartialEvaluator_hasBlendModes:

pdf.js/src/core/evaluator.js

Lines 144 to 200 in a0aa781

    hasBlendModes: function PartialEvaluator_hasBlendModes(resources) {
      if (!isDict(resources)) {
        return false;
      }
      var processed = Object.create(null);
      if (resources.objId) {
        processed[resources.objId] = true;
      }
      var nodes = [resources];
      while (nodes.length) {
        var key;
        var node = nodes.shift();
        // First check the current resources for blend modes.
        var graphicStates = node.get('ExtGState');
        if (isDict(graphicStates)) {
          graphicStates = graphicStates.getAll();
          for (key in graphicStates) {
            var graphicState = graphicStates[key];
            var bm = graphicState['BM'];
            if (isName(bm) && bm.name !== 'Normal') {
              return true;
            }
          }
        }
        // Descend into the XObjects to look for more resources and blend modes.
        var xObjects = node.get('XObject');
        if (!isDict(xObjects)) {
          continue;
        }
        xObjects = xObjects.getAll();
        for (key in xObjects) {
          var xObject = xObjects[key];
          if (!isStream(xObject)) {
            continue;
          }
          if (xObject.dict.objId) {
            if (processed[xObject.dict.objId]) {
              // stream has objId and is processed already
              continue;
            }
            processed[xObject.dict.objId] = true;
          }
          var xResources = xObject.dict.get('Resources');
          // Checking objId to detect an infinite loop.
          if (isDict(xResources) &&
              (!xResources.objId || !processed[xResources.objId])) {
            nodes.push(xResources);
            if (xResources.objId) {
              processed[xResources.objId] = true;
            }
          }
        }
      }
      return false;
    },

To profile (using Chrome), simply:

Download test.pdf from above.
Visit https://mozilla.github.io/pdf.js/web/viewer.html?file=
Open the developer tools, go to the Profiles tab.
Open test.pdf with PDF.js
Select "Target: pdf.worker.js" and click on Start to profile
Whenever you feel ready (e.g. when the CPU is idle), stop the capture.
Analyse the results.

timvandermeij · 2016-02-08T19:20:51Z

Thank you for profiling this! This code seems to loop over (potentially) all ExtGState and XObject dictionaries, and there are a lot of them in this PDF file. I'm afraid the for...in loops might cause delays here.

timvandermeij · 2016-02-08T20:09:00Z

I have narrowed down the flaw a bit. For the first page,

pdf.js/src/core/evaluator.js

Line 182 in a0aa781

if (processed[xObject.dict.objId]) {

is triggered almost 31000 times, meaning that we attempt to process a lot of XObjects that we already processed before.

Snuffleupagus · 2016-02-09T16:13:01Z

This seems to do the trick, but I've not had time to run tests yet: master...Snuffleupagus:issue-6961.
Also, I'm not sure what kind of test we can add for this, will look into this more later tonight.

timvandermeij · 2016-02-09T20:32:05Z

Funny, I also tried using getKeys() in a local test, but with no result, possibly because I missed master...Snuffleupagus:issue-6961#diff-0b94c2e77a5259f7a728122fdbf9f46aR182. Thanks for looking into this!

Snuffleupagus · 2016-02-10T12:34:53Z

So, my patch seems to pass all tests locally, but there're two problems:

First of all, how do we test this? Given that the run time is hardware/software dependent, I'm not sure how we can assert that a test doesn't run for too long!?
Second of all, I don't understand why the patch actually works ;-)
https://github.com/Snuffleupagus/pdf.js/blob/21a19c1a1cdeb1bb9ddae8a44e7da00c241c899e/src/core/evaluator.js#L193-L197 ought to be enough, since as far as I can tell xObject.dict.objId always seems to equal xObjects.getRaw(key).toString() in this PDF file.

timvandermeij · 2016-02-10T12:44:48Z

The only test I can imagine is a unit test where we assert that the number of processed XObjects is less than it is currently. It's not great, but it's the only kind of test I can come up with since measuring the runtime is not an option. Otherwise I think it suffices to review the patch, test it manually and make sure that the test suite passes.
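A counting test along those lines could look roughly like the sketch below. It does not use the PDF.js code: `countCalls`, `traverse` and the toy object graph are hypothetical stand-ins, and the only point is that a test can assert on the number of lookups instead of on wall-clock time.

```javascript
// Wrap a function so that the number of calls to it can be asserted on.
function countCalls(fn) {
  const wrapped = (...args) => {
    wrapped.calls++;
    return fn(...args);
  };
  wrapped.calls = 0;
  return wrapped;
}

// Hypothetical traversal: look up each object id at most once and
// enqueue whatever ids that object references.
function traverse(rootIds, lookup) {
  const processed = new Set();
  const queue = [...rootIds];
  while (queue.length > 0) {
    const id = queue.shift();
    if (processed.has(id)) {
      continue; // already handled, so no second lookup
    }
    processed.add(id);
    queue.push(...lookup(id));
  }
  return processed.size;
}

// Worst case for this toy graph: every object references all objects.
const objectCount = 50;
const ids = Array.from({ length: objectCount }, (_, i) => i);
const countedLookup = countCalls(() => ids);
const visited = traverse(ids, countedLookup);

// The assertion is hardware independent: one lookup per distinct object.
if (countedLookup.calls > objectCount) {
  throw new Error("too many lookups: " + countedLookup.calls);
}
```

The same idea would carry over to a real unit test by counting calls to whatever function resolves the XObjects, assuming that function can be wrapped or spied on.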

Regarding the second point, I would have to look into this more. If they are equal then the current code should do, so I'm not yet sure what the difference is with your patch.

Rob--W · 2016-02-10T13:49:52Z

> First of all, how do we test this? Given that the run time is hardware/software dependent, I'm not sure how we can assert that a test doesn't run for too long!?

27 seconds for such a simple PDF is excessive. We could create a suite of PDFs whose rendering time is measured (can be as simple as subtracting two time stamps), and then report the results to some central place. If we occasionally look at the results (e.g. a table, or a fancy graph), then we should get a good picture of what rendering times are normal and detect performance regressions.
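As a rough illustration of the "subtracting two time stamps" idea, here is a minimal sketch. `measure`, `runSuite` and `fakeRender` are hypothetical names; `fakeRender` only simulates work with a timer, and a real suite would call the actual rendering code and send the results somewhere central instead of printing them.

```javascript
// Time a (possibly asynchronous) task by subtracting two timestamps.
async function measure(label, task) {
  const start = Date.now();
  await task();
  return { label, ms: Date.now() - start };
}

// Hypothetical stand-in for "render the first page of a PDF file".
function fakeRender() {
  return new Promise((resolve) => setTimeout(resolve, 20));
}

// Run the cases one after another so that they do not compete for the CPU.
async function runSuite(cases) {
  const results = [];
  for (const [label, task] of cases) {
    results.push(await measure(label, task));
  }
  return results;
}

runSuite([["test.pdf, page 1", fakeRender]]).then((results) => {
  console.table(results); // a real setup would report these to a central place
});
```

Since the absolute numbers depend on the machine, such results are more useful as a trend over time than as a pass/fail threshold.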

> Second of all, I don't understand why the patch actually works ;-)

What exactly is unclear? Replacing getAll with getKeys seems like an obvious boost, is there something else with your patch that magically improves the runtime?

Snuffleupagus · 2016-02-10T14:16:26Z

> What exactly is unclear? Replacing getAll with getKeys seems like an obvious boost, is there something else with your patch that magically improves the runtime?

The getKeys change is not noticeable in the grand scheme of things, the real improvement comes from master...Snuffleupagus:issue-6961#diff-0b94c2e77a5259f7a728122fdbf9f46aR186.
In practice that check seems, for all intents and purposes, to be equivalent to an already existing check just below (as I commented above):

> https://github.com/Snuffleupagus/pdf.js/blob/21a19c1a1cdeb1bb9ddae8a44e7da00c241c899e/src/core/evaluator.js#L193-L197 ought to be enough, since as far as I can tell xObject.dict.objId always seems to equal xObjects.getRaw(key).toString() in this PDF file.

Rob--W · 2016-02-10T14:32:05Z

Using getAll results in traversing the whole tree and fetching Refs.
Using getKeys merely requires looking up all keys. But I think that if you call fetch on every Ref value, then you're getting a similar runtime as when you're using .getAll. By skipping .fetch if the Ref is already known, you're saving the overhead of resolving Refs.

(this is my guess, I didn't check whether it is really the reason).
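The guess above can be sketched with a toy model. The `Ref`, `XRef` and `Dict` classes below are simplified, hypothetical stand-ins written for this example (they are not the PDF.js classes, and the real patch may differ); the sketch only shows how checking the raw Ref before fetching avoids resolving the same XObject more than once.

```javascript
// Simplified stand-ins; only getKeys/getRaw/get/fetch matter for the example.
class Ref {
  constructor(num, gen) {
    this.num = num;
    this.gen = gen;
  }
  toString() {
    return `${this.num}R${this.gen}`;
  }
}

class XRef {
  constructor() {
    this.objects = new Map();
    this.fetchCount = 0; // instrumentation for the example
  }
  add(ref, obj) {
    this.objects.set(ref.toString(), obj);
  }
  fetch(ref) {
    this.fetchCount++;
    return this.objects.get(ref.toString());
  }
}

class Dict {
  constructor(xref, map) {
    this.xref = xref;
    this.map = map;
  }
  getKeys() {
    return Object.keys(this.map);
  }
  getRaw(key) {
    return this.map[key]; // returns a Ref as-is, without resolving it
  }
  get(key) {
    const value = this.map[key];
    return value instanceof Ref ? this.xref.fetch(value) : value;
  }
}

// Walk the XObject graph, resolving each referenced XObject at most once.
function countDistinctXObjects(resources, xref) {
  const processed = new Set();
  const nodes = [resources];
  let count = 0;
  while (nodes.length > 0) {
    const node = nodes.shift();
    const xObjects = node.get("XObject");
    if (!(xObjects instanceof Dict)) {
      continue;
    }
    for (const key of xObjects.getKeys()) {
      const raw = xObjects.getRaw(key);
      if (raw instanceof Ref) {
        if (processed.has(raw.toString())) {
          continue; // known Ref: skip the fetch entirely
        }
        processed.add(raw.toString());
      }
      const xObject = raw instanceof Ref ? xref.fetch(raw) : raw;
      count++;
      const xResources = xObject.get("Resources");
      if (xResources instanceof Dict) {
        nodes.push(xResources);
      }
    }
  }
  return count;
}

// Two XObjects that both point back at the same shared resources.
const xref = new XRef();
const refA = new Ref(1, 0);
const refB = new Ref(2, 0);
const shared = new Dict(xref, {
  XObject: new Dict(xref, { A: refA, B: refB }),
});
xref.add(refA, new Dict(xref, { Resources: shared }));
xref.add(refB, new Dict(xref, { Resources: shared }));

const total = countDistinctXObjects(shared, xref);
console.log(total, xref.fetchCount); // prints "2 2"
```

In this toy graph the shared resources are visited three times, but each XObject is fetched only once, because the Ref check happens before the fetch.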

yurydelendik · 2016-02-10T15:00:47Z

The reason above is somewhat right. We had thought about disabling getAll(), not for performance but because it can recursively pull unneeded data into the operator list. We probably need to review all getAll usages and remove them.

Snuffleupagus · 2016-02-10T15:24:40Z

@Rob--W You are absolutely correct, I don't know how I missed that myself. Thank you!

When an XObject is a group, we were applying the matrix and bounding box twice. In `beginGroup` we create a new canvas that is the size of the bounding box and translate it to the offset, so we don't need to apply the bounding box again during `paintFormXObjectBegin`. This improves mozilla#6961 quite a bit, but the indentation in the ruler is still missing.

timvandermeij added the performance label Feb 8, 2016

Snuffleupagus mentioned this issue Feb 10, 2016

Replace getAll with getKeys in PartialEvaluator_hasBlendModes to speed up loading of badly generated PDF files (issue 6961) #6971

Merged

yurydelendik closed this as completed in #6971 Feb 10, 2016

Snuffleupagus mentioned this issue Feb 12, 2016

Remove the only remaining Dict_getAll usage (in evaluator.js) and the method itself #6982

Merged

yurydelendik mentioned this issue Mar 25, 2016

Removes global PDFJS usage from the src/core/. #7053

Merged

Snuffleupagus mentioned this issue May 31, 2016

Scrolling is slow for openMagazin PDF #5808

Closed

Snuffleupagus mentioned this issue Oct 12, 2019

Cache processed 'ExtGState's in PartialEvaluator.hasBlendModes to avoid unnecessary parsing/lookups #11232

Merged

Snuffleupagus mentioned this issue Nov 5, 2020

Add global caching, for /Resources without blend modes, and use it to reduce repeated fetching/parsing in PartialEvaluator.hasBlendModes #12583

Merged

THausherr mentioned this issue Aug 10, 2021

Lazier clipping apache/pdfbox#127

Closed

brendandahl mentioned this issue Nov 5, 2021

Don't double apply a group xobject's bbox. #14241

Merged
