Add logic to track rendering area of various PDF ops #19043

nicolo-ribaudo · 2024-11-14T12:32:47Z

I started working towards #6419. This PR introduces the logic to track where different elements of the PDF are rendered, and hooks it up to the debugger since @calixteman mentioned that it would be useful.

I'm marking this as draft because there are a few changes I need to make:

- change the various methods in canvas.js to receive the index as a param, rather than returning a function that takes the index
- clean up the "dependencies tracking", since currently it's all over the place. Ideally most of this logic should be self-contained in CanvasRecorder, so that when not recording it doesn't have a performance impact.
- improve the dependency tracking (so far I'm only tracking some of them)
- do not track extra dependencies (for example, a stroke path doesn't depend on the fill color)
- track object dependencies
- fix image dependencies tracking for transform (currently there is a .setTransform that makes it get lost)

However, I'd love to receive feedback on the direction.

Commit 1:

Add logic to track rendering area of various PDF ops

This commit is a first step towards #6419, and it can also help with
#13287. To support rendering part of a page, we will need to
first compute which ops can affect what is visible in that part of
the page.

This commit adds logic to track "groups of ops" with their respective
bounding boxes. Each group either corresponds to a single op or
to a range, and it can have dependencies earlier in the ops list that
are not contiguous to the range.

Consider the following example:
0. setFillRGBColor
1. beginText
2. showText "Hello"
3. endText
4. constructPath [...]
5. eoFill
here we have two groups: the text (range 1-3) and the path (range 4-5).
Each of them has a corresponding bounding box, and a dependency
on the op at index 0.
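
As a rough illustration of how such groups could be used to pick the ops needed for one part of the page, here is a minimal sketch. The group shape (`start`, `end`, `dependencies`, `bbox`), the helper names and the bounding box numbers are all made up for this example; they are not the data structures used in this PR.

```javascript
// Hypothetical representation of the two groups from the example above.
// The bbox coordinates are invented for illustration.
const groups = [
  // Text: ops 1-3, depends on op 0 (setFillRGBColor).
  { start: 1, end: 3, dependencies: [0], bbox: { minX: 10, minY: 10, maxX: 120, maxY: 30 } },
  // Path: ops 4-5, also depends on op 0.
  { start: 4, end: 5, dependencies: [0], bbox: { minX: 200, minY: 300, maxX: 260, maxY: 360 } },
];

// True if two axis-aligned boxes overlap.
function intersects(a, b) {
  return a.minX < b.maxX && b.minX < a.maxX && a.minY < b.maxY && b.minY < a.maxY;
}

// Returns the sorted op indices needed to render `region`: every op of each
// group whose bbox overlaps the region, plus that group's dependencies.
function opsForRegion(groups, region) {
  const needed = new Set();
  for (const group of groups) {
    if (!intersects(group.bbox, region)) {
      continue;
    }
    for (let i = group.start; i <= group.end; i++) {
      needed.add(i);
    }
    for (const dep of group.dependencies) {
      needed.add(dep);
    }
  }
  return [...needed].sort((a, b) => a - b);
}

const topLeft = { minX: 0, minY: 0, maxX: 150, maxY: 50 };
console.log(opsForRegion(groups, topLeft)); // ops 0, 1, 2, 3: the text and its fill color
```

With these made-up boxes, a region covering only the text skips ops 4 and 5 but still includes op 0, because the text group depends on it.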

This tracking happens when first rendering a PDF: we wrap the canvas
with a "canvas recorder" that has the same API, but with additional
methods to mark the start/end of a group.
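
To make the "same API, plus start/end markers" idea concrete, here is a simplified sketch based on a `Proxy`. It is not the recorder implemented in this PR: `createRecorder`, `startGroup`, `endGroup` and `getGroups` are hypothetical names, only `fillRect` contributes to the bounding box, and transforms are ignored.

```javascript
// Illustrative only: forwards every call to a 2D-context-like object and
// accumulates a bounding box between startGroup() and endGroup().
function createRecorder(ctx) {
  const groups = [];
  let current = null;

  // Extra methods that the wrapped context does not have.
  const extras = {
    startGroup(opIndex) {
      current = { start: opIndex, end: opIndex, bbox: null };
    },
    endGroup(opIndex) {
      current.end = opIndex;
      groups.push(current);
      current = null;
    },
    getGroups() {
      return groups;
    },
  };

  // Extend the current group's bbox with a rectangle (no transform applied).
  function grow(x, y, w, h) {
    const b = current.bbox ?? { minX: Infinity, minY: Infinity, maxX: -Infinity, maxY: -Infinity };
    b.minX = Math.min(b.minX, x, x + w);
    b.minY = Math.min(b.minY, y, y + h);
    b.maxX = Math.max(b.maxX, x, x + w);
    b.maxY = Math.max(b.maxY, y, y + h);
    current.bbox = b;
  }

  return new Proxy(ctx, {
    get(target, prop) {
      if (Object.prototype.hasOwnProperty.call(extras, prop)) {
        return extras[prop];
      }
      const value = target[prop];
      if (typeof value !== "function") {
        return value;
      }
      // Wrap methods so they run on the real context, recording on the way.
      return (...args) => {
        if (prop === "fillRect" && current) {
          grow(...args);
        }
        return value.apply(target, args);
      };
    },
    set(target, prop, value) {
      // Write properties (fillStyle, lineWidth, ...) straight to the context.
      target[prop] = value;
      return true;
    },
  });
}

// A stand-in for a real canvas context, so the sketch runs outside a browser.
const calls = [];
const fakeCtx = {
  fillStyle: "#000",
  fillRect(x, y, w, h) {
    calls.push(["fillRect", x, y, w, h]);
  },
};

const recorder = createRecorder(fakeCtx);
recorder.startGroup(4);
recorder.fillRect(10, 20, 30, 40);
recorder.fillRect(0, 50, 5, 5);
recorder.endGroup(5);
console.log(recorder.getGroups());
// one group: start 4, end 5, bbox { minX: 0, minY: 20, maxX: 40, maxY: 60 }
```

A real recorder would also need to take the current transform and the other drawing methods into account; the sketch only shows the wrapping pattern.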

Commit 2:

Hook up the ops bbox logic to the pdf debugger

When hovering over a step in the pdf debugger, it now:

- highlights the steps in the same group
- highlights the steps that they depend on
- highlights the bounding box on the PDF itself
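
The lookup behind this hover behavior can be sketched as follows. This is a hedged illustration, not the debugger code from this PR: `highlightsForStep` and the group shape (`start`, `end`, `dependencies`, `bbox`) are hypothetical, and the bbox values are invented.

```javascript
// Hypothetical recorded groups (same example as in the first commit message).
const recordedGroups = [
  { start: 1, end: 3, dependencies: [0], bbox: { minX: 10, minY: 10, maxX: 120, maxY: 30 } },
  { start: 4, end: 5, dependencies: [0], bbox: { minX: 200, minY: 300, maxX: 260, maxY: 360 } },
];

// Given the index of the hovered step, returns what to highlight:
// the steps of its group, the steps the group depends on, and the bbox.
// Returns null if the step does not belong to any group.
function highlightsForStep(groups, stepIndex) {
  const group = groups.find(g => stepIndex >= g.start && stepIndex <= g.end);
  if (!group) {
    return null;
  }
  const sameGroup = [];
  for (let i = group.start; i <= group.end; i++) {
    sameGroup.push(i);
  }
  return { sameGroup, dependencies: [...group.dependencies], bbox: group.bbox };
}

console.log(highlightsForStep(recordedGroups, 2));
// sameGroup: steps 1-3, dependencies: step 0, bbox of the text group
```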

This is an example of what the debugger integration looks like (note: I couldn't figure out how to make my cursor show up in the recording 😅 I'm moving it over the steps list):

Screen.Recording.2024-11-14.at.16.35.58.mov

By default it doesn't show all the bounding boxes, because on some PDFs that is too much noise. If you tick the checkbox it shows the boxes, and you can click on a box to scroll the corresponding ops into view.


nicolo-ribaudo · 2024-12-17T17:20:28Z

master...nicolo-ribaudo:pdf.js:draw-page-portion-optimized is a branch merging this PR together with #19128. In the video below you can see that it first renders in the background a low-resolution image "the old way" taking 12 seconds, and then it renders the "detail view" on top taking only 1.4 seconds and only running one fifth of the PDF operations :)

Screen.Recording.2024-12-17.at.18.10.30.mp4

Still keeping this as draft because there are significant bugs (in the PDF I'm using for testing, it often skips rendering some pieces of text even if they are visible on screen, or it renders some paths with the wrong color), but it's nice to see some progress.


nicolo-ribaudo force-pushed the compute-bounding-boxes branch 4 times, most recently from 2475b16 to 5a6a877 Compare November 14, 2024 15:54

timvandermeij added core viewer performance labels Nov 17, 2024


nicolo-ribaudo mentioned this pull request Nov 29, 2024

Render high-res partial page views when falling back to CSS zoom #19128

Open

nicolo-ribaudo added 8 commits December 16, 2024 16:34

Hook up the ops bbox logic to the pdf debugger

626aa67

When using the pdf debugger, when hovering over a step now:

- it highlights the steps in the same groups
- it highlights the steps that they depend on
- it highlights on the PDF itself the bounding box

fixup! Add logic to track rendering area of various PDF ops

0da4bef

Fix tracking of clip operations

fixup! Add logic to track rendering area of various PDF ops

912ec06

Track transform dependencies

fixup! Add logic to track rendering area of various PDF ops

1485af7

Track miter limit as stroke dependency

Record save/restore groups

6027c0e

fixup! Add logic to track rendering area of various PDF ops

e7af90f

Actually, do not generate a bbox for clipping operations, since they render nothing on the canvas.

fixup! Add logic to track rendering area of various PDF ops

eac70e4

Use canvas width/height instead of Infinity for unknown

nicolo-ribaudo force-pushed the compute-bounding-boxes branch from 49c4689 to eac70e4 Compare December 16, 2024 15:34
