Use tile-based PDF rendering

Bug #1329087 reported by Martin Spacek
This bug affects 2 people
Affects: qpdfview
Status: Fix Released
Importance: Wishlist
Assigned to: Adam Reichold
Milestone: 0.4.11

Bug Description

I'm guessing that qpdfview does not use tile-based PDF rendering. At high zoom levels, panning and zooming seem quite slow (unless the cache size is set very high, and you happen to hit a cached view), as if the entire page is being unnecessarily rendered. This is a long-standing bug in evince, and it's not clear to me if poppler supports tile-based rendering yet:

https://bugzilla.gnome.org/show_bug.cgi?id=303365

Related branches

Changed in qpdfview:
status: New → Triaged
importance: Undecided → Wishlist
Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello Martin,

thanks for creating a bug report about this, and yes, qpdfview does render on a per-page level, which makes high scale factors unnecessarily memory-intensive. The API of Poppler itself does expose functions to render arbitrary sub-rectangles of a page's bounding box, so the problem is one of development time and complexity. (You could even render each tile in a separate thread using the Qt frontend.) All related functionality in qpdfview, like caching and obsolete pixmaps, assumes per-page granularity, so even if the tiled rendering itself would be rather straightforward, its interaction with other features is not.
In any case, it is mainly a question of someone finding the time to factor out rendering from PageItem and RenderTask into an intermediate TileItem class. (As it stands, I will probably not find the necessary spare time to do this.)

Best regards, Adam.

Changed in qpdfview:
assignee: nobody → Adam Reichold (adamreichold)
status: Triaged → In Progress
Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello again,

I gave this some thought and actually started to implement the outlined solution using an intermediate TileItem class to handle rendering rectangular parts of the whole page with a predefined size. Currently this target tile size is hard-coded to 1024 pixels, but I guess making it a hidden setting, i.e. accessible only through the configuration file, is sensible as well.

Of course, this work is far from complete: I basically copied PageItem and stripped it down to rendering, then used several of those new items as children of the page to render parts of it. Hence, the code needs to be cleaned up all over the place to remove the redundancies between PageItem and TileItem. It also needs to be simplified w.r.t. the passing of rendering parameters from DocumentView down to RenderTask. And of course, it needs to be tested a lot, since this will probably introduce a gazillion new corner cases into the rendering logic.

There is also a more insidious issue: If prefetching is enabled, a lot of render tasks might be in-flight and in need of cancellation when one goes from a large to a small zoom factor (i.e. from many to few tiles per page). Even when implementing forced cancellation to override the prefetching, this can block the interface for several seconds. I am not sure how to solve this. Maybe the amount of prefetching needs to be adjusted w.r.t. the amount of tiling or the cancellation concept has to be rethought. (But we really can't delete a TileItem before its RenderTask is gone, since it holds a reference to the model page.)

Please test and report back here. Best regards, Adam.

@Benjamin: This branch seems like a good candidate for some tests using Foobar.pdf, maybe you can drop me a copy again... ;-)

Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello again,

the branch starts to come together: the settings are there, most of the redundancies are gone, and the deletion problem is solved by making obsolete tile items try to delete themselves whenever the event loop is idle, but only doing so if their render task has already finished. (I also added overlap to the tiles to prevent rendering artifacts due to pixel rounding errors.)

There do remain two serious problems which IMHO prevent this from being universally useful: The obsolete pixmaps functionality is still missing and I am not sure how it can be implemented at all. Also, updating a tile will trigger updates of the neighboring tiles as well, hence if not all visible tiles are cached, they will continuously trigger repaints of each other which never stop. (I have not yet found a way to tell QGraphicsView to repaint one and only one item, and depending on how its compositing works, this might not be possible at all. Maybe using Z ordering in between the tiles could help here.)

Best regards, Adam.

Revision history for this message
Martin Spacek (mspacek) wrote :

Wow, that was quick! Nice! OK, I've tried it out on my LaTeX thesis, which is full of big figures, both raster and vector graphics. Note that I have prefetch turned on and set to 1, cache is 1024 MB, obsolete pixmaps are kept, and I tend to use continuous 2-page mode. Some impressions:

1. When zooming in or out with prefetch on, overall rendering time is as slow, or even slower than in trunk. Turning prefetch off fixes this. Actually, with prefetch off, the tile-based branch feels faster than trunk. I already like it better!

2. Would it be possible to display the tiles that rendered the fastest (mostly just text) first, so that they aren't held up by the slower tiles (mostly figures)? A kind of race to the finish line? Right now, I get the impression that doesn't happen. Tiles generally seem to paint left to right, and mostly top-down too, even when the top left is dominated by a figure.

3. Is the tile size in screen pixels? Would it not make more sense to have it in page units, say a few centimeters on a side, something like 1/6 or 1/8 of a (US letter) page? Could tiles be rectangular? It might make more sense to adjust the aspect ratio of the tiles for each page to match the aspect ratio of the page.

4. Actually, the tiles already seem to be rectangular with the right aspect ratio, maybe 1024 pix high, and the proportionally smaller amount (8.5/11) wide. However, when I zoom in, the tile size seems to increase up to a point, and then drops again once less than a certain number of tiles are on screen. Hm, so perhaps they are more page-based than screen-based after all?

5. I've set pixel overlap to 0, and I haven't seen any rendering artifacts.

6. What are the prospects for multithreaded tile rendering and making use of all the cores?

7. Making obsolete pixmaps work with tile-based rendering would definitely be great. Right now, if the tile isn't cached on a zoom, it's blank :(

I'll continue using the tile-rendering branch and report any other problems.

Revision history for this message
Martin Spacek (mspacek) wrote :

Also, are tiles cached when you switch from one zoom level to another? With cache now set to 2048 MB and prefetch off, tiles definitely don't seem to be cached when I alternate back and forth between two adjacent zoom levels. It would be nice if they were. In comparison, tiles do seem to be cached within a given zoom level, i.e. subsequent panning over the same part of the document is fast after the first render at a given zoom level.

Revision history for this message
Adam Reichold (adamreichold) wrote : Re: [Bug 1329087] Re: Use tile-based PDF rendering

Hello,

Am 13.06.2014 19:47, schrieb Martin Spacek:
> Wow, that was quick! Nice! OK, I've tried it out on my LaTeX thesis,
> which is full of big figures, both raster and vector graphics. Note that
> I have prefetch turned on and set to 1, cache is 1024 MB, obsolete
> pixmaps are kept, and I tend to use continuous 2-page mode. Some
> impressions:

Note that keeping obsolete pixmaps does nothing in the tiled rendering
branch.

> 1. When zooming in or out with prefetch on, overall rendering time is as
> slow, or even slower than in trunk. Turning prefetch off fixes this.
> Actually, with prefetch off, the tile-based branch feels faster than
> trunk. I already like it better!

The interaction of prefetching with tiling is definitely suboptimal: The
problem is that at high scale factors, a single page will consist of a
lot of tiles, and hence prefetching two pages will result in a lot of
render tasks that can flood the thread pool. So prefetching should be
adjusted w.r.t. tiling, but that would break encapsulation, as the
page is currently the only one who knows that it is tiled. So
prefetching is an open point, as is keeping obsolete pixmaps. (As I said
earlier, tiling itself is rather straightforward; it is the interaction
with other functions that is problematic.)

> 2. Would it be possible to display the tiles that rendered the fastest
> (mostly just text) first, so that they aren't held up by the slower
> tiles (mostly figures)? A kind of race to the finish line? Right now, I
> get the impression that doesn't happen. Tiles generally seem to paint
> left to right, and mostly top-down too, even when the top left is
> dominated by a figure.

The tiles are not displayed in any particular order, the render task
that finishes first is displayed first. So the apparent order in your
case just indicates that there is no real difference in rendering time
and the tiles finish in the same order as they are started. (Rendering
only 1/8 of a page does not take 1/8 of the time to render the whole
page, rendering tiles does not scale linearly at least for Poppler.)

> 3. Is the tile size in screen pixels? Would it not make more sense to
> have it in page units, say a few centimeters on a side, something like
> 1/6 or 1/8 of a (US letter) page? Could tiles be rectangular? It might
> make more sense to adjust the aspect ratio of the tiles for each page to
> match the aspect ratio of the page.

The tile size is in screen pixels, since this determines the amount of
memory and processing time the tiles will consume, i.e. I currently aim
at 4 MB tiles. And since memory consumption and processing power are the
main problems the tiles need to solve, this is more useful than fixing
their physical dimensions. This also implies that the same page (of a
given physical size) will need a very different number of tiles
depending on the scale factor, as it should be. (For example, it makes
no sense to tile up the thumbnails since they will yield small images
for the whole page anyway.)

Adjusting their aspect ratio should not be a problem... (But it will
also not really change anything performance-wise.)

> 4. Actually, the tiles already seem to be rectangular with the right
> aspe...


Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello,

Am 13.06.2014 19:58, schrieb Martin Spacek:
> Also, are tiles cached when you switch from one zoom level to another?
> With cache now set to 2048 MB and prefetch off, tiles definitely don't
> seem to be cached when I alternate back and forth between two adjacent
> zoom levels. It would be nice if they were. In comparison, tiles do seem
> to be cached within a given zoom level, i.e. subsequent panning over the
> same part of the document is fast after the first render at a given zoom
> level.
>

As with pages, different scale factors are not cached. For simplicity,
the cache is one-way associative with the page (and now tile) items, so
each item has one slot in the cache, which is invalidated if the item
needs to be rendered at a different scale factor. Let's say this is a
different possibility for improvement...

Regards, Adam.

Revision history for this message
Adam Reichold (adamreichold) wrote :

> The tiles are definitely square, e.g. 1024px times 1024px. But at the
> right and bottom edges, they are only as large as they have to be and
> hence may be rectangular as well. E.g. a page that will be displayed
> within a rectangle of 1500px times 750px will get two columns (1024px
> and 476px) and a single row (750px). Of course, one could try to tile
> up the pages more evenly...

I failed to explain my own algorithm, as we actually do tile evenly, using the tile size target only to determine the column and row count. So in the example the column widths are (750px and 750px) and the row height is still 750px...

Revision history for this message
Adam Reichold (adamreichold) wrote :

Ok, so the tiles are now (branch revision 1579) adjusted using the page aspect ratio, with the configured tile size as an upper bound...

Revision history for this message
Martin Spacek (mspacek) wrote :

> Ok, so the tiles are now (branch revision 1579) adjusted using the page aspect ratio, with the configured tile size as an upper bound...

Great! Looks to me like each page is now more likely to be composed of an integer number of tiles. Either 1x1, or 2x2, or 3x3, etc., depending on zoom level. That should make it render a little faster overall now, no?

I didn't know poppler already gave multithreading for free! I'm running Xubuntu 12.10, which uses an older poppler 0.20.4. I'll try booting into Xubuntu 14.04 and see if I get more core use.

Revision history for this message
Martin Spacek (mspacek) wrote :

> As with pages, different scale factors are not cached.

OK, shall I open up a separate bug for this?

Revision history for this message
Martin Spacek (mspacek) wrote :

"The tiles are not displayed in any particular order, the render task
that finishes first is displayed first. So the apparent order in your
case just indicates that there is no real difference in rendering time
and the tiles finish in the same order as they are started. (Rendering
only 1/8 of a page does not take 1/8 of the time to render the whole
page, rendering tiles does not scale linearly at least for Poppler.)"

I wonder if I'm always seeing column-major tile painting (despite some tiles definitely being much slower to render than others) because I'm using an older single-threaded version of Poppler...

"The tile size is screen pixels since this determines the amount of
memory and processing time they will consume, i.e. I currently aim at 4
MB tiles. "

OK, so is it safe to say that it's the rendering destination size in screen pixels that mostly determines how long it takes to render a tile (and how much RAM it takes), and not the fraction of the page being rendered by that tile?

Revision history for this message
Martin Spacek (mspacek) wrote :

"since the tiling changes with the scale factor it can't
be the tiles that continue to display the old pixmaps. The page item
would need to collect all tiles at a given scale factor, compose them
into a page image and transform them to yield the obsolete pixmap
approximation of the soon to be displayed page. Not sure how practical
this is or whether I'll find another solution to this."

But, since the tiling (say 2x2 per page) seems to remain the same over a reasonable range of zoom levels, couldn't you reuse those old pixmaps during those zoom changes, and only throw them out when the tiling changes (say to 1x1 or 3x3)?

Revision history for this message
Adam Reichold (adamreichold) wrote :

Am 13.06.2014 22:43, schrieb Martin Spacek:
>> As with pages, different scale factors are not cached.
>
> OK, shall I open up a separate bug for this?
>

You may of course open as many bug reports as you like, but without more
contributors, it is unlikely that such feature requests will get any
real attention soon. I'd say having a look at the code and trying to
create a patch is more likely to yield tangible results...

Revision history for this message
Adam Reichold (adamreichold) wrote :

Am 13.06.2014 22:49, schrieb Martin Spacek:
> "The tiles are not displayed in any particular order, the render task
> that finishes first is displayed first. So the apparent order in your
> case just indicates that there is no real difference in rendering time
> and the tiles finish in the same order as they are started. (Rendering
> only 1/8 of a page does not take 1/8 of the time to render the whole
> page, rendering tiles does not scale linearly at least for Poppler.)"
>
> I wonder if I'm always seeing column-major tile painting (despite some
> tiles definitely being much slower to render than others) because I'm
> using an older single-threaded version of Poppler...

Yes, the internal locks which are necessary to ensure that only one
thread calls into the older Poppler API would effectively serialize the
rendering.

> "The tile size is screen pixels since this determines the amount of
> memory and processing time they will consume, i.e. I currently aim at 4
> MB tiles. "
>
> OK, so is it safe to say that it's the rendering destination size in
> screen pixels that mostly determines how long it takes to render a tile
> (and how much RAM it takes), and not the fraction of the page being
> rendered by that tile?
>

They definitely determine the amount of memory. With processing time it
is difficult to say: Everything that is rasterization in some sense will
have its run time determined by the amount of pixels to fill. Other
steps that have more to do with parsing and/or transforming the page
structure into graphical primitives can have widely different
dependencies on the tile size, including none at all.
So limiting the pixel dimensions should be the best approximation in
most cases and is the only thing that will allow rendering at large
scale factors without thrashing memory.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Am 13.06.2014 22:58, schrieb Martin Spacek:
> "since the tiling changes with the scale factor it can't
> be the tiles that continue to display the old pixmaps. The page item
> would need to collect all tiles at a given scale factor, compose them
> into a page image and transform them to yield the obsolete pixmap
> approximation of the soon to be displayed page. Not sure how practical
> this is or whether I'll find another solution to this."
>
> But, since the tiling (say 2x2 per page) seems to remain the same over a
> reasonable range of zoom levels, couldn't you reuse those old pixmaps
> during those zoom changes, and only throw them out when the tiling
> changes (say to 1x1 or 3x3)?
>

We could probably do that. Adding a setting to always force a 1x1 tiling
and making sure that the feature always works in that case and simply
degrades whenever the tiling changes seems like a good idea. I'll have a
look into it...

Btw., what did using Ubuntu 14.04's Poppler 0.24.5 yield in terms of
multithreading?

Revision history for this message
Martin Spacek (mspacek) wrote :

"We could probably do that. Adding a setting to always force a 1x1 tiling
and making sure that the feature always works in that case and simply
degrades whenever the tiling changes seems like a good idea. I'll have a
look into it..."

Hmm, not sure I understand. If you're currently at a zoom level where 3x3 tiling is used, and you zoom in or out a bit to another level that also still uses 3x3, why would you fall back to (low-res?) 1x1 pixmaps instead of the previously rendered 3x3?

"Btw., what did using Ubuntu 14.04's Poppler 0.24.5 yield in terms of
multithreading?"

Actually, I thought I'd try building the latest Poppler from git (pretty much 0.26.1), and surprisingly it worked! Or at least it seemed to, despite my old Xubuntu 12.10 install. I changed the path to the new poppler in Makefile.pdf-plugin (to /usr/local/lib), and rebuilt qpdfview. Compiler output confirmed qpdfview was using the new poppler in the new path, but strangely it doesn't seem to multithread much. When rapidly zooming in and out over and over, I get at most 24% CPU usage for the process, which is only about 2 cores (I have 8 hyperthreaded cores). It might be slightly faster than before, but it's only barely noticeable.

I realized today that much of the speed difference compared to evince (which is using the old poppler on my system) on the same pdf could be due to evince's use of cairo. The politics of the qt cairo renderer in poppler are very unfortunate.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Ok, branch revision 1583 adds a simplified version of keeping obsolete pixmaps that is compatible with tiling, but it works only for effective scale factor changes and not for e.g. rotations. I am not sure whether this is a problem?

So we have a pro for the branch: It has tiling, and via the setting this can be reduced to the primitive always-tile-1x1 case. And as cons: Prefetching together with tiling is problematic, and keeping obsolete pixmaps will lose some functionality even if tiling is turned off. Not sure whether this situation would allow merging to trunk...

Revision history for this message
Martin Spacek (mspacek) wrote :

Great! That seems to work! I would say if the user rotates the pdf, all bets are off. Having obsolete pixmaps in that case is kind of a luxury, unless of course it's easy to implement. Rotation isn't a very common thing to do, is it?

"Prefetching together with tiling is problematic."

Yes. Could the prefetch distance N be changed to be units of tiles away instead of units of pages away? Something like "for each tile touching the edge of the current viewport, prefetch up to N tiles in the direction(s) outward from the current viewport". In that case, you might want to increase N to get the same level of prefetching as when tiling is disabled and N is set to 1 page (i.e., one 1x1 tile).

With tiling turned off (the default?) and prefetch turned on, everything seems to work the way it used to, which is great. But yes, when tiling is turned on, prefetch slows things down. At least for now, that's the user's fault for enabling them both.

"Keeping obsolete pixmaps will lose some functionality even if tiling is turned off"

I'm afraid I don't understand what you mean by this. You mean the loss of obsolete pixmaps during a rotation? Not a big deal IMO.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Am 14.06.2014 10:24, schrieb Martin Spacek:
> "We could probably do that. Adding a setting to always force a 1x1 tiling
> and making sure that the feature always works in that case and simply
> degrades whenever the tiling changes seems like a good idea. I'll have a
> look into it..."
>
> Hmm, not sure I understand. If you're currently at a zoom level where
> 3x3 tiling is used, and you zoom in or out a bit to another level that
> also still uses 3x3, why would you fall back to (low-res?) 1x1 pixmaps
> instead of the previously rendered 3x3?

No, what I mean by degrading is that if the tiling stays the same
(e.g. always 1x1 or the zoom change is small), then you will get
obsolete pixmaps, but if the tiling (and hence the relative positions of
the tiles on the page) changes, the pixmaps are dropped and you
immediately get the progress indicator. (That the relative positions of
the tiles w.r.t. the (unscaled) page have to stay the same is also the
reason for it currently not working with rotations. Maybe I can lift all
tile coordinates to the scene level and use the same transformations as
for the non-tiled branch, but I am not sure how to do this yet.)

> "Btw., what did using Ubuntu 14.04's Poppler 0.24.5 yield in terms of
> multithreading?"
>
> Actually, I thought I'd try building the latest Poppler from git (pretty
> much 0.26.1), and surprisingly it worked! Or at least it seemed to,
> despite my old Xubuntu 12.10 install. I changed the path to the new
> poppler in Makefile.pdf-plugin (to /usr/local/lib), and rebuilt
> qpdfview. Compiler output confirmed qpdfview was using the new poppler
> in the new path, but strangely it doesn't seem to multithread much. When
> rapidly zooming in and out over and over, I get at most 24% CPU usage
> for the process, which is only about 2 cores (I have 8 hyperthreaded
> cores). It might be slightly faster than before, but it's only barely
> noticeable.

It may not have worked as you expected: The available version of Poppler
is determined using pkg-config, so you either need to change your
pkg-config search path or you need to manually add the defines
"HAS_POPPLER_XY", c.f. lines 27 to 32 of "pdf-plugin.pro".

(The intended way of changing the used Poppler library manually is to
set "CONFIG+=without_pkgconfig" and add "PDF_PLUGIN_DEFINES",
"PDF_PLUGIN_INCLUDEPATH" and "PDF_PLUGIN_LIBS" to "qpdfview.pri".)

You also need to check that the linked libraries are really resolved to
"/usr/local/lib" before running the binary, e.g. try running "ldd" on
the binary and adjust "LD_LIBRARY_PATH" accordingly.

> I realized today that much of the speed difference compared to evince
> (which is using the old poppler on my system) on the same pdf could be
> due to evince's use of cairo. The politics of the qt cairo renderer in
> poppler are very unfortunate.
>

The Cairo backend is often faster than the Splash backend used
internally by Poppler, but it also lacks some features, most notably
multithreaded rendering. A complete Arthur backend would probably be the
most useful thing to have, but people lack the time to implement it.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Am 14.06.2014 11:05, schrieb Martin Spacek:
> "Prefetching together with tiling is problematic."
>
> Yes. Could the prefetch distance N be changed to be units of tiles
> away instead of units of pages away? Something like "for each tile
> touching the edge of the current viewport, prefetch up to N tiles
> in the direction(s) outward from the current viewport". In that
> case, you might want to increase N to get the same level of
> prefetching as when tiling is disabled and N is set to 1 page
> (i.e., one 1x1 tile).

All of this is possible, but it would make things rather ugly insofar
as the document view, which is responsible for prefetching pages, would
need to know about tiling, and that would result in a much messier
architecture. (Especially since the pages themselves do not, and do not
need to, know whether they are visible or not, as they are scene items
and not views like the document view.)

> With tiling turned off (the default?) and prefetch turned on,
> everything seems to work the way it used to, which is great. But
> yes, when tiling is turned on, prefetch slows things down. At least
> for now, that's the user's fault for enabling them both.

A solution might be to give the prefetching threads a lower priority
but I am not sure how effective this is when the thread pool's queue
is full of tasks.

> "Keeping obsolete pixmaps will lose some functionality even if
> tiling is turned off"
>
> I'm afraid I don't understand what you mean by this. You mean the
> loss of obsolete pixmaps during a rotation? Not a big deal IMO.
>

Yes, I mean that obsolete pixmaps won't work for rotations anymore.
Which probably isn't that bad, but it is definitely a regression.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Ok, if there are no big objections and we find no more large problems or regressions, I'd say we let the branch mature for a day or two and then I'll merge this into trunk so it gains some wider testing.

Changed in qpdfview:
milestone: none → 0.4.11
Revision history for this message
Martin Spacek (mspacek) wrote :

Sounds good.

"The Cairo backend is often faster than the Splash backend used
internally by Poppler, but it also lacks some features, most notably
multithreaded rendering."

Ah, that complicates things. Oh well.

Speaking of multithreaded poppler, you were right. It seems the version of poppler I compiled from git wasn't being linked to properly. I think all I had to do to fix it was run "sudo ldconfig". Now, "pkg-config --modversion poppler" gives 0.26.1. After cleaning out the qpdfview folder and remaking and compiling, ldd ./qpdfview (still) doesn't list poppler in its output, but the binary runs fine. It's now using up to 50% CPU, and is noticeably even faster. Not sure what I'd have to do to get it to use 100%. Perhaps poppler is checking for real cores (4) instead of virtual ones (8).

Revision history for this message
Benjamin Eltzner (b-eltzner) wrote :

Hi, just as a brief remark: I also do not care much for the use of obsolete pixmaps when rotating, since I rarely use that feature and, if so, only once on a document.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Am 14.06.2014 13:14, schrieb Martin Spacek:
> Sounds good.
>
> "The Cairo backend is often faster than the Splash backend used
> internally by Poppler but it also lacks some features, most
> notable multithreaded rendering."
>
> Ah, that complicates things. Oh well.
>
> Speaking of multithreaded poppler, you were right. It seems the
> version of poppler I compiled from git wasn't being linked to
> properly. I think all I had to do to fix it was run "sudo
> ldconfig". Now, "pkg-config --modversion poppler" gives 0.26.1.
> After cleaning out the qpdfview folder and remaking and compiling,
> ldd ./qpdfview (still) doesn't list poppler in its output, but the
> binary runs fine. It's now using up to 50% CPU, and is noticeably
> even faster. Not sure what I'd have to do to get it to use 100%.
> Perhaps poppler is checking for real cores (4) instead of virtual
> ones (8).
>

Ah, it's "libqpdfview_pdf.so" that links to Poppler directly.
Depending on the document and on how many pages/tiles are visible (if
prefetching is off; at least without tiling, prefetching and
multithreading work really well together), you might not get a higher
utilization because of lock contention within Poppler or the load just
not being large enough.

Revision history for this message
Martin Spacek (mspacek) wrote :

> Ah, it's "libqpdfview_pdf.so" that links to Poppler directly.

Yup, ldd on that reveals the linked poppler library.

After setting the display mode to 3 pages wide and scrolling around like mad, I managed to get the process' CPU usage up to 72%, so I think all is well.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Merged into trunk, so that it hopefully sees some more widespread testing via the dailydebs.

Changed in qpdfview:
status: In Progress → Fix Committed
Revision history for this message
Martin Spacek (mspacek) wrote :

Great! Thanks for all your work on this.

Revision history for this message
Martin Spacek (mspacek) wrote :

I just noticed that with tiling overlap set to 0 (not recommended, I know), tile size set to 1024, tiling enabled, obsolete pixmaps enabled, prefetch disabled, and cache set to 0 or 8MB, I finally started seeing single pixel wide rendering artifacts, as shimmering vertical or horizontal lines at around the tile borders. It's worse at 0MB cache, with the artifact constantly present. At 8MB it's more transient. When the artifact is present, the process maxes out one core. The artifact seems to go away with cache sizes 16MB or more. This holds for several different PDFs.

If I set overlap to 16, things actually get worse. At 0MB cache, I now get two rapidly shimmering vertical and/or horizontal lines (probably the overlapping tile borders), plus flashing portions of the tile placeholders. CPU usage is now maxed out at 2 cores. Again, if I set cache to 8MB, it's not quite so bad. The higher I set the overlap, the further apart the shimmering lines.

Is this to be expected?

By the way, after some very informal timing, I've found that a tile size of 2048 pix is a bit more efficient than 1024, but that probably depends on screen resolution and window size. This is basically full screen on a 1920x1080 screen.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello Martin,

Am 16.06.2014 09:45, schrieb Martin Spacek:
> I just noticed that with tiling overlap set to 0 (not recommended, I
> know), tile size set to 1024, tiling enabled, obsolete pixmaps enabled,
> prefetch disabled, and cache set to 0 or 8MB, I finally started seeing
> single pixel wide rendering artifacts, as shimmering vertical or
> horizontal lines at around the tile borders. It's worse at 0MB cache,
> with the artifact constantly present. At 8MB it's more transient. When
> the artifact is present, the process maxes out one core. The artifact
> seems to go away with cache sizes 16MB or more. This holds for several
> different PDFs.
>
> If I set overlap to 16, things actually get worse. At 0MB cache, I now
> get two rapidly shimmering vertical and/or horizontal lines (probably
> the overlapping tile borders), plus flashing portions of the tile
> placeholders. CPU usage is now maxed out at 2 cores. Again, if I set
> cache to 8MB, it's not quite so bad. The higher I set the overlap, the
> further apart the shimmering lines.
>
> Is this to be expected?

Let's say expected but not intended. :-/

The reason for this is that Qt's QGraphicsView will not invalidate (and
hence repaint) items but rather rectangular areas of the view. This
means that if tiles overlap by even just one pixel (which they must, to
avoid artefacts), an update of one tile will trigger an update of the
neighbouring tiles, which will in turn trigger an update of the tiles
that were updated originally, and so on and so on.
This is worse without caching, since getting a tile rendered and its
result actually onto the screen takes at least one paint operation to
trigger the rendering, followed by an update and a repaint to actually
draw the result. With caching, a tile that was rendered once will be
drawn immediately in subsequent repaints as long as it stays in the
cache.

In summary, tiling without a certain amount of caching is currently
pretty useless (one of the reasons it is disabled by default for the
time being). The only real solution I currently see is to use a
different graphics architecture than QGraphicsView, but that would
essentially imply a rewrite... (I started experimenting with Qt5's
Quick2 and its scene graph this weekend, but this would also mean
leaving behind a lot of platforms that do not yet have a Qt5 port, and
I also stumbled into some problems that seem to prevent a
straightforward implementation.)

> By the way, after some very informal timing, I've found that a tile size
> of 2048 pix is a bit more efficient than 1024, but that probably depends
> on screen resolution and window size. This is basically full screen on a
> 1920x1080 screen.
>

Well, before anything else, I'd say that without some sort of systematic
benchmarking this will be hard to decide and even then the optimal value
will probably very much depend on the actual machine used.

But thinking heuristically, a larger tile size will probably be more
efficient since it will mean less time spent handling tiles instead of
rendering content and also that less content will be processed
redundantly for the different tiles. But then, no tiling will be most
efficient from this perspe...


Revision history for this message
Martin Spacek (mspacek) wrote :

"...an update of one tile will trigger an update of the
neighbouring tiles which will again trigger an update of tiles that was
updated originally and so on and so on."

Yes, makes sense. Thanks for the explanation. There's no reason to have a tiny cache size, I was just wondering. So this artifact is different from the one you mentioned before when overlap=0?

"Long story short, I don't think it helps much to meddle with this value
without any benchmarks to support it."

Oh yes, I agree. I wasn't suggesting changing the default size, just thinking out loud. Ideally, if you're not panning, you'd want a single tile that exactly matches the size and position of the viewport, right? That way, you aren't rendering any unnecessary pixels. But of course, we do want to be able to pan quickly; to do that, we need a cache, and the cache needs to be made up of non-overlapping tiles. In normal use the viewport will never sit at exactly the position of a single tile, so there's some kind of tradeoff between tile size and the number of tiles to render per average viewport position. From messing around on my system, it seems the optimal tile size is something close to the maximum viewport dimension. But yeah, without systematic benchmarking, it's hard to say.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Am 16.06.2014 22:13, schrieb Martin Spacek:
> "...an update of one tile will trigger an update of the
> neighbouring tiles which will again trigger an update of tiles that was
> updated originally and so on and so on."
>
> Yes, makes sense. Thanks for the explanation. There's no reason to have
> a tiny cache size, I was just wondering. So this artifact is different
> from the one you mentioned before when overlap=0?

Yes, there are two distinct problems: continuous updating with small
cache sizes because of the overlap, and persistent artefacts because of
missing overlap. I have observed the latter as "struck-out" text, i.e.
missing rows or columns a single pixel wide.

But I admittedly cannot reproduce them with the current version of the
code, which might be a side effect of separating the tile's bounding
rectangle (floating-point coordinates) from the rectangle used for
tiled rendering (integer coordinates). I'll keep tileOverlap=0 and
maybe we can change the default value before the next release...

> "Long story short, I don't think it helps much to meddle with this value
> without any benchmarks to support it."
>
> Oh yes, I agree. I wasn't suggesting changing the default size, just
> thinking out loud. Ideally, if you're not panning, you'd want a single
> tile that exactly matches the size and position of the viewport, right?
> That way, you aren't rendering any unnecessary pixels. But of course, we
> do want to be able to pan quickly, and to do that, we need a cache, and
> the cache needs to be made up of non-overlapping tiles, and in normal
> use the viewport will never be set at exactly the position of a single
> tile, so there's some kind of tradeoff between tile size and number of
> tiles to render per average viewport position. From messing around on my
> system, it seems the optimal tile size is something close to the
> maximum viewport dimension. But yeah, without systematic benchmarking,
> it's hard to say.
>

Revision history for this message
Benjamin Eltzner (b-eltzner) wrote :

Hi, sorry for commenting only so late. I have some performance problems with the tiled rendering:

1. I have a dual-core machine, but tiled rendering consistently uses only about one core. (I have a very heavy document to test with.)

2. Tiled rendering takes much longer (a factor of up to roughly 5) to render a single page than the usual page-wise rendering.

3. When turning tile-based rendering off, apparently all tiles for the current page are still rendered before the program switches back to page-wise rendering. This can lead to an appreciable delay.

I don't know if any of these points can be reasonably addressed, I just wanted to share my observations.

Revision history for this message
Martin Spacek (mspacek) wrote :

1. Are you using a fairly recent version of poppler? (see above)

2. Hm. Have you tried the latest from trunk? This branch is already outdated.

3. I've found that every time I change something in the preferences, or just reload a PDF, there can be a delay before the page(s) are fully displayed. But I think this was the case before the tile-based rendering code was added.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello Benjamin,

thanks for sharing your observations. Performance regressions in a change that is supposed to improve performance are definitely bad. :-( I assume that you use your own Poppler packages, which are therefore very recent, and the latest dailydebs built from trunk?

A change of the preferences will definitely refresh the document, i.e. completely destroy all page objects and reload the document from disk, hence rendering will start from scratch as if you had just opened the document. Does this explain the delay you noticed?

Concerning points 1 and 2, do you see performance regressions in normal usage when tiling is turned off? (That would be a blocker to releasing this code, IMHO.) Have you checked that CPU utilization is higher with tiling turned off, and that you are at a scale factor where several tiles will actually be used? (In principle, rendering a page as four tiles can take longer than rendering it in one go if parallelism is hindered by locks inside Poppler, but hopefully not by a factor of five.)

Best regards, Adam.

P.S.: Could you share that heavy document with me via Jabber? ;-)

Revision history for this message
Adam Reichold (adamreichold) wrote :

Ok, using the document Benjamin sent me, I can confirm that it is noticeably slow when using tiling, but I don't see any differences in CPU utilization. I suppose this document is so image-heavy, and Poppler's decoding routines so little parallelized in this case, that rendering the whole page takes as long as rendering just a quarter of it. That seems plausible, since those scanned images always cover the whole page and will probably be decoded completely for each quarter of it.

Revision history for this message
Benjamin Eltzner (b-eltzner) wrote :

As I use Debian unstable, I am currently on Poppler 0.24.5. I see no change in performance without tiling, so this is not a regression; the problems occur only in tiled mode. I used the dailydeb that was current at the time of my report. Without tiling, both of my cores run at 100% load, while with tiled rendering they noticeably remain in the range of 50% load.

Your explanation seems very reasonable to me.

As a side remark: when using tiled rendering, the "progress symbol" from the KDE icon set covers part of the already rendered regions to the left of an unrendered tile. :-)

Revision history for this message
Adam Reichold (adamreichold) wrote :

(That must be a huge icon then... :-))

Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello again,

after using "tileOverlap=0" on two different systems with Qt4 and Qt5 for a few days without seeing those static artefacts, and after reviewing the code again (since I separated the bounding rectangle from the tile rectangle, which should eliminate rounding issues), I would feel comfortable removing the tile overlap. (Thinking about it again, this is also why the icon overlaps with the already rendered tiles: because of - well - the tile overlap. ;-))

Has anyone else seen any static rendering artefacts with tile overlap disabled? If not, I would remove this in trunk.

Best regards, Adam.

Revision history for this message
Martin Spacek (mspacek) wrote :

Sounds good to me. I've had tileOverlap=0 the whole time, and I've never seen the artifact you described.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello again,

Just before turning off the computer, I looked something up in a book which I have lying around as a DjVu file and, guess what, I noticed some sluggishness while zooming. I tried a few other files and compared with version 0.4.10, and there was a definite difference between tiling enabled and disabled. I then tried the presentation view, which became completely unresponsive upon a scale change. After some profiling, it turns out that QGraphicsView really has a problem with the page items having tile items as children that overlap all the time (a lot of cycles are spent finding parent items and checking visibility). So much for turning off the computer... :-(

As of trunk revision 1600, I think I have solved the problem by completely removing the tiles from the graphics item hierarchy and reducing them to helper objects for rendering and caching. Of course, the page items now have to check for themselves which part of them is exposed and draw only the appropriate tiles to the view.

Summarizing this, we definitely need more testing with a wider variety of documents (and maybe some automated benchmarks using Qt's test framework). The rewritten tiling also needs to be retested as if this were the first commit. :-( Please try it out and compare the interface responsiveness with version 0.4.10 if possible. Thanks!

Best regards, Adam.

Revision history for this message
Martin Spacek (mspacek) wrote :

Wow, lots of changes! OK, I'm running it now. I can't say I've noticed any change in zoom speed on my (not so heavy) PDFs. The only thing I've noticed is several of these messages on the command line:

Object::connect: No such signal qpdfview::PageItem::linkClicked(int,qreal,qreal)

I'm not sure what triggered them, and I can't seem to reproduce them at the moment.

Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello,

Am 23.06.2014 07:37, schrieb Martin Spacek:
> Wow, lots of changes! OK, I'm running it now. Can't say I've noticed any
> change in zoom speed on my (not so heavy) PDFs. Only thing I've noticed
> is several of these messages in the command line:

My impression is that it is worse with non-uniform page sizes, but I
really don't understand the exact reason, since I haven't profiled the
internals of QGraphicsView.

> Object::connect: No such signal
> qpdfview::PageItem::linkClicked(int,qreal,qreal)
>
> I'm not sure what triggered them, and I can't seem to reproduce them at
> the moment.

A bug in the presentation view. :-( Since the "linkClicked" signal of
PageItem changed its signature, the corresponding connection within the
presentation view class no longer matches, and there would be no
working links within the presentation view. (Released at least in
version 0.4.10, with the fix being in trunk revision 1603.)

Best regards, Adam.

Revision history for this message
Martin Spacek (mspacek) wrote :

OK, thanks. That reminds me, why is there all that separate code for the presentation view? How is presentation view any different from fullscreen single page non-continuous view?

Revision history for this message
Adam Reichold (adamreichold) wrote :

Hello,

Am 23.06.2014 15:41, schrieb Martin Spacek:
> OK, thanks. That reminds me, why is there all that separate code for the
> presentation view? How is presentation view any different from
> fullscreen single page non-continuous view?
>

the presentation view used to be much, much simpler, without even
using the PageItem class, but then features crept in, and when the time
for caching and prefetching came, I just replaced its internals with a
stripped-down version of the document view.

So today, there is no good reason other than that nobody has yet
replaced the thing with a simple "m_presentationMode" flag as is used
in PageItem. And I do think we could save quite some code here, but it
depends on how messy DocumentView would get when all the relevant
functionality is folded into it behind conditions on
"m_presentationMode".

Best regards, Adam.

Changed in qpdfview:
status: Fix Committed → Fix Released