
Fix intermittent unit/reference test failures #12123

Closed · 6 of 7 tasks
timvandermeij opened this issue Jul 26, 2020 · 21 comments
@timvandermeij (Contributor) commented Jul 26, 2020

To aid debugging, pull request #12124 also included the browser name in the logs for the unit tests.

The following is a list of unit test failures we have seen:

It's clear from this overview that most problems are related to unexpected action abortions, both in Chrome/Firefox and on Windows/Linux. This wasn't happening a few days ago, so maybe something regressed? The second one is also interesting since we explicitly catch the exception in the unit test.
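
An exception being reported as unhandled even though the test explicitly catches it can happen in plain JavaScript whenever a rejected promise has more than one derived chain. A minimal, hypothetical sketch (unrelated to the actual pdf.js code):

```javascript
// Minimal sketch (not pdf.js code): explicitly catching a rejection on
// one branch of a promise does not cover other promises derived from it,
// so the runtime can still report an unhandled rejection.
let unhandled = 0;
process.on("unhandledRejection", () => {
  unhandled += 1;
});

const failing = Promise.reject(new Error("MissingPDFException"));

// This branch is explicitly handled...
failing.catch(() => {});

// ...but this derived promise rejects too and has no handler attached,
// which is what surfaces as "Unhandled promise rejection" in the logs.
failing.then(() => "never reached");
```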

Pull request #12125 tries to improve the situation for the reference test failures.

@timvandermeij changed the title from "Intermittent unit/reference test failures on the bots" to "Intermittent unit/reference test failures" on Jul 26, 2020
@timvandermeij changed the title from "Intermittent unit/reference test failures" to "Fix intermittent unit/reference test failures" on Jul 26, 2020
@timvandermeij (Contributor, Author) commented:

It seems to have something to do with commit 47ab676, which is also from the first PR where we noticed the issues. If I revert that commit, the unit tests on Linux seem to pass all the time. Strange...

@Snuffleupagus (Collaborator) commented:

> If I revert that commit, the unit tests on Linux seem to pass all the time.

Most likely the updated Jasmine packages are responsible in that case. Perhaps we should just revert those changes for now, and also pin the packages at the old ~3.5.0 versions?

(Obviously that wouldn't really fix things, only hide the errors for now, and future updates would be blocked until the problems have been identified and addressed.)

@timvandermeij (Contributor, Author) commented:

I have opened #12125, which should solve the problems for now so that the unit tests all pass again. This issue will remain open for the follow-up work, because those upgrades need to be unblocked.

@timvandermeij (Contributor, Author) commented Jul 26, 2020

I filed an upstream issue for Jasmine: jasmine/jasmine#1840

@timvandermeij (Contributor, Author) commented Jul 27, 2020

I received a reply from the Jasmine developers, and it seems to be a pre-existing issue in our unit tests that only surfaced in 3.6 because of behavior changes in Jasmine. The idea is to track down the test that causes it by making it reproducible, for example by running the tests until failures happen while keeping track of the random seed so we can replay the run. Another approach is to track it down backwards, i.e., find what can throw the AbortException and trace it back to a particular test, but that may be more difficult.
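
The first approach can be sketched roughly like this. This is a hypothetical Node illustration, not pdf.js tooling: `runTests` is a stand-in for the real test runner (e.g. `gulp unittest`), and the "failure" is simulated deterministically so the loop's behavior is visible.

```javascript
// Hypothetical sketch of "run the tests until a failure happens while
// keeping track of the random seed". `runTests` is a stand-in for the
// real runner; here it deterministically "fails" on the third run.
function runTests(seed) {
  return seed !== 3; // pretend the intermittent failure hits this seed
}

let failingSeed = null;
for (let run = 1; run <= 100; run++) {
  const seed = run; // in practice: a fresh random seed for every run
  if (!runTests(seed)) {
    failingSeed = seed;
    console.log(`Failure on run ${run}; replay with seed ${seed}.`);
    break;
  }
}
```

Once a failing seed is recorded, the same spec ordering can be replayed until the culprit test is isolated.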

@timvandermeij (Contributor, Author) commented:

Only one unit test failure remains; once that is fixed, the Jasmine update is unblocked.

@Snuffleupagus (Collaborator) commented Aug 1, 2020

TEST-UNEXPECTED-FAIL | creates pdf doc from non-existent URL | in chrome | Unhandled promise rejection: MissingPDFException: Missing PDF "http://127.0.0.1:38165/test/pdfs/non-existent.pdf".

I've been able to reproduce this a handful of times locally with Jasmine 3.6.1, even on Windows, but it's extremely intermittent[1] to the point that I cannot do any meaningful debugging :-(


[1] Something like 1-2 failures for 100 runs, even with a constant Jasmine seed.

@timvandermeij (Contributor, Author) commented:

I'll also give this a try. Perhaps it's more easily reproduced on Linux for some reason, or we can find a way to make it more reproducible so it's hopefully easier to find a fix.

@timvandermeij (Contributor, Author) commented Aug 1, 2020

I'm not having any "luck" so far with reproducing the failure at all on Linux. If you still have it, could you post the random seed so I can try with that?

@Snuffleupagus (Collaborator) commented Aug 1, 2020

> If you still have it, could you post the random seed so I can try with that?

I don't think that the seed actually matters though, since I've seen the intermittent failure for different ones.

Also, I forgot to mention that (almost) all of my testing was done directly in the browser, using http://localhost:8888/test/unit/unit_test.html?spec=api%20getDocument%20creates%20pdf%20doc%20from%20non-existent%20URL, which means that only the affected test runs and the seed should thus be (mostly) irrelevant. Still, I only managed something like a ~1% failure rate.

Edit: When testing in the browser, don't forget to run `TESTING=true gulp generic` first (and also after making any changes in `src/core/` files) so that the worker is used. Otherwise you're probably not going to be able to reproduce this at all, given just how different the "fakeWorker" code paths are.

My apologies, #12123 (comment) wasn't entirely clear on the details.

@timvandermeij (Contributor, Author) commented:

Ah, that helps, thanks! I was worried that the problem wouldn't show up if I ran just that one particular test, perhaps because another test might influence this one, but it's good to know that you managed to reproduce it by running that single test, since that makes things easier. I had also not used the `TESTING=true` environment variable (I just ran `gulp unittest` once and then re-ran the tests in the browser window, with some code to not close the window after the test), so that may explain my lack of observed test failures ;-)

@Snuffleupagus (Collaborator) commented Aug 1, 2020

> I just ran `gulp unittest` once

Note that that command actually uses `TESTING=true gulp generic` under the hood; I just find it more convenient to use the latter format directly when running unit tests manually in the browser.


Edit: Also, I suspect that the source of the intermittent unit-test failure might be related to the timing of destroying the worker-thread `MessageHandler` instance and/or the `webWorker` instance itself.
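
To illustrate the kind of teardown race being suspected, here is a purely hypothetical sketch; the class and names below are invented for illustration and are not the actual pdf.js `MessageHandler`. If pending requests are rejected during destroy after the caller has stopped listening, the rejection surfaces as unhandled; attaching the rejection handler before teardown avoids that.

```javascript
// Hypothetical stand-in for a message handler with pending requests; not
// the actual pdf.js implementation.
class FakeMessageHandler {
  constructor() {
    this.pendingRejects = new Set();
  }

  // Returns a promise that stays pending until a response arrives or the
  // handler is destroyed.
  request() {
    let rejectFn;
    const promise = new Promise((resolve, reject) => {
      rejectFn = reject;
    });
    this.pendingRejects.add(rejectFn);
    return promise;
  }

  // Destroying the handler rejects all pending requests. If a caller has
  // not (yet) attached a rejection handler at this point, the rejection
  // is reported as unhandled -- the suspected timing issue.
  destroy(reason) {
    for (const reject of this.pendingRejects) {
      reject(reason);
    }
    this.pendingRejects.clear();
  }
}

// Safe ordering: the catch handler is attached before destroy() runs, so
// the rejection is observed instead of becoming unhandled.
const handler = new FakeMessageHandler();
let caught = null;
const pending = handler.request().catch(err => {
  caught = err.message;
});
handler.destroy(new Error("Worker was destroyed"));
```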

@timvandermeij (Contributor, Author) commented:

The remaining intermittent failure didn't happen for me locally at all, but happens more often on the bots. If it's related to ordering, we now at least have a run with a random seed: http://54.67.70.0:8877/e79cd113d2ab405/output.txt

@brendandahl (Contributor) commented:

For the issue where text on page 11 of the tracemonkey document disappears, I was able to create a "smaller" test case and file a bug with Chromium: https://bugs.chromium.org/p/chromium/issues/detail?id=1146296

@timvandermeij (Contributor, Author) commented:

That's great news! Let's hope the Chrome team can resolve this.

@timvandermeij (Contributor, Author) commented:

It looks like a fix is hopefully coming soon given that the upstream bugs are resolved. We'll keep an eye on this.

@timvandermeij (Contributor, Author) commented Dec 19, 2020

The fix mentioned above didn't change the majority of the intermittent failures, unfortunately. However, I do notice that recent browser updates (automatic for Firefox, and through Puppeteer updates for Chrome) resolved the few intermittent failures that happened in Firefox, so I don't see any Firefox reference tests failing intermittently anymore. Chrome on Windows also improved, with around six failures remaining, but Chrome on Linux is still problematic.

@brendandahl (Contributor) commented:

I don't think Chrome has been updated in Puppeteer to the fixed version yet. It should be in version 88.0.43XX, and the bots are still on an older version.

@brendandahl (Contributor) commented:

We're now on Chrome 89.0.4389.0. It looks like a number of the Windows issues have been fixed, but there are now more intermittent Linux failures. They seem to be mainly small pixel changes, versus the old behavior where text would be completely missing.

@timvandermeij (Contributor, Author) commented:

The most recent Puppeteer update seems to have resolved most issues, but there are still a few intermittent ones, mainly in Chrome, albeit much fewer than before. The unit test mentioned above seems to fail a bit less often, but still fails from time to time.

@timvandermeij (Contributor, Author) commented:

Closing since I've now opened a new issue with the remaining test failures for a better overview.
