Verify correctness of Taskcluster PR checks #13194

jugglinmike · 2018-09-24T19:26:25Z

In gh-12657, we configured TaskCluster to validate patches submitted via pull request on GitHub.com. Typically, failing validation jobs prevent reviewers from merging pull requests. The new TaskCluster integration was unproven, and we recognized that it may have bugs. To avoid interrupting ongoing work with spuriously-failing jobs, we integrated it to allow failures.

TravisCI has been running equivalent jobs for a far longer time; we persisted those jobs so that we would continue to receive notifications for errors in pull requests.

The TaskCluster integration has been running for over a week, so we should have enough information to judge its correctness.

Collect the results of the TaskCluster tasks and TravisCI "pull request" jobs since gh-12657, and compare them. Note that due to subtle differences in the tasks, a certain kind of discrepancy is acceptable.

jugglinmike · 2018-09-24T19:29:39Z

Forgot to mention the goals:

allow the TaskCluster tasks to influence whether pull requests can be merged
remove the equivalent jobs from the TravisCI configuration

Hexcles · 2018-09-25T17:48:59Z

I changed the title to clarify that this issue is mostly about PR checks on Taskcluster.

On the other hand, I'm verifying the master builds on Taskcluster against BuildBot.

foolip · 2018-11-02T09:44:16Z

@jugglinmike does the script for this exist yet? I'm sort of eager to turn off ci_stability.py in Travis :)

jugglinmike · 2018-11-07T01:10:56Z

The script is available in a standalone project which includes some documentation on installation and running:

https://github.com/bocoup/wpt-validate-taskcluster

The initial results were a little shaky, so I reported them to the Taskcluster maintainers. They got back to me yesterday. The biggest takeaway is that that many of those "shaky" results were due to stragglers--pull requests that were based on a commit that pre-dated our change to enable Taskcluster. A primitive way to keep that problem from influencing the data is to limit the query to the past 3 weeks.

$ node . --between 2018-10-15:2018-11-06 --format markdown --discrepancies

That also limits the data, but we'll still have close to 500 commits to work with.

All discrepancies

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13942	moz-wptsync-bot	`e26e776`	PASS	FAIL	FAIL	FAIL
gh-13929	moz-wptsync-bot	`1d98d96`	FAIL		FAIL
gh-13926	pyup-bot	`3e96501`		PASS		PASS
gh-13907	youennf	`af0b381`	PASS		PASS
gh-13906	soareschen	`1937a04`	PASS		FAIL
gh-13905	youennf	`ff33b7e`	PASS		PASS
gh-13904	youennf	`8da49cd`	PASS		PASS
gh-13904	youennf	`92297ec`	PASS		PASS
gh-13903	youennf	`2acc419`	PASS	PASS	PASS	FAIL
gh-13899	youennf	`59e5213`	PASS	FAIL	FAIL	FAIL
gh-13886	soareschen	`f0d1b04`	PASS	FAIL	FAIL	FAIL
gh-13886	soareschen	`34a8296`	PASS		FAIL
gh-13886	soareschen	`5150b9b`	PASS	FAIL	FAIL	FAIL
gh-13884	autofoolip	`bf19490`	PASS		PASS
gh-13880	jugglinmike	`26a3c32`	PASS	PASS	FAIL	PASS
gh-13880	jugglinmike	`99f8707`	PASS	PASS	FAIL	PASS
gh-13878	lukebjerring	`340b297`	PASS	FAIL	PASS	FAIL
gh-13878	lukebjerring	`c4f9ad6`	PASS	FAIL	PASS	FAIL
gh-13877	Hexcles	`55eb739`	PASS	FAIL	PASS	FAIL
gh-13869	youennf	`d72af25`	PASS	FAIL	FAIL	FAIL
gh-13864	chromium-wpt-export-bot	`c6db0ea`	FAIL	FAIL	PASS	FAIL
gh-13862	zdobersek	`fc92447`	PASS		PASS
gh-13859	marcoscaceres	`a6311d7`	FAIL	PASS	FAIL	PASS
gh-13842	akansha2608	`38deacc`	PASS		PASS
gh-13842	akansha2608	`b9a47dc`	PASS		PASS
gh-13838	moz-wptsync-bot	`16eed42`	PASS	PASS	FAIL
gh-13827	lukebjerring	`c0592a4`	FAIL	PASS	FAIL
gh-13823	yoavweiss	`5bf797d`	PASS		FAIL
gh-13822	chromium-wpt-export-bot	`b3bed6b`	PASS	PASS	FAIL	PASS
gh-13819	ewilligers	`1a925d7`	PASS		PASS
gh-13812	akansha2608	`6f63f30`	FAIL	PASS	PASS	PASS
gh-13804	moz-wptsync-bot	`342f7cf`	PASS	PASS	FAIL
gh-13804	moz-wptsync-bot	`4db1377`	FAIL		FAIL
gh-13789	cvazac	`854f470`	PASS	PASS	PASS	FAIL
gh-13787	chromium-wpt-export-bot	`c169e95`	PASS	FAIL	PASS	PASS
gh-13781	christian-bromann	`0a36538`	PASS		PASS
gh-13781	christian-bromann	`7548d3c`	PASS		PASS
gh-13767	chromium-wpt-export-bot	`995cdc7`	PASS		PASS
gh-13757	moz-wptsync-bot	`ee3ff96`	FAIL		FAIL
gh-13748	ewilligers	`3c50079`	PASS		PASS
gh-13744	autofoolip	`6d259c9`	PASS		FAIL
gh-13740	moz-wptsync-bot	`852fe69`	PASS	PASS	PASS	FAIL
gh-13738	gsnedders	`874b6a0`	PASS		PASS
gh-13729	domfarolino	`c0c5084`	PASS	FAIL	PASS	PASS
gh-13727	chromium-wpt-export-bot	`623f033`	PASS	FAIL	PASS	PASS
gh-13723	moz-wptsync-bot	`1ddb18e`	PASS	FAIL	PASS	FAIL
gh-13696	qiuzhong	`b5b38b0`	FAIL	PASS	FAIL	PASS
gh-13689	chromium-wpt-export-bot	`1066be1`	PASS	FAIL	PASS	PASS
gh-13681	servo-wpt-sync	`a0438c4`	PASS		PASS
gh-13677	ewilligers	`18477d4`	PASS		PASS
gh-13671	domfarolino	`6e6d6be`	PASS	PASS	FAIL	PASS
gh-13669	alanwaketan	`d1ae1b3`	PASS	PASS	FAIL	PASS
gh-13666	chromium-wpt-export-bot	`a2b2b90`	PASS	PASS	FAIL	PASS
gh-13664	servo-wpt-sync	`6bf7422`	FAIL		FAIL
gh-13663	chromium-wpt-export-bot	`d0df0bd`	PASS	PASS	PASS	FAIL
gh-13640	chromium-wpt-export-bot	`f2c4c0f`	PASS	FAIL	PASS	PASS
gh-13627	moz-wptsync-bot	`a5199e7`	PASS		PASS
gh-13616	rachelandrew	`e28b31f`	PASS		PASS
gh-13616	rachelandrew	`a823191`	PASS		PASS
gh-13616	rachelandrew	`ed322df`	FAIL		FAIL
gh-13616	rachelandrew	`659a97e`	FAIL		FAIL
gh-13616	rachelandrew	`2e263df`	FAIL		FAIL
gh-13615	annevk	`fd1d2c5`	PASS	PASS	PASS	FAIL
gh-13611	lukebjerring	`ac016d8`	PASS	PASS	PASS	FAIL
gh-13606	woaiwyhty	`267f7dc`	PASS		PASS
gh-13606	woaiwyhty	`d1ae1b3`	PASS	PASS	FAIL	PASS
gh-13605	chromium-wpt-export-bot	`07d77be`	PASS		PASS
gh-13603	chromium-wpt-export-bot	`f01a681`	PASS		PASS
gh-13597	chromium-wpt-export-bot	`4f05c01`	PASS	FAIL	PASS	PASS
gh-13589	frivoal	`267f7dc`	PASS		PASS
gh-13587	moz-wptsync-bot	`07052a8`	PASS	PASS	PASS	FAIL
gh-13579	chromium-wpt-export-bot	`1e81a41`	PASS	PASS	PASS	FAIL
gh-13566	foolip	`2ee886e`	PASS		PASS
gh-13556	autofoolip	`40dfbac`	PASS		PASS
gh-13550	chromium-wpt-export-bot	`dffb313`	PASS		PASS
gh-13549	chromium-wpt-export-bot	`067c5e2`	PASS	FAIL	PASS	PASS
gh-13548	pyup-bot	`ade02ef`	PASS		PASS
gh-13528	chromium-wpt-export-bot	`27ebdcb`	PASS	FAIL	PASS	PASS
gh-13518	yoavweiss	`78f5a14`	PASS		PASS
gh-13517	annevk	`7495dc2`	PASS		PASS
gh-13511	servo-wpt-sync	`2d87392`	PASS		PASS

In summary:

Total pull requests: 386
Total commits: 580
Tested commits: 485
Contested commits (Chrome): 14
Contested commits (Firefox): 18
Contested commits (both): 49

In other words, of the commits which were validated by TravisCI and/or Taskcluster, 17% had discrepancies between the two systems. I've classified those 81 discrepancies, but doing so involved some additional interpretation, so bear with me:

Taskcluster was silent or 29 of these commits, but 3 of those were due to the "stragglers" problem described above.
We had a deficiency in our Taskcluster configuration a few weeks after we enabled pull request validation. Before we fixed it, Taskcluster would needlessly re-validate commits in response to irrelevant git/GitHub events. In many cases, this would occur just as the pull request was being merged, and Taskcluster would fail because it would be unable to fetch the pull request. Of the 8 commits which Taskcluster was unable to fetch, 4 did not have the fix in their ancestry. The other 4 seem to be caused by tomfoolery from bots or from @lukebjerring.
Taskcluster timed out while validating 5 commits. In every case, TravisCI also timed out.
The majority of discrepancies concerned the stability results themselves. Although Taskcluster seems to be better at identifying instability (possibly owing to the aforementioned difference in heuristics), the problem is inherently non-deterministic. That means some discrepancy along these lines is to be expected, so we should interpret this as neutral.

✅ TravisCI missed the commit (1 commit)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13926	pyup-bot	`3e96501`		PASS		PASS

❌ GitHub missed the results (2 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13904	youennf	`8da49cd`	PASS		PASS
gh-13605	chromium-wpt-export-bot	`07d77be`	PASS		PASS

❌ Taskcluster timed out (5 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox
gh-13929	moz-wptsync-bot	`1d98d96`	FAIL		FAIL
gh-13838	moz-wptsync-bot	`16eed42`	PASS	PASS	FAIL
gh-13804	moz-wptsync-bot	`342f7cf`	PASS	PASS	FAIL
gh-13804	moz-wptsync-bot	`4db1377`	FAIL		FAIL
gh-13757	moz-wptsync-bot	`ee3ff96`	FAIL		FAIL

✅ Missing the "event filtering" fix (4 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	TravisCI: Firefox
gh-13566	foolip	`2ee886e`	PASS	PASS
gh-13548	pyup-bot	`ade02ef`	PASS	PASS
gh-13517	annevk	`7495dc2`	PASS	PASS
gh-13511	servo-wpt-sync	`2d87392`	PASS	PASS

✅ Closed before Taskcluster completed (4 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox
gh-13827	lukebjerring	`c0592a4`	FAIL	PASS	FAIL
gh-13767	chromium-wpt-export-bot	`995cdc7`	PASS		PASS
gh-13603	chromium-wpt-export-bot	`f01a681`	PASS		PASS
gh-13550	chromium-wpt-export-bot	`dffb313`	PASS		PASS

✅ Taskcluster not enabled on base branch (3 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	TravisCI: Firefox
gh-13616	rachelandrew	`e28b31f`	PASS	PASS
gh-13616	rachelandrew	`a823191`	PASS	PASS
gh-13518	yoavweiss	`78f5a14`	PASS	PASS

❌ Taskcluster missed the commit (26 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	TravisCI: Firefox
gh-13907	youennf	`af0b381`	PASS	PASS
gh-13906	soareschen	`1937a04`	PASS	FAIL
gh-13905	youennf	`ff33b7e`	PASS	PASS
gh-13904	youennf	`92297ec`	PASS	PASS
gh-13886	soareschen	`34a8296`	PASS	FAIL
gh-13884	autofoolip	`bf19490`	PASS	PASS
gh-13862	zdobersek	`fc92447`	PASS	PASS
gh-13842	akansha2608	`38deacc`	PASS	PASS
gh-13842	akansha2608	`b9a47dc`	PASS	PASS
gh-13823	yoavweiss	`5bf797d`	PASS	FAIL
gh-13819	ewilligers	`1a925d7`	PASS	PASS
gh-13781	christian-bromann	`0a36538`	PASS	PASS
gh-13781	christian-bromann	`7548d3c`	PASS	PASS
gh-13748	ewilligers	`3c50079`	PASS	PASS
gh-13744	autofoolip	`6d259c9`	PASS	FAIL
gh-13738	gsnedders	`874b6a0`	PASS	PASS
gh-13681	servo-wpt-sync	`a0438c4`	PASS	PASS
gh-13677	ewilligers	`18477d4`	PASS	PASS
gh-13664	servo-wpt-sync	`6bf7422`	FAIL	FAIL
gh-13627	moz-wptsync-bot	`a5199e7`	PASS	PASS
gh-13616	rachelandrew	`ed322df`	FAIL	FAIL
gh-13616	rachelandrew	`659a97e`	FAIL	FAIL
gh-13616	rachelandrew	`2e263df`	FAIL	FAIL
gh-13606	woaiwyhty	`267f7dc`	PASS	PASS
gh-13589	frivoal	`267f7dc`	PASS	PASS
gh-13556	autofoolip	`40dfbac`	PASS	PASS

✅ Stability discrepancy - Taskcluster **found** instability that TravisCI missed (26 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13942	moz-wptsync-bot	`e26e776`	PASS	FAIL	FAIL	FAIL
gh-13899	youennf	`59e5213`	PASS	FAIL	FAIL	FAIL
gh-13886	soareschen	`f0d1b04`	PASS	FAIL	FAIL	FAIL
gh-13886	soareschen	`5150b9b`	PASS	FAIL	FAIL	FAIL
gh-13878	lukebjerring	`340b297`	PASS	FAIL	PASS	FAIL
gh-13878	lukebjerring	`c4f9ad6`	PASS	FAIL	PASS	FAIL
gh-13877	Hexcles	`55eb739`	PASS	FAIL	PASS	FAIL
gh-13869	youennf	`d72af25`	PASS	FAIL	FAIL	FAIL
gh-13787	chromium-wpt-export-bot	`c169e95`	PASS	FAIL	PASS	PASS
gh-13729	domfarolino	`c0c5084`	PASS	FAIL	PASS	PASS
gh-13727	chromium-wpt-export-bot	`623f033`	PASS	FAIL	PASS	PASS
gh-13723	moz-wptsync-bot	`1ddb18e`	PASS	FAIL	PASS	FAIL
gh-13689	chromium-wpt-export-bot	`1066be1`	PASS	FAIL	PASS	PASS
gh-13640	chromium-wpt-export-bot	`f2c4c0f`	PASS	FAIL	PASS	PASS
gh-13597	chromium-wpt-export-bot	`4f05c01`	PASS	FAIL	PASS	PASS
gh-13549	chromium-wpt-export-bot	`067c5e2`	PASS	FAIL	PASS	PASS
gh-13528	chromium-wpt-export-bot	`27ebdcb`	PASS	FAIL	PASS	PASS
gh-13903	youennf	`2acc419`	PASS	PASS	PASS	FAIL
gh-13864	chromium-wpt-export-bot	`c6db0ea`	FAIL	FAIL	PASS	FAIL
gh-13789	cvazac	`854f470`	PASS	PASS	PASS	FAIL
gh-13740	moz-wptsync-bot	`852fe69`	PASS	PASS	PASS	FAIL
gh-13663	chromium-wpt-export-bot	`d0df0bd`	PASS	PASS	PASS	FAIL
gh-13615	annevk	`fd1d2c5`	PASS	PASS	PASS	FAIL
gh-13611	lukebjerring	`ac016d8`	PASS	PASS	PASS	FAIL
gh-13587	moz-wptsync-bot	`07052a8`	PASS	PASS	PASS	FAIL
gh-13579	chromium-wpt-export-bot	`1e81a41`	PASS	PASS	PASS	FAIL

✅ Stability discrepancy - Taskcluster **missed** instability that TravisCI found (10 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13880	jugglinmike	`26a3c32`	PASS	PASS	FAIL	PASS
gh-13880	jugglinmike	`99f8707`	PASS	PASS	FAIL	PASS
gh-13859	marcoscaceres	`a6311d7`	FAIL	PASS	FAIL	PASS
gh-13822	chromium-wpt-export-bot	`b3bed6b`	PASS	PASS	FAIL	PASS
gh-13812	akansha2608	`6f63f30`	FAIL	PASS	PASS	PASS
gh-13696	qiuzhong	`b5b38b0`	FAIL	PASS	FAIL	PASS
gh-13671	domfarolino	`6e6d6be`	PASS	PASS	FAIL	PASS
gh-13669	alanwaketan	`d1ae1b3`	PASS	PASS	FAIL	PASS
gh-13666	chromium-wpt-export-bot	`a2b2b90`	PASS	PASS	FAIL	PASS
gh-13606	woaiwyhty	`d1ae1b3`	PASS	PASS	FAIL	PASS

So if you buy my hand-wavy excuses, only 33 commits over the past three weeks (i.e. 7%) are concerning.

Next steps

7% seems pretty good, but we probably want to do better than that.

"GitHub missed the results" This is rare but still worth understanding. It didn't come up in my original report to the Taskcluster team, so I've updated that bug with some information along those lines.

"Taskcluster timed out" We set our own limits for maximum build duration. We can certainly set it higher, but I personally don't have a sense for what value is appropriate.

"Taskcluster missed the commit" These are the most concerning, but they're also the hardest to investigate. Missed commits were my primary motivation for bugging the Taskcluster maintainers since I assumed they would have access to additional logging information. Their feedback already helped us understand some of the discrepancies, but they're not done! They classified 3 of the commits I originally reported team as "mysterious." They've improving logging, and they're continuing the investigation, so we could stand to wait a little longer to see if that turns up any new information.

Script used to identify "stragglers" (for enabling validation)

#!/bin/bash

pr_validation_enabled=1ede22e333195e47e13260a58ce89ef3220c19c7

values='
13907,af0b3812c550cc3fabd7c39da5f057a0d4a94556
13906,1937a04bf6b3c183d0fcd53e552fe9ecb969119b
13905,ff33b7ebc5772eed2bbe415f832cb9173cebb8f5
13904,92297ec317f6be7222ccca4a095e345e8cb9dd14
13886,34a8296fc37dc68681334aaded3f5609eee5b555
13884,bf194907e3d1f2676bc0314bc2dc1020b8fefe70
13862,fc92447525ca55ea8f21d26f327f541d5d169025
13842,38deacc55a3424561011ba5e981c30ddf449758b
13842,b9a47dc778f80c237cbbdede6243b29b470e2948
13823,5bf797d1753108033e6f8694cefee901ce240ac4
13819,1a925d7b1c2eaf33db23cbde57ccd12c8b49629c
13781,0a36538808bea9c77624a43219091880d908a578
13781,7548d3c76a9e8c05573dcb9898eb0911d851d971
13748,3c50079362e3dc0855fafd30b5e1d66056910e0d
13744,6d259c91b1498f145ed6fbbf6795fd46ba17e007
13738,874b6a0c4247acf57f4fd8d4ba8f75fc3766b819
13681,a0438c4b7bf34e22dc2dacd34c20f160a15fa55e
13677,18477d46836c59d24aa40259822af622f367264d
13664,6bf7422c8aaae6c86d6d4b65a2e4ba8dcb686c25
13627,a5199e740165877f316430a1fa67e0b8bdde7d88
13616,e28b31f5671fa68392b47dd33895bf3fb34f1c6a
13616,a823191597d9aca7b96091bd66c88f007ef711ef
13616,ed322df2d11feec5946baa9fa115cd60118021dc
13616,659a97e7c1f7ca640ebfdc147c514825f83e56bf
13616,2e263dfb07b25a88e66d76994e8d04da1786a4a8
13606,267f7dcb3019ca3a4483bc81de7270e37e0d7421
13589,267f7dcb3019ca3a4483bc81de7270e37e0d7421
13556,40dfbac0dd1d7d75cf06665430b41b810dcf8dfe
13518,78f5a144ac27066228d688a347608b0775c12d1c
'

for value in ${values}; do
  pr_number=$(echo ${value} | cut -d ',' -f 1)
  sha=$(echo ${value} | cut -d ',' -f 2)

  git fetch upstream refs/pull/${pr_number}/head > /dev/null 2>&1

  if ! git merge-base --is-ancestor ${pr_validation_enabled} ${sha}; then
    echo Pull request based on outdated master: https://github.com/web-platform-tests/wpt/pulls/${pr_number} $sha
  fi
done

Script used to identify "stragglers" (for filtering events)

#!/bin/bash

events_filtered=4f2038853851cb7c58986bc0c93478efa9ce1a62

values='
13827,c0592a41661e70dde422e5626d21375fc578e3c3
13767,995cdc7ad9f5beeaf35fc0ea01e52e5417feca56
13603,f01a6815810145bbef6f41b9467bb00a750e09b8
13566,2ee886e56916352276a7674ebcdaf8f8dbd561e5
13550,dffb31352e8110fd65d7ce247e7fcafe4326ad5f
13548,ade02ef5c6f917d6d1d491f62304804704a040fc
13517,7495dc272f0abe8c610f3376d776d388d784dba0
13511,2d873920755ce9ccbd07bfc7ea2b9cafbdf9f193
'

for value in ${values}; do
  pr_number=$(echo ${value} | cut -d ',' -f 1)
  sha=$(echo ${value} | cut -d ',' -f 2)

  git fetch upstream refs/pull/${pr_number}/head > /dev/null 2>&1

  if ! git merge-base --is-ancestor ${events_filtered} ${sha}; then
    echo Pull request based on outdated master: https://github.com/web-platform-tests/wpt/pulls/${pr_number} $sha
  fi
done

foolip · 2018-11-13T12:25:42Z

@jugglinmike, do you think your conclusions support #14033 or at reversing the "dry run" vs. "for realz" roles of Travis vs. Taskcluster as suggested in #14033 (comment)?

jugglinmike · 2018-11-14T01:14:19Z

There are a few unknowns that make me reluctant to answer:

Will Taskcluster's reliability improve? When and by how much?
What is TravisCI's reliability?

I was content to wait for an answer to the former, but we can answer the latter ourselves, and that may be all we need to know.

...which made me realize my report above made a faulty assumption: that when TravisCI fails, it is always for legitimate reasons. We know that's not true, so I've updated the script to report build errors distinctly from stability failures.

For the same set of commits (i.e. those tested by TravisCI and/or Taskcluster between 2018-10-15 and 2018-11-06), 40 produced build errors. I've included them in a new version of the report. When judging the correctness of the stability job, we've been using TravisCI as our source of truth. To err on the side of caution, I've removed the relevant commits from the results regarding "stability discrepancies". However, I've maintained those commits in the other results (e.g. "Taskcluster missed the commit") because they are an issue regardless of the corresponding behavior in TravisCI.

✅ TravisCI reported a build error (40 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13942	moz-wptsync-bot	`e26e776`	PASS	FAIL	ERROR	FAIL
gh-13929	moz-wptsync-bot	`1d98d96`	ERROR		ERROR
gh-13914	markandrus	`683054c`	PASS	PASS	ERROR	FAIL
gh-13914	markandrus	`c959163`	PASS	PASS	ERROR	FAIL
gh-13908	youennf	`58364be`	PASS	PASS	ERROR	FAIL
gh-13906	soareschen	`1937a04`	PASS		ERROR
gh-13899	youennf	`59e5213`	PASS	FAIL	ERROR	FAIL
gh-13894	youennf	`c3be323`	PASS	PASS	ERROR	FAIL
gh-13893	youennf	`232c8cf`	PASS	PASS	ERROR	FAIL
gh-13893	youennf	`d3c64ff`	PASS	PASS	ERROR	FAIL
gh-13887	alvestrand	`34f7fa7`	ERROR	FAIL	PASS	PASS
gh-13886	soareschen	`e610400`	PASS	PASS	ERROR	FAIL
gh-13886	soareschen	`f0d1b04`	PASS	FAIL	ERROR	FAIL
gh-13886	soareschen	`34a8296`	PASS		ERROR
gh-13886	soareschen	`5150b9b`	PASS	FAIL	ERROR	FAIL
gh-13885	moz-wptsync-bot	`49cdc59`	ERROR	FAIL	ERROR	FAIL
gh-13880	jugglinmike	`26a3c32`	PASS	PASS	ERROR	PASS
gh-13880	jugglinmike	`99f8707`	PASS	PASS	ERROR	PASS
gh-13869	youennf	`d72af25`	PASS	FAIL	ERROR	FAIL
gh-13864	chromium-wpt-export-bot	`c6db0ea`	ERROR	FAIL	PASS	FAIL
gh-13838	moz-wptsync-bot	`16eed42`	PASS	PASS	ERROR
gh-13827	lukebjerring	`c0592a4`	ERROR	PASS	ERROR
gh-13823	yoavweiss	`82491b8`	PASS	PASS	ERROR	FAIL
gh-13823	yoavweiss	`010d357`	PASS	PASS	ERROR	FAIL
gh-13823	yoavweiss	`5bf797d`	PASS		ERROR
gh-13822	chromium-wpt-export-bot	`b3bed6b`	PASS	PASS	ERROR	PASS
gh-13804	moz-wptsync-bot	`342f7cf`	PASS	PASS	ERROR
gh-13804	moz-wptsync-bot	`4db1377`	ERROR		ERROR
gh-13789	cvazac	`b6ccb69`	PASS	PASS	ERROR	FAIL
gh-13789	cvazac	`364fb76`	PASS	PASS	ERROR	FAIL
gh-13789	cvazac	`2c9df3b`	PASS	PASS	ERROR	FAIL
gh-13773	chromium-wpt-export-bot	`4becbdb`	ERROR	FAIL	ERROR	FAIL
gh-13757	moz-wptsync-bot	`ee3ff96`	ERROR		ERROR
gh-13755	chromium-wpt-export-bot	`b369dab`	PASS	PASS	ERROR	FAIL
gh-13744	autofoolip	`6d259c9`	PASS		ERROR
gh-13691	chromium-wpt-export-bot	`c68bc9e`	ERROR	FAIL	PASS	PASS
gh-13666	chromium-wpt-export-bot	`a2b2b90`	PASS	PASS	ERROR	PASS
gh-13664	servo-wpt-sync	`6bf7422`	ERROR		ERROR
gh-13652	Ms2ger	`6ac2a03`	ERROR	FAIL	PASS	PASS
gh-13606	woaiwyhty	`1cee72e`	ERROR	FAIL	PASS	PASS

✅ TravisCI missed the commit (1 commit)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13926	pyup-bot	`3e96501`		PASS		PASS

❌ GitHub missed the results (2 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13904	youennf	`8da49cd`	PASS		PASS
gh-13605	chromium-wpt-export-bot	`07d77be`	PASS		PASS

❌ Taskcluster timed out (5 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox
gh-13929	moz-wptsync-bot	`1d98d96`	ERROR		ERROR
gh-13838	moz-wptsync-bot	`16eed42`	PASS	PASS	ERROR
gh-13804	moz-wptsync-bot	`342f7cf`	PASS	PASS	ERROR
gh-13804	moz-wptsync-bot	`4db1377`	ERROR		ERROR
gh-13757	moz-wptsync-bot	`ee3ff96`	ERROR		ERROR

✅ Missing the "event filtering" fix (4 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	TravisCI: Firefox
gh-13566	foolip	`2ee886e`	PASS	PASS
gh-13548	pyup-bot	`ade02ef`	PASS	PASS
gh-13517	annevk	`7495dc2`	PASS	PASS
gh-13511	servo-wpt-sync	`2d87392`	PASS	PASS

✅ Closed before Taskcluster completed (4 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox
gh-13827	lukebjerring	`c0592a4`	ERROR	PASS	ERROR
gh-13767	chromium-wpt-export-bot	`995cdc7`	PASS		PASS
gh-13603	chromium-wpt-export-bot	`f01a681`	PASS		PASS
gh-13550	chromium-wpt-export-bot	`dffb313`	PASS		PASS

✅ Taskcluster not enabled on base branch (3 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	TravisCI: Firefox
gh-13616	rachelandrew	`e28b31f`	PASS	PASS
gh-13616	rachelandrew	`a823191`	PASS	PASS
gh-13518	yoavweiss	`78f5a14`	PASS	PASS

❌ Taskcluster missed the commit (26 commits)

pull request URL	pull request author	commit	TravisCI: Chrome	TravisCI: Firefox
gh-13907	youennf	`af0b381`	PASS	PASS
gh-13906	soareschen	`1937a04`	PASS	ERROR
gh-13905	youennf	`ff33b7e`	PASS	PASS
gh-13904	youennf	`92297ec`	PASS	PASS
gh-13886	soareschen	`34a8296`	PASS	ERROR
gh-13884	autofoolip	`bf19490`	PASS	PASS
gh-13862	zdobersek	`fc92447`	PASS	PASS
gh-13842	akansha2608	`38deacc`	PASS	PASS
gh-13842	akansha2608	`b9a47dc`	PASS	PASS
gh-13823	yoavweiss	`5bf797d`	PASS	ERROR
gh-13819	ewilligers	`1a925d7`	PASS	PASS
gh-13781	christian-bromann	`0a36538`	PASS	PASS
gh-13781	christian-bromann	`7548d3c`	PASS	PASS
gh-13748	ewilligers	`3c50079`	PASS	PASS
gh-13744	autofoolip	`6d259c9`	PASS	ERROR
gh-13738	gsnedders	`874b6a0`	PASS	PASS
gh-13681	servo-wpt-sync	`a0438c4`	PASS	PASS
gh-13677	ewilligers	`18477d4`	PASS	PASS
gh-13664	servo-wpt-sync	`6bf7422`	ERROR	ERROR
gh-13627	moz-wptsync-bot	`a5199e7`	PASS	PASS
gh-13616	rachelandrew	`ed322df`	FAIL	FAIL
gh-13616	rachelandrew	`659a97e`	FAIL	FAIL
gh-13616	rachelandrew	`2e263df`	FAIL	FAIL
gh-13606	woaiwyhty	`267f7dc`	PASS	PASS
gh-13589	frivoal	`267f7dc`	PASS	PASS
gh-13556	autofoolip	`40dfbac`	PASS	PASS

✅ Stability discrepancy - Taskcluster **found** instability that TravisCI missed (19 commits -- down from 26)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13878	lukebjerring	`340b297`	PASS	FAIL	PASS	FAIL
gh-13878	lukebjerring	`c4f9ad6`	PASS	FAIL	PASS	FAIL
gh-13877	Hexcles	`55eb739`	PASS	FAIL	PASS	FAIL
gh-13787	chromium-wpt-export-bot	`c169e95`	PASS	FAIL	PASS	PASS
gh-13729	domfarolino	`c0c5084`	PASS	FAIL	PASS	PASS
gh-13727	chromium-wpt-export-bot	`623f033`	PASS	FAIL	PASS	PASS
gh-13723	moz-wptsync-bot	`1ddb18e`	PASS	FAIL	PASS	FAIL
gh-13689	chromium-wpt-export-bot	`1066be1`	PASS	FAIL	PASS	PASS
gh-13640	chromium-wpt-export-bot	`f2c4c0f`	PASS	FAIL	PASS	PASS
gh-13597	chromium-wpt-export-bot	`4f05c01`	PASS	FAIL	PASS	PASS
gh-13549	chromium-wpt-export-bot	`067c5e2`	PASS	FAIL	PASS	PASS
gh-13528	chromium-wpt-export-bot	`27ebdcb`	PASS	FAIL	PASS	PASS
gh-13903	youennf	`2acc419`	PASS	PASS	PASS	FAIL
gh-13740	moz-wptsync-bot	`852fe69`	PASS	PASS	PASS	FAIL
gh-13663	chromium-wpt-export-bot	`d0df0bd`	PASS	PASS	PASS	FAIL
gh-13615	annevk	`fd1d2c5`	PASS	PASS	PASS	FAIL
gh-13611	lukebjerring	`ac016d8`	PASS	PASS	PASS	FAIL
gh-13587	moz-wptsync-bot	`07052a8`	PASS	PASS	PASS	FAIL
gh-13579	chromium-wpt-export-bot	`1e81a41`	PASS	PASS	PASS	FAIL

✅ Stability discrepancy - Taskcluster **missed** instability that TravisCI found (5 commits -- down from 10)

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-13859	marcoscaceres	`a6311d7`	FAIL	PASS	FAIL	PASS
gh-13812	akansha2608	`6f63f30`	FAIL	PASS	PASS	PASS
gh-13696	qiuzhong	`b5b38b0`	FAIL	PASS	FAIL	PASS
gh-13671	domfarolino	`6e6d6be`	PASS	PASS	FAIL	PASS
gh-13669	alanwaketan	`d1ae1b3`	PASS	PASS	FAIL	PASS

This means:

TravisCI's error rate is 8% (41/485)
Taskcluster's error rate is 7% (33/485)

Keep in mind that the failure mode is substantially different between the two systems. When TravisCI fails, it does so with a discoverable log file (good) that takes a very long time to produce (bad). When Taskcluster fails, it most commonly does so silently (bad) but in a way that is immediately apparent (good).

Contributors can recover from Taskcluster failures by pushing empty commits to their feature branch, but they have no recourse for TravisCI failures. That doesn't sway me, though.

Not only is a silent CI failure a poor experience for contributors, it's a risk to the health of the project. Reviewers may not recognize that one of the project's "checks" is not running, but they will be able to merge as soon as the others have passed. This will be more concerning when/if we move more work to Taskcluster (e.g. linting, infrastructure tests).

At this point, it doesn't look like Taskcluster is better or worse than TravisCI. "Differently Concerning" happens to be the title of by debut LP, but it also seems fitting here. Personally, I think we need a more significant reduction in error rate to justify the new risk.

jugglinmike · 2018-11-14T19:50:52Z

Taskcluster missed 3 commits over the past seven days:

pull request URL	pull request author	commit	TravisCI: Chrome	TravisCI: Firefox
gh-14004	qiuzhong	`f4bd28a`	PASS	PASS
gh-14001	autofoolip	`842c1b0`	PASS	PASS
gh-14000	autofoolip	`e56f8cb`	PASS	PASS

This was over 112 commits, so it represents a 3% failure rate. The sample size is smaller, though, and I don't have the math to determine statistical significance.

It's still good news, though, because shortly after I reported those results last week, the Taskcluster team improved their internal logging. I've submitted a new bug to the Taskcluster team--fingers crossed that we'll get some more clarity from the new data.

foolip · 2018-11-16T12:54:41Z

@jugglinmike other than the 3% failure rate, which I think we can tolerate if it's being improved, is there anything else that would favor continuing with Travis instead of making the Taskcluster stability runs blocking?

Fixes #7660. #13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would.

…ster (#14096) Fixes #7660. #13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would.

foolip · 2018-11-16T17:09:56Z

With #7660 closed, as part of this task should also include finally removing the stability jobs from Travis entirely, which should free up some capacity.

jugglinmike · 2018-11-16T18:41:43Z

@jugglinmike other than the 3% failure rate, which I think we can tolerate if it's being improved

3% only describes the performance over the past seven days. We have no reason to expect any improvements yet, so a more accurate picture should include the previously-reported data. That indicates a 6% failure rate.

is there anything else that would favor continuing with Travis instead of making the Taskcluster stability runs blocking?

I'm not convinced that silently allowing 6% of commits to go unverified is acceptable. The acceptance of gh-14096 shows a difference of opinion. I was hoping to get some clarity around this from the new project governance (it's why I requested a status update earlier this week), but it looks like we need to move on this before that's active.

jgraham · 2018-11-16T18:54:02Z

I think we can find out what happens if a required status check is missing; if that blocks landing then we don't get 5±2% of PRs landing whilst missing status checks, but need action in those cases. I think that's comparable to the rate of PRs that need admin access due to travis having errors that TaskCluster won't have (and in almost all cases the admin action for travis problems has been "merge and assume things are fine", so we already have some percentage of PRs missing status checks).

foolip · 2018-11-16T21:43:54Z

In my view, #7660 has been a low-level emergency since I filed it over a year ago. I've had to work around it and apologize about it in blocked PRs over and over and over again. It has occasionally detected flaky tests, which I haven't counted and made a long list of, but certainly my experience has been more occurrences of trouble than catching of real problems. Given that, as long as the Taskcluster isn't much worse than Travis at catching real problems, it seems like a great trade. I'm glad I got two people agreeing on #14096 :)

foolip · 2018-11-16T21:50:42Z

I'm not convinced that silently allowing 6% of commits to go unverified is acceptable.

I think the difference here is how valuable we're guessing the stability checks have been. I have felt them to be a net negative for the past year but knew I couldn't get away with just disabling them. Assuming that flakiness is detected one something like maybe 0.1% or 1% of PRs, if we did miss 6% of those it wouldn't end up causing much trouble in practice, 94% of the trouble already averted!

There's also the empirical question of whether making a check required blocks the PR if the check never starts. Hopefully we'll learn the answer very soon. If it doesn't block I guess we should pester both the Taskcluster team to react to the webhooks better, and GitHub to block PRs if expected required checks haven't started.

jugglinmike · 2018-11-16T22:29:54Z

I agree that if the Checks API requires a response, then the behavior we're seeing from Taskcluster is far more acceptable. And I recognize that I have not had to help folks understand irrelevant TravisCI failures, so I probably underestimate the value of disabling those jobs.

christian-bromann · 2018-11-20T14:53:21Z

@foolip I've seen that we once did those stability tests using Sauce Labs. Is there any value to bring this back now? I can provide the necessary account capabilities so we have enough concurrency to efficiently run these tests.

foolip · 2018-11-20T15:18:54Z

@christian-bromann unfortunately we had to remove that when JWT was deprecated, see #9903. The trouble we'd face if trying to add it back is that we can't put any secrets in our CI setup, because they can be extracted with anyone with write access to the repo. Or, to put it differently, however we authenticate we have to be OK with that being available to hundreds of people and being possible for them to extract and use elsewhere. I have no reason to think anyone would, but we'd have to accept the risk.

jgraham · 2018-11-20T15:23:26Z

Also, managing our own infrastructure has generally worked better and been more flexible; our needs are not always straightforward and it's useful to be able to run arbitary code rather than just having WebDriver access. With a combination of TaskCluster and Azure we covered all our requirements except Edge. If we could make Edge work on Sauce without the problems @foolip mentioned, that would be helpful, although I expect we would still prefer to move to Edge on another system (likely Azure) giving us OS-level access if we can get Microsoft to arrange cloud access to Windows 10.

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096

foolip · 2018-11-21T09:17:13Z

I've filed #14165, turns out missing required checks are blocking, and Taskcluster isn't reliably starting.

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096

jugglinmike · 2018-11-21T18:00:13Z

gh-14096 changes the way Taskcluster statuses should be interpreted. I've updated the script to account for this and ran it through the period since my previous report (from 2018-11-15 to 2018-11-21):

Full listing of discrepancies

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-14162	moz-wptsync-bot	`6ef09d0`	PASS	PASS	ERROR	PASS
gh-14161	jugglinmike	`65ab13c`	ERROR	FAIL	ERROR	FAIL
gh-14154	chromium-wpt-export-bot	`f3661b7`	ERROR	PASS	ERROR	PASS
gh-14153	chromium-wpt-export-bot	`96ab4cf`	PASS	PASS	PASS	FAIL
gh-14151	chromium-wpt-export-bot	`5b45293`	ERROR	FAIL	ERROR	FAIL
gh-14147	chromium-wpt-export-bot	`1fa9612`	ERROR	FAIL	ERROR	FAIL
gh-14142	chromium-wpt-export-bot	`b5ab067`	PASS	PASS	PASS	FAIL
gh-14138	servo-wpt-sync	`d43d3b5`	PASS		PASS
gh-14134	moz-wptsync-bot	`7af7f7f`	PASS	PASS	PASS	FAIL
gh-14132	jugglinmike	`908af0c`	PASS	FAIL	PASS	FAIL
gh-14132	jugglinmike	`2cf4f5c`	ERROR		PASS
gh-14131	chromium-wpt-export-bot	`b6a5f11`	ERROR	FAIL	ERROR	FAIL
gh-14122	chromium-wpt-export-bot	`96f017f`	ERROR	FAIL	ERROR	FAIL
gh-14121	chromium-wpt-export-bot	`a44d2f3`	ERROR	PASS	PASS	PASS
gh-14114	ewilligers	`6bbaeea`	ERROR		PASS
gh-14111	ewilligers	`a19c7b2`	ERROR	PASS	PASS	PASS
gh-14106	moz-wptsync-bot	`ec0619b`	PASS	PASS	ERROR	PASS
gh-14096	foolip	`1f6ab50`	PASS		PASS
gh-14096	foolip	`04ecdcc`	PASS		PASS
gh-14096	foolip	`b74d52c`	ERROR		ERROR
gh-14096	foolip	`65525f9`	PASS		PASS
gh-14092	mrego	`ef7824a`	PASS	PASS	PASS	FAIL
gh-14089	qiuzhong	`81c52db`	ERROR	PASS	ERROR	PASS
gh-14088	qiuzhong	`b37b12d`	ERROR		ERROR
gh-14088	qiuzhong	`53ef62c`	ERROR		ERROR
gh-14087	qiuzhong	`cb656d1`	ERROR	FAIL	ERROR	PASS
gh-14087	qiuzhong	`59a1292`	ERROR	PASS	ERROR	PASS
gh-14084	chromium-wpt-export-bot	`d38540d`	PASS		PASS
gh-14081	NavidZ	`465df5e`	PASS		PASS
gh-14081	NavidZ	`7a62d26`	PASS		PASS
gh-14076	jgraham	`1fe94c0`	PASS	FAIL	PASS	PASS
gh-14075	ewilligers	`bb2b9f8`	ERROR	PASS	PASS	PASS
gh-14074	christian-bromann	`067dec7`	PASS		PASS
gh-14074	christian-bromann	`f207ac2`	ERROR	FAIL	ERROR	FAIL
gh-14073	foolip	`acb3ab7`	PASS	FAIL	PASS	PASS

Summary

Total pull requests: 91
Total commits: 119
Tested commits: 111
Contested commits (Chrome): 5
Contested commits (Firefox): 6
Contested commits (both): 24

Taskcluster missed 13 of the 111 commits which were validated by Taskcluster. While a failure rate of 12% is much higher than our previous estimate of 6%, bear in mind that our sample size continues to shrink.

jugglinmike · 2018-11-21T18:36:53Z

I spoke too soon. The following commits were validated correctly:

pull request URL	pull request author	commit	TravisCI: Chrome	TravisCI: Firefox
gh-14096	foolip	`1f6ab50`	PASS	PASS
gh-14096	foolip	`04ecdcc`	PASS	PASS
gh-14096	foolip	`b74d52c`	ERROR	ERROR
gh-14096	foolip	`65525f9`	PASS	PASS
gh-14084	chromium-wpt-export-bot	`d38540d`	PASS	PASS

Further, one of the missing results is due to a commit that was force-pushed about an hour ago, so the pending task is not a concern:

pull request URL	pull request author	commit	TravisCI: Chrome	Taskcluster: Chrome	TravisCI: Firefox	Taskcluster: Firefox
gh-14114	ewilligers	`6bbaeea`	ERROR		PASS

This makes for a failure rate of 6% over the past 6 days, which is in-line with our expectations. I've updated the Taskcluster bug that I opened last week with the list of new misses.

(Verifying Taskcluster results for commits created in this time frame is more complicated than it might seem. We can't use a simple time-based heuristic because Taskcluster's behavior is determined by the state of the .taskcluster.yml file in the pull request branch, not in master. I haven't accounted for this in the script; I'd like to wait and see if the effort needed to account for that subtlety is justified.)

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096

foolip · 2019-02-04T23:10:48Z

Closing this, we've already switched to Taskcluster. A number of issues have already been found, many fixed, and we should file more if we see things being missed.

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096 UltraBlame original commit: e758e29e8a1a9cfc2aea0d2fd6eace425676ec0d

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096 UltraBlame original commit: 3cec1475937580e33ee98d3bae84c8c7f4d5e862

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096 UltraBlame original commit: e758e29e8a1a9cfc2aea0d2fd6eace425676ec0d

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096 UltraBlame original commit: 3cec1475937580e33ee98d3bae84c8c7f4d5e862

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096 UltraBlame original commit: e758e29e8a1a9cfc2aea0d2fd6eace425676ec0d

…is and report them on Taskcluster, a=testonly Automatic update from web-platform-testsAllow stability-checker failures on Travis and report them on Taskcluster (#14096) Fixes web-platform-tests/wpt#7660. web-platform-tests/wpt#13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would. -- wpt-commits: bcd2471a16325fada550e05dad2b17895a232c6a wpt-pr: 14096 UltraBlame original commit: 3cec1475937580e33ee98d3bae84c8c7f4d5e862

jugglinmike added the infra label Sep 24, 2018

jugglinmike self-assigned this Sep 24, 2018

foolip added the priority:roadmap label Sep 24, 2018

Hexcles changed the title ~~Verify correctness of TaskCluster tasks~~ Verify correctness of TaskCluster PR checks Sep 25, 2018

foolip mentioned this issue Oct 1, 2018

Run affected tests in Edge and Safari for PRs #13299

Closed

jugglinmike assigned mariestaver and unassigned mariestaver Nov 7, 2018

jugglinmike mentioned this issue Nov 14, 2018

Remove Travis stability checks (Chrome Dev + Firefox Nightly) #14033

Merged

foolip added a commit that referenced this issue Nov 16, 2018

Make stability checks on Travis blocking and on Travis not

1f6ab50

Fixes #7660. #13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would.

foolip mentioned this issue Nov 16, 2018

Allow stability-checker failures on Travis and report them on Taskcluster #14096

Merged

foolip added a commit that referenced this issue Nov 16, 2018

Allow stability-checker failures on Travis and report them on Taskclu…

bcd2471

…ster (#14096) Fixes #7660. #13194 remains open to verify that Taskcluster isn't failing to detect flakiness that Travis would.

foolip mentioned this issue Nov 21, 2018

Taskcluster status checks not finishing and blocking PRs #14165

Closed

foolip changed the title ~~Verify correctness of TaskCluster PR checks~~ Verify correctness of Taskcluster PR checks Nov 23, 2018

foolip mentioned this issue Nov 23, 2018

Monitor Taskcluster PR and master runs #14210

Closed

foolip closed this as completed Feb 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verify correctness of Taskcluster PR checks #13194

Verify correctness of Taskcluster PR checks #13194

jugglinmike commented Sep 24, 2018

jugglinmike commented Sep 24, 2018

Hexcles commented Sep 25, 2018

foolip commented Nov 2, 2018

jugglinmike commented Nov 7, 2018

foolip commented Nov 13, 2018

jugglinmike commented Nov 14, 2018

jugglinmike commented Nov 14, 2018

foolip commented Nov 16, 2018

foolip commented Nov 16, 2018

jugglinmike commented Nov 16, 2018

jgraham commented Nov 16, 2018 •

edited

Loading

foolip commented Nov 16, 2018

foolip commented Nov 16, 2018

jugglinmike commented Nov 16, 2018

christian-bromann commented Nov 20, 2018

foolip commented Nov 20, 2018

jgraham commented Nov 20, 2018

foolip commented Nov 21, 2018

jugglinmike commented Nov 21, 2018

jugglinmike commented Nov 21, 2018

foolip commented Feb 4, 2019

Verify correctness of Taskcluster PR checks #13194

Verify correctness of Taskcluster PR checks #13194

Comments

jugglinmike commented Sep 24, 2018

jugglinmike commented Sep 24, 2018

Hexcles commented Sep 25, 2018

foolip commented Nov 2, 2018

jugglinmike commented Nov 7, 2018

foolip commented Nov 13, 2018

jugglinmike commented Nov 14, 2018

jugglinmike commented Nov 14, 2018

foolip commented Nov 16, 2018

foolip commented Nov 16, 2018

jugglinmike commented Nov 16, 2018

jgraham commented Nov 16, 2018 • edited Loading

foolip commented Nov 16, 2018

foolip commented Nov 16, 2018

jugglinmike commented Nov 16, 2018

christian-bromann commented Nov 20, 2018

foolip commented Nov 20, 2018

jgraham commented Nov 20, 2018

foolip commented Nov 21, 2018

jugglinmike commented Nov 21, 2018

jugglinmike commented Nov 21, 2018

foolip commented Feb 4, 2019

jgraham commented Nov 16, 2018 •

edited

Loading