
ci: Limit macOS testing to one version of python #7507

Open
wants to merge 3 commits into main

Conversation

@seemethere (Member):

This limits macOS testing to one version of Python, since macOS unittests take a long time to run and are the most expensive runner type that we currently utilize.

Long-time follow-up to:

Signed-off-by: Eli Uriegas <eliuriegas@meta.com>
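
For context, here is a minimal sketch of what limiting the macOS matrix to one Python version per runner could look like, assuming a standard GitHub Actions include-style matrix. The workflow name, triggers, action versions, and steps are illustrative and not taken from the actual diff; only the runner labels (macos-12, macos-m1-12), the Python versions, and the two comment lines echo the review context quoted later in this thread:

name: Unit tests (macOS)  # illustrative workflow name, not the real one
on: [pull_request, push]

jobs:
  unittests-macos:
    strategy:
      matrix:
        include:
          # Intel macOS: run the unit tests on a single Python version only
          - python-version: "3.8"
            runner: "macos-12"
          # Minimum version available for Apple Silicon is 3.9, so just use that
          - python-version: "3.9"
            runner: macos-m1-12
    runs-on: ${{ matrix.runner }}
    steps:
      - uses: actions/checkout@v3
      - uses: actions/setup-python@v4
        with:
          python-version: ${{ matrix.python-version }}
      # ... install torchvision and run the unit test suite here ...

The substance of the change is simply that the include list pins one Python version per macOS runner instead of expanding over the full set of versions tested on Linux.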
@seemethere requested a review from pmeier on April 7, 2023, 21:09
@pytorch-bot (bot) commented Apr 7, 2023:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/7507

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 Failures

As of commit 45806db:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@weiwangmeta left a comment:

$$ savings, yeah!

@huydhn (Contributor) left a comment:

LGTM! Why not 3.9 though, since we are using 3.9 for all macOS jobs in CI? 3.8, as the minimum version, also kind of makes sense though.

.github/workflows/test-macos.yml (inline review thread, outdated and resolved)
Co-authored-by: Nikita Shulga <nikita.shulga@gmail.com>
@malfet (Contributor) commented Apr 7, 2023:

> LGTM! Why not 3.9 though, since we are using 3.9 for all macOS jobs in CI? 3.8, as the minimum version, also kind of makes sense though.

But we already test the minimum version on other platforms, so 3.9 is probably a good idea.

Signed-off-by: Eli Uriegas <eliuriegas@meta.com>
@pmeier (Collaborator) left a comment:

One thing I don't understand yet is how this saves money. My understanding is that the macos-12 runner is not self-hosted, but rather the "regular" one from GitHub. This is how this comment came into being:

# We need an increased timeout here, since the macos-12 runner is the free one from GH
# and needs roughly 2 hours to just run the test suite
timeout: 240

Quoting from the billing documentation:

> GitHub Actions usage is free for standard GitHub-hosted runners in public repositories, and for self-hosted runners.

Since we are a public repository, shouldn't we be able to use them for free?

One possibility @malfet and I discussed offline is that we are paying extra to increase the concurrency limits and thus decrease the queue time. Can we make sure that this is actually saving money before we merge?

.github/workflows/test-macos.yml (inline review context):

    - python-version: "3.8"
      runner: macos-m1-12
      runner: "macos-12"
      # Minimum version available for Apple Silicon is 3.9, so just use that
@pmeier (Collaborator) commented Apr 11, 2023:

We have been testing against 3.8 before, so this can't be the whole truth? Could you clarify this?

.github/workflows/test-macos.yml (inline review context):

    include:
      # Test against the most popular version of Python (at the time of commit 40% of torch downloads use 3.8)
Comment from a Collaborator:

Should we do the same for Windows as well? Meaning, only Linux will have the full 3.7 to 3.11 coverage?

@NicolasHug (Member) left a comment:

(I'm marking this as "request changes" to avoid merging prematurely before concerns are addressed)

Thanks for the PR @seemethere.

I have the same concerns as those raised in #5479. In particular, I am worried that removing those jobs will reduce our capacity to catch bugs or errors early. As mentioned in #5479 (review), it is fairly common to have some Python version CI jobs pass while others fail, often due to different dependency versions (PIL, etc.).

Before we stop running those on PRs, do we have a safe and reliable mechanism to be alerted when those start failing on main as well?

@seemethere (Member, Author) commented:

> (I'm marking this as "request changes" to avoid merging prematurely before concerns are addressed)
>
> Thanks for the PR @seemethere.
>
> I have the same concerns as those raised in #5479. In particular, I am worried that removing those jobs will reduce our capacity to catch bugs or errors early. As mentioned in #5479 (review), it is fairly common to have some Python version CI jobs pass while others fail, often due to different dependency versions (PIL, etc.).
>
> Before we stop running those on PRs, do we have a safe and reliable mechanism to be alerted when those start failing on main as well?

No, but we're going to need to do this anyway. Intel macOS is a platform that we are de-prioritizing, and given the need for efficiency we're going to have to cut Intel macOS testing across our entire organization. If these jobs could run in 5 minutes this wouldn't be an issue, but since they take over an hour to run, they need to get cut.

@NicolasHug (Member) commented:

> Intel macOS is a platform that we are de-prioritizing

Does that mean we won't be releasing binaries for Intel macOS? If we're not releasing binaries, then that's fine and we can remove the testing jobs. But if we're still going to provide binaries, surely we want to keep some form of testing for those platforms.

> If these jobs could run in 5 minutes this wouldn't be an issue, but since they take over an hour to run, they need to get cut.

Do we know why these jobs take an hour? The Linux tests run in under 30 minutes and they run the same tests. If it's just a matter of speeding up the CI, perhaps the macOS runners are simply underspecced?

@seemethere (Member, Author) commented:

> Intel macOS is a platform that we are de-prioritizing
>
> Does that mean we won't be releasing binaries for Intel macOS? If we're not releasing binaries, then that's fine and we can remove the testing jobs. But if we're still going to provide binaries, surely we want to keep some form of testing for those platforms.

We will still release binaries for the next release at a minimum, but we're considering dropping them after that.

> If these jobs could run in 5 minutes this wouldn't be an issue, but since they take over an hour to run, they need to get cut.
>
> Do we know why these jobs take an hour? The Linux tests run in under 30 minutes and they run the same tests. If it's just a matter of speeding up the CI, perhaps the macOS runners are simply underspecced?

They are underspecced at 3 cores, so it makes sense that they take 1.5 hours to run.

To give you an idea of how much each of these runs costs (reference: Pricing Documentation):

1.5 hours (90 minutes) * $0.08 / minute (rate for macos-12) * 4 versions of python = $28.80 / run

One other thing to note is that core pytorch/pytorch doesn't even test more than one macOS version in either its pull request or trunk workflows, so this will bring us more in line with what we do on core PyTorch as well.

@pmeier (Collaborator) commented Apr 13, 2023:

@seemethere The numbers you quoted come from the same page that I quoted above in #7507 (review):

> GitHub Actions usage is free for standard GitHub-hosted runners in public repositories, and for self-hosted runners.

Could you explain why we need to pay for them at all?

@pmeier (Collaborator) commented Apr 13, 2023:

Perusing the pricing documentation some more, here is how I understand this:

  • The very first paragraph states:

    GitHub Actions usage is free for standard GitHub-hosted runners in public repositories, and for self-hosted runners. For private repositories, each GitHub account receives a certain amount of free minutes and storage for use with GitHub-hosted runners, depending on the product used with the account. Any usage beyond the included amounts is controlled by spending limits. [emphasis mine]

    My understanding is that as long as we are using "standard GitHub-hosted runners", they should be free and incur no charge at all.

  • In the "Per-minute rates" section they qualify the above statement a little further:

    The larger runners are not free for public repositories.

  • The page dedicated to larger runners links to "standard GitHub-hosted runners". That page lists macos-12 and thus it shouldn't be a "larger runner".

  • The same page also lists a macos-12-xl runner, which might be a "larger runner" after all, but that doesn't matter here, since we are not using it.

@seemethere (Member, Author) commented Apr 17, 2023:

@pmeier and I talked about this over VC. Here's an overview of some of the questions that came up:

Q: Why don't the macOS runners fall under the free plan for open source repositories?
(Screenshot of GitHub's plan pricing information, taken April 17, 2023)

A: Free plans typically only cover a maximum number of minutes, 2000 in the case of free organizations. Since we are an enterprise organization and we exceed 2000 minutes, we pay for all usage of GitHub Actions, including what would typically be "free" GitHub Actions runners. With that in mind, macOS x86 is our second most expensive platform to test on, so we are approaching this from all angles to ensure that the cost for this specific platform goes down, which includes limiting unittest runs here.
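
As an aside on the earlier question about catching failures on main once the per-PR matrix is reduced: a common GitHub Actions pattern, sketched here purely as an illustration and not something this PR adds, is to keep the reduced matrix on pull requests and run the fuller matrix only on pushes to main and/or on a schedule. The workflow name, cron expression, steps, and the exact version list (taken from the "4 versions of python" figure above) are assumptions:

name: Unit tests (macOS, full matrix)  # illustrative workflow name
on:
  push:
    branches: [main]
  schedule:
    - cron: "0 6 * * *"  # nightly run; the exact cron expression is illustrative

jobs:
  unittests-macos-full:
    strategy:
      fail-fast: false
      matrix:
        # version list is an assumption based on the "4 versions of python" figure above
        python-version: ["3.8", "3.9", "3.10", "3.11"]
    runs-on: macos-12
    steps:
      - uses: actions/checkout@v3
      - uses: actions/setup-python@v4
        with:
          python-version: ${{ matrix.python-version }}
      # ... install torchvision and run the unit test suite here ...

Combined with alerting on failed runs on main, a setup like this would keep full-matrix coverage without paying for it on every pull request.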
