-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI] macOS Integ Test Flakiness #442
Comments
How about doing something similar to core opensearch-project/OpenSearch#2547? We can use the https://blog.gradle.org/gradle-flaky-test-retry-plugin plugin to perform the retry. |
I apologize if I'm intruding here, but if you're considering MacOS runners, may I suggest giving FlyCI a try? We offer MacOS M1 and M2 runners that are faster than GitHub's. For public repos, we offer 500 mins/month of free M1 usage (4 vCPUs, 7 GB RAM, 28 GB storage). The setup is super easy:
jobs:
ci:
- runs-on: macos-latest
+ runs-on: flyci-macos-large-latest-m1
steps:
- name: 👀 Checkout repo
uses: actions/checkout@v4 Do you think this might be a good option for flow-framework? Web: flyci.net | Twitter: flyciapp | LinkedIn: FlyCI | Discord: FlyCI |
Hey @kgantchev thanks for the idea. I've actually taken advantage of these runners on other projects I maintain. All OpenSearch project repo settings are centrally managed so getting this app approved has to go through these organization admins. We'd need to come up with a justification for the need to do this testing to propose adding it, but I am not sure it's really necessary here (and given the scale of this organization's projects, I doubt we'd fit under your free offering.) our primary user base uses cloud servers on hardware optimized for that purpose (flexible storage and memory choices, no need for displays), while Apple Silicon is primarily optimized for the end user market (integrated on-chip memory and GPU). So our investments in testing hardware/software need to focus on the overwhelming majority of users. |
Awesome! I'm glad it has come in handy.
Got it. Thanks for the feedback. I guess I missed the full context here. :) I appreciate it. |
Reopening this issue as the recent PRs did not fix the node drop issue (but did give much less confusing logs) |
Closing this as it's no longer macOS central, opening a new issue for multi-node. |
Is your feature request related to a problem?
Integ tests are flaky on macOS. Given that failures have primarily been resource-constrained, and that macOS runners are really VMs and tend to ve more limiting, this is not a surprise.
What solution would you like?
Pick one:
Platform
class to test whether we're on macOS and add longer time for that platformbash
script in the github action to automatically retry N times. See this example from a project I maintain. This (with 2 tries) is consistent with OpenSearch Jenkins retries.What alternatives have you considered?
Investigating whether macOS is officially supported and if not, using GitHub's
allow_failure: true
on macOS tests. I don't like this option but it's the easiest.Do you have any additional context?
macOS is a fantastic operating system but it is specifically designed to run on Apple hardware and not in a VM.
The text was updated successfully, but these errors were encountered: