-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Increasing sidecar default CPU limits and memory. #21537
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
⏩
Airbyte Code Coverage
|
@davinchia one thing to note though is that with our current node sizes in cloud today, we almost always run out of CPU before memory on our nodes, so with this change the problem could potentially get worse so that we end up needing to spin up more nodes (and have even more unused memory). Just something to keep an eye out for after deploying this change and possibly consider resizing our job nodes to be more efficient depending on how things look. |
@git-phu good point. Maybe it's worth pairing this with
Happy to do the work. Want your thoughts on if we should |
@davinchia yeah we can probably test in PR in dev with a sync that pushes more data and see how it behaves. I don't know how much leeway we have to increase the max size of node pools in prod, so to optimize I think we may just want nodes with more CPU (and same amount of memory). But I also don't think we'll start hitting the max pool size because of this change so it should be safe to deploy it first and collect some metrics before deciding to change node sizes. |
What
Related to https://github.com/airbytehq/airbyte-cloud/issues/3814.
These vars govern the heartbeat and socat Kube sidecars.
Testing has shown that socat requires at least 1.5 CPU to support > 20 MB/s throughput.
Bump to 2.0 CPU just in case. This is fine because CPU limits affect throttling, and not scheduling, so this change will not affect OSS users or baseline Kube deployment requirements.
Also a modest increase in default resource request for the sidecars for better reliability.
How
See what section.
Recommended reading order
🚨 User Impact 🚨
Are there any breaking changes? What is the end result perceived by the user? If yes, please merge this PR with the 🚨🚨 emoji so changelog authors can further highlight this if needed.
Pre-merge Checklist
Expand the relevant checklist and delete the others.
New Connector
Community member or Airbyter
airbyte_secret
./gradlew :airbyte-integrations:connectors:<name>:integrationTest
.README.md
bootstrap.md
. See description and examplesdocs/integrations/<source or destination>/<name>.md
including changelog. See changelog exampledocs/integrations/README.md
airbyte-integrations/builds.md
Airbyter
If this is a community PR, the Airbyte engineer reviewing this PR is responsible for the below items.
/test connector=connectors/<name>
command is passing/publish
command described hereUpdating a connector
Community member or Airbyter
airbyte_secret
./gradlew :airbyte-integrations:connectors:<name>:integrationTest
.README.md
bootstrap.md
. See description and examplesdocs/integrations/<source or destination>/<name>.md
including changelog. See changelog exampleAirbyter
If this is a community PR, the Airbyte engineer reviewing this PR is responsible for the below items.
/test connector=connectors/<name>
command is passing/publish
command described hereConnector Generator
-scaffold
in their name) have been updated with the latest scaffold by running./gradlew :airbyte-integrations:connector-templates:generator:testScaffoldTemplates
then checking in your changesTests
Unit
Put your unit tests output here.
Integration
Put your integration tests output here.
Acceptance
Put your acceptance tests output here.