Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide additional threads, retry until not failed #1395

Merged
merged 1 commit into from
Oct 20, 2021

Conversation

heatmiser
Copy link
Contributor

SUMMARY

Provisioner can intermittently fail during image pulls with "Failed to pull image" message. Additionally, when adding Projects to controller, "Project update failed" message can occur. Occasionally, the underlying error is "Error: Error writing blob to file "/var/tmp/storage/storage304273906/1": error happened during read: unexpected EOF

Research on this issue: lead to similar issue being reported in buildah project for container engine pulls: http://github.com/containers/buildah/issues/2224
Ultimately, containers.conf was "wired up" to allow raising the number of threads for simultaneous image layer copies (pulled/pushed).

Raising container engine parallel threads from 6 to 10 and adding retries until not failed for associated image pull tasks to resolve the issue.

ISSUE TYPE
COMPONENT NAME
  • provisioner
ADDITIONAL INFORMATION

To reproduce, deploy a 50 student workshop multiple times over the period of the day, you will experience intermittent failures on a small percentage of controllers.

@heatmiser
Copy link
Contributor Author

Fixes #1394

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants