-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
job failed with zipfile.BadZipFile: File is not a zip file error from time to time #12343
Comments
We think this bug was fixed in receptor 1.2.3, please make sure this receptor version is present in your awx ee image |
I have quay.io/ansible/receptor:devel in my ee repo, and the receptor version is 1.2.0+gce6c620. I will update it to use quay.io/ansible/receptor:latest to have version 1.2.3. I'm closing this issue and reopen it if I still have this issue with receptor 1.2.3. |
I'm reopening this issue, I still have it after the update of AWX and our custom EE : I'm using AWX 21.1.0 receptor --version
1.2.3 |
Did you run the above command inside of the control plane EE or inside of a pod for a running job? |
I am using the same docker image for the control plane EE and the pod where the job is running |
Can you share any information about the workload? How many hosts, how many tasks, how much stdout, are you gathering a lot of facts? Anything you can share might be helpful. |
It happens randomly in different kind of job template. For the last issue, it happens on a job part of a workflow. The job was launch on 1 host, about 15 tasks and no gather facts. |
This could be ansible/ansible-runner#998 |
This might have been fixed with ansible/ansible-runner#1161 |
We still see this error quite a lot on our setup.
ansible-runner --version The code from https://github.com/ansible/ansible-runner/pull/1162/files is there, but we still get this error. |
same problem came up in our setup: |
Is there any suggestion to fix this issue, because we get this quite often:
|
Same issue on latest version - BadZipFile("File is not a zip file") - on random job runs. We run the same job on 300+ hosts and it may occur once in 300 times... Any devs got time to check this? |
usually this relates to issues around log rotation you can bump up your max log container size for your cluster you should also enable reconnect |
Please confirm the following
Summary
From time to time some jobs failed with the following error :
The job has the ID 818890, here the logs os awx_task pod
In the UI the last modified is 11:00:20 AM (CET) or 9:00:20 UTC
but we can see in the logs of awx_task that the job finished at 09:01:07.
AWX version
19.5.0
Select the relevant components
Installation method
kubernetes
Modifications
yes
Ansible version
2.12
Operating system
No response
Web browser
No response
Steps to reproduce
Launch job template in AWX, from time to time it failed with this zipfile.BadZipFile: File is not a zip file error
Expected results
Job launch correctly
Actual results
job failed with zipfile.BadZipFile error
Additional information
WE use custom awx EE.
The text was updated successfully, but these errors were encountered: