This repository has been archived by the owner on Sep 20, 2024. It is now read-only.

Deadline timeout and logging #2312

Merged
merged 5 commits into from
Nov 26, 2021

Conversation

Member

@kalisp kalisp commented Nov 25, 2021

When Deadline nodes are located in the cloud and spun up dynamically, OP sometimes fails to connect to Mongo, which results in weird failures.

Added more logging to GlobalJobPreLoad for better debugging, added the possibility to drive the timeout with AVALON_TIMEOUT, and added an explicit high timeout to GlobalJobPreLoad.
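For illustration only, here is a minimal sketch of how a timeout could be driven by AVALON_TIMEOUT. The helper name `get_timeout_ms`, the fallback value, and the assumption that the variable holds milliseconds are all hypothetical; they are not taken from this PR.

```python
import os

# Hypothetical fallback; the real default used by OpenPype is not shown in this PR.
DEFAULT_TIMEOUT_MS = 1000


def get_timeout_ms(env=None, default=DEFAULT_TIMEOUT_MS):
    """Return the timeout from AVALON_TIMEOUT, or `default` if unset or invalid."""
    env = os.environ if env is None else env
    raw = env.get("AVALON_TIMEOUT")
    if not raw:
        return default
    try:
        value = int(raw)
    except ValueError:
        # Ignore garbage values instead of crashing the job preload.
        return default
    return value if value > 0 else default


if __name__ == "__main__":
    print(">>> Using timeout {} ms".format(get_timeout_ms()))
```

A preload script could set a higher value in the `env` mapping it passes to the subprocess, and the returned number could then be handed to something like pymongo's `serverSelectionTimeoutMS` option. Both are assumptions about how the value is consumed.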

How to test it:

This is difficult. The only way is to introduce connection lag to the DB, which should be noticeable when a DL job is triggered.
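One possible way to introduce such lag locally is a small TCP forwarder that delays every chunk before passing it on. This is only a sketch: `start_lag_proxy` is a hypothetical helper, not part of this PR, and the job's Mongo URL would have to point at the proxy port for it to have any effect.

```python
import asyncio


async def _pipe(reader, writer, delay):
    """Copy bytes from reader to writer, sleeping `delay` seconds per chunk."""
    try:
        while True:
            data = await reader.read(65536)
            if not data:
                break
            await asyncio.sleep(delay)  # the artificial lag
            writer.write(data)
            await writer.drain()
    except ConnectionError:
        pass  # the peer went away; nothing more to forward
    finally:
        writer.close()


async def start_lag_proxy(listen_port, target_host, target_port, delay):
    """Listen on 127.0.0.1:listen_port and forward to the target with lag."""

    async def handle(client_reader, client_writer):
        upstream_reader, upstream_writer = await asyncio.open_connection(
            target_host, target_port
        )
        # Forward both directions concurrently.
        await asyncio.gather(
            _pipe(client_reader, upstream_writer, delay),
            _pipe(upstream_reader, client_writer, delay),
        )

    return await asyncio.start_server(handle, "127.0.0.1", listen_port)
```

With a delay larger than the configured timeout, the connection attempt should fail in a reproducible way. With a smaller delay, the job should only start more slowly.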

@kalisp kalisp added type: bug Something isn't working module: Deadline AWS Deadline related features labels Nov 25, 2021
@kalisp kalisp self-assigned this Nov 25, 2021
@iLLiCiTiT
Member

NOTE: We should maybe not use AVALON_TIMEOUT but openpype's version of that (and set AVALON_TIMEOUT with openpype's variant)?

@kalisp
Member Author

kalisp commented Nov 25, 2021

I was thinking about it and I think it would be better to keep *_TIMEOUT variables low, if possible.

vendor/deadline/custom/plugins/GlobalJobPreLoad.py
std_output = subprocess.check_output(args,
                                     cwd=os.path.dirname(openpype_app),
                                     env=env)
print(">>> Process result {}".format(std_output))
Member

Now I am not sure whether format() won't actually take care of it, but as I recall, the output from this command is bytes, so it needs to be properly decoded from utf-8?

Suggested change
- print(">>> Process result {}".format(std_output))
+ print(">>> Process result {}".format(std_output.decode("utf-8")))
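To illustrate the difference the suggestion makes, here is a small sketch, assuming Python 3, where `subprocess.check_output` returns bytes by default. The sample value is made up and stands in for the real process output.

```python
# Stand-in for the bytes returned by subprocess.check_output(...) on Python 3.
raw_output = b"OpenPype started\n"

# Formatting bytes directly embeds their repr, including the b'' wrapper
# and escaped newlines.
as_repr = ">>> Process result {}".format(raw_output)

# Decoding first gives readable text; errors="replace" avoids a
# UnicodeDecodeError if the output is not valid UTF-8.
as_text = ">>> Process result {}".format(
    raw_output.decode("utf-8", errors="replace")
)

print(as_repr)
print(as_text)
```

On Python 2 the output is already a `str`, so both lines would print the same text there.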

Member Author

It's getting printed even without this, but the content length is imho limited.

kalisp and others added 2 commits November 25, 2021 18:29
Co-authored-by: Ondřej Samohel <33513211+antirotor@users.noreply.github.com>
Co-authored-by: Ondřej Samohel <33513211+antirotor@users.noreply.github.com>
Member

@antirotor antirotor left a comment

then let's do it :)

@kalisp kalisp merged commit 31f6397 into develop Nov 26, 2021
@kalisp kalisp deleted the bugfix/deadline_timeout_and_logging branch November 26, 2021 08:48