Workflow logs button does not fetch archived logs but node summary logs button does #12759
Comments
This sounds like a duplicate of #12590. I haven't had a chance to review the PR yet.
I did notice this one missing, but the URI differences were the same.
The existing logic should cover this case 🤔 This also would theoretically impact 3.4.x as well, before the Archived/Live merge, meaning this is not a regression either. Are there any errors in your browser or in the Server logs? Could you check against 3.4.x?
As mentioned in the issue, this case is properly covered when opening the log viewer from the LOGS button in the node summary view. It is not (or not properly) covered when opening the log viewer from the LOGS button in the workflow view. Maybe arguments passed to
The only error I see is one appearing even when I don't click on any LOGS button. I don't think it is related: And the log reflecting that from the server:
Yes that would be suspected as one is correct and the other is not. Unfortunately they are the same (note that The only difference is that in the node view, you're getting logs for a specific Pod, as opposed to all logs. But you did specify a Pod in your screenshot 🤔
No, that just means you don't have archived workflows enabled. It really shouldn't 500 though (that's a separate bug, EDIT: tracking in #12814)
What about this part? You didn't seem to reply to that and the 500 error would not occur in 3.4.x
Yes, indeed. I tried all combinations, and nothing shows when coming from the workflow LOGS button.
I missed this part, will check in the coming days. Were you able to reproduce against 3.5.5?
Hey @agilgur5, I was able to test against v3.4.0 with a minimal deploy and the issue exists as well. So I guess it's not a regression?
I'd be curious if this is reproduced in latest 3.4.x. Specifically, I did some backtracking and found #9434 that was fixed prior to 3.4.0 (in 3.4.0-rc4 it would seem) as well as #9644 that was fixed in 3.4.1. It's possible there's a remaining bug from 3.4, although I'd be a little surprised it hadn't been reproduced and reported earlier 🤔. Particularly by the original reporters of those two issues; #9434 in particular had a lot of follow-up and is from an active contributor. Whereas 3.5 has had more changes (#11121 in particular) and so wouldn't be as surprising; #12590 might actually have the same root cause as this issue, though with different symptoms.
I'm able to reproduce this on
Yea exactly the same root cause, just that the initial fix in #12591 was not quite at root cause and so missed the non-archived workflows case. But the root cause was just for archive logs, and not specific to archived workflows -- I fixed that in #12932
As far as I can tell, this bug does seem to exist in 3.4.x; my root cause fix is effectively a follow-up to #9464, which partially fixed #9434. Overall, this seems to be due to the change to
So this is a regression, but not a 3.5 regression, actually a 3.4 regression that somehow hadn't been caught till now
I'm still surprised by this, but maybe people don't use the top-level logs button as often? Or, perhaps more likely, when it doesn't work, they click a node and get logs from there which works and so no one really thought too much about it or filed a bug. I think I may have done something similar subconsciously while testing actually 😅
I was surprised too. This was pretty important to me, and for some time I thought I had just misconfigured something. Good thing you found the root cause. And thank you for looking into this 🙏
Pre-requisites
:latest
What happened/what did you expect to happen?
Hi!
The LOGS button on a workflow page does not load the archived logs when archiveLogs is set to True and workflow pods have been GCd. The LOGS button on the node summary does work properly, i.e. it loads and displays logs from the archive.
Don't know if this is relevant, but my artifact repository is set in the workflow-controller-configmap and is an AWS S3 bucket.
From the workflow LOGS button:
A GET is performed with URI
/api/v1/workflows/mwaa-data-prod/omniscient-dragon-zkc9v/log?logOptions.container=main&grep=&logOptions.follow=true&podName=omniscient-dragon-zkc9v
From the node summary LOGS button:
A GET is performed with URI
/artifact-files/mwaa-data-prod/workflows/omniscient-dragon-zkc9v/omniscient-dragon-zkc9v/outputs/main-logs
which properly fetches archived logs.
Version
v3.5.5
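To make the difference between the two requests reported above easier to compare, here is a minimal Python sketch that rebuilds both URIs from their parts. The function names and parameters are hypothetical and exist only for illustration; they are not part of the Argo Workflows UI or API. The path layouts are copied from the two URIs in this report, and treating the segment after the workflow name in the artifact path as the node ID is an assumption.

```python
from urllib.parse import quote, urlencode


def live_log_uri(namespace, workflow, pod_name, container="main", grep="", follow=True):
    """Rebuild the URI requested by the workflow-level LOGS button (hypothetical helper)."""
    # Query parameters in the same order as the URI quoted in the report.
    query = urlencode({
        "logOptions.container": container,
        "grep": grep,
        "logOptions.follow": str(follow).lower(),
        "podName": pod_name,
    })
    return f"/api/v1/workflows/{quote(namespace)}/{quote(workflow)}/log?{query}"


def archived_log_uri(namespace, workflow, node_id, artifact="main-logs"):
    """Rebuild the URI requested by the node summary LOGS button (hypothetical helper).

    Assumption: the path segment after the workflow name is the node ID.
    """
    return (
        f"/artifact-files/{quote(namespace)}/workflows/"
        f"{quote(workflow)}/{quote(node_id)}/outputs/{quote(artifact)}"
    )


if __name__ == "__main__":
    ns, wf = "mwaa-data-prod", "omniscient-dragon-zkc9v"
    print(live_log_uri(ns, wf, pod_name=wf))
    print(archived_log_uri(ns, wf, node_id=wf))
```

The first helper targets the live log endpoint, which has nothing to return once the pods have been GCd; the second targets the artifact endpoint, which is where the archived main-logs artifact was served from in this report.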
Paste a small workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflow that uses private images.
Logs from the workflow controller
Logs from your workflow's wait container