feat: add volcano jobs phase metric #3650
This implementation is too complex; we just need to add metrics at the state transition points.
In practice we only care about the two final states, success and failure. The other states are not final, so we do not need metrics for them.
You just need to find the places where the controller last modifies the status for these two states and make the changes there.
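A minimal sketch of what the reviewer seems to be describing, assuming a plain in-memory counter in place of a real Prometheus metric. The names `JobPhase`, `finalPhaseCounter`, and `observeTransition` are hypothetical and are not taken from the Volcano code base; the idea is only that the counter is bumped at the point where the controller writes a final status.

```go
package main

import "sync"

// JobPhase is a hypothetical stand-in for the job phase type.
type JobPhase string

const (
	PhaseRunning   JobPhase = "Running"
	PhaseCompleted JobPhase = "Completed"
	PhaseFailed    JobPhase = "Failed"
)

// finalPhaseCounter counts jobs that reach a final phase.
// Assumption: a real controller would likely use a Prometheus counter here.
type finalPhaseCounter struct {
	mu     sync.Mutex
	counts map[JobPhase]int
}

func newFinalPhaseCounter() *finalPhaseCounter {
	return &finalPhaseCounter{counts: make(map[JobPhase]int)}
}

// isFinal reports whether p is one of the two final states of interest.
func isFinal(p JobPhase) bool {
	return p == PhaseCompleted || p == PhaseFailed
}

// observeTransition is meant to be called where the controller writes the
// job status. It counts only genuine transitions into a final phase, so a
// repeated status write with an unchanged phase is ignored.
func (c *finalPhaseCounter) observeTransition(oldPhase, newPhase JobPhase) {
	if oldPhase == newPhase || !isFinal(newPhase) {
		return
	}
	c.mu.Lock()
	defer c.mu.Unlock()
	c.counts[newPhase]++
}

// count returns how many jobs have entered phase p so far.
func (c *finalPhaseCounter) count(p JobPhase) int {
	c.mu.Lock()
	defer c.mu.Unlock()
	return c.counts[p]
}
```

With this shape, the controller would call `observeTransition(old, new)` only at the two status-update sites, and intermediate phases never touch the metric.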
/assign
ok
/ok-to-test
Please squash the commits into one.
If I squash the commits into one, it seems I need to re-create the PR. Or is there another way?
Why would you need to resubmit the PR after using git rebase or git squash? That only updates your local git log and has nothing to do with the PR itself.
OK, I have squashed the commits into one.
/lgtm
@hwdef: Closed this PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/reopen |
@hwdef: Reopened this PR. In response to this:
Encountered this error again. It may be that the tests ran right after Volcano was deployed, before the Volcano queue had been opened. Please add
in volcano/.github/workflows/e2e_spark.yaml Line 68 in 320698e
target: ...
- name: Build lastest volcano images
run: |
eval $(minikube docker-env)
make TAG=latest update-development-yaml
make TAG=latest images
docker images | grep volcano
cat ./installer/volcano-development.yaml | grep image:
minikube kubectl -- replace --force -f ./installer/volcano-development.yaml
sleep 30s
...
|
Maybe it’s because the pull policy of the image is |
change to IfNotPresent? |
Yes, but I couldn't think of a better approach. Please try adding this
at volcano/.github/workflows/e2e_spark.yaml Line 64 in 320698e
target:
Do you have any better suggestions? |
/lgtm |
Please also see my review comments:
Thank you for your suggestions. I have made changes to points 2 and 3, but I think using |
Signed-off-by: Prepmachine4 <prepmachine4@gmail.com>
/lgtm
/approve |
[APPROVALNOTIFIER] This PR is APPROVED. This pull request has been approved by: Monokaix
fix: #2493
I added a metric that covers every job phase defined in the job APIs.
The metric is recorded when `jobInformer.Informer()` receives an event. Recording does not take the `cache lock`, which may produce some inaccurate data but improves performance. However, if it traverses all jobs on every update, I think that could cause trouble. Should we switch to incremental updates, or refresh the metrics on a fixed interval?
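To make the incremental option concrete, here is a rough sketch, assuming per-phase job counts are adjusted from the add, update, and delete events rather than recomputed by walking every job. `phaseGauge` and its methods are hypothetical names, and the plain map stands in for whatever gauge type the project actually uses. Unlike the lock-free approach described above, this sketch guards its own small map with a mutex, which is separate from the controller's cache lock.

```go
package main

import "sync"

// phaseGauge tracks how many jobs are currently in each phase.
// Hypothetical type: a real implementation would likely wrap a Prometheus gauge.
type phaseGauge struct {
	mu     sync.Mutex
	counts map[string]int
}

func newPhaseGauge() *phaseGauge {
	return &phaseGauge{counts: make(map[string]int)}
}

// dec lowers the count for phase without letting it go negative.
// The caller must hold g.mu.
func (g *phaseGauge) dec(phase string) {
	if g.counts[phase] > 0 {
		g.counts[phase]--
	}
}

// onAdd handles a newly observed job in the given phase.
func (g *phaseGauge) onAdd(phase string) {
	g.mu.Lock()
	defer g.mu.Unlock()
	g.counts[phase]++
}

// onUpdate moves one job from oldPhase to newPhase. Each event costs O(1),
// independent of the total number of jobs.
func (g *phaseGauge) onUpdate(oldPhase, newPhase string) {
	if oldPhase == newPhase {
		return
	}
	g.mu.Lock()
	defer g.mu.Unlock()
	g.dec(oldPhase)
	g.counts[newPhase]++
}

// onDelete removes a job that was last seen in the given phase.
func (g *phaseGauge) onDelete(phase string) {
	g.mu.Lock()
	defer g.mu.Unlock()
	g.dec(phase)
}

// value returns the current number of jobs in phase.
func (g *phaseGauge) value(phase string) int {
	g.mu.Lock()
	defer g.mu.Unlock()
	return g.counts[phase]
}
```

The trade-off is that incremental counts can drift if an event is missed, so a periodic full recount at a long interval could serve as a correction step.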