-
Notifications
You must be signed in to change notification settings - Fork 454
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Arguments missing for custom metrics collector #2109
Comments
Thank you for creating this @votti! I think, originally we didn't modify Custom Metrics Collector container args. So user can set any container args without unexpected parameters. Does it solve your problem ? Or you want to pass some other info to your container ? |
Ok let me try this again - I thought I need the trial name to submit the metrics to the correct I wonder how the kubeflowkatib/custom-metrics-collector:latest managed to do this - but i guess that code is still lost: #1263 Once I get this working, I think I have the know-how together to make a new |
Actually this works and gives the name of the current pod.
Thanks for pointing me to this! |
Great. @votti I guess, it gives you Pod name where Training + Metrics Collector containers are running, right ? |
Yes I was just updating my comment as I noticed this. But in my case the Pod name is the TrialName + suffix, so this will work. |
Thanks @votti. That means that Pod name must have Trial name, which might not work for every Kubernetes CRDs as Trial template (e.g. Argo Workflows). Also, we mutate Trial name to the Pod labels, so another solution is to fetch this data from Pod's labels. |
Somehow having the option/a flag to add the standard CLI arguments to the Then such a Eg having to hard code the |
@pod lables: in my case I do not find that label (should be |
We should think more about it, we can always inject args to container ENVs to not break the runtime.
@votti What version of Katib Controller you are using ? This feature was implemented recently (in Katib 0.15 release). |
@andreyvelich : 0.14 - will check in 0.15 soon |
The TrialName can be parse from the pod name. This seems currently a good way to get the trial name. For more discussion see: kubeflow#2109
It seems that we don't have any documentation for the custom metrics collector. It would be good to add the documentation for that and mention about Pod name. |
The TrialName can be parse from the pod name. This seems currently a good way to get the trial name. For more discussion see: kubeflow#2109
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
This issue has been automatically closed because it has not had recent activity. Please comment "/reopen" to reopen it. |
/kind bug
What steps did you take and what happened:
I want to run an experiment with a custom metrics collector using the following spec:
This is used for: #2019
What did you expect to happen:
I expected the metrics collector sidecar pod to be created to carry the arguments:
-t
trial.Name-m
: metricNames-o-type
: string(trial.Spec.Objective.Type)-s-db
: katibmanagerv1beta1.GetDBManagerAddr()which should have been added by the following code:
katib/pkg/webhook/v1beta1/pod/inject_webhook.go
Line 298 in 0a7453d
Anything else you would like to add:
Environment:
kubectl version
):Client Version: v1.26.0
Kustomize Version: v4.5.7
Server Version: v1.23.14-gke.1800
uname -a
): Linux LAPTOP-SS9UALAT 4.19.128-microsoft-standard SMP Tue Jun 23 12:58:10 UTC 2020 x86_64 x86_64 x86_64 GNU/LinuxImpacted by this bug? Give it a 👍 We prioritize the issues with the most 👍
The text was updated successfully, but these errors were encountered: