-
Notifications
You must be signed in to change notification settings - Fork 246
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Detect and label CPU Model Name #270
Comments
Would it be better to split the information into smaller pieces, like:
I think this would be a more reliable way of ensuring a certain generation of cpu. WDYT? |
Yes, I think this would work, |
++ |
Just to raise awareness of this issue, I want to share a nice library that is targeting this issue A library to track system architecture aspects incl. processor family & co |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Maybe we should just start with model, stepping, and nominal clock frequency /remove-lifecycle stale |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle stale |
I have created a side care container for NFD https://github.com/archspec/archspec-feature-discovery , leverage ArcSpec library, that give us the name of the microarch |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Stale issues rot after 30d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle rotten |
Is this issue available to be worked on? |
I think with #464 hopefully merged at some point this can be more useful |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-contributor-experience at kubernetes/community. |
/remove-lifecycle stale |
I have an example side-car container that helps with this https://github.com/archspec/archspec-feature-discovery Deploy the Node-feature-Discovery with the extra option $ kubectl apply -f manifests/
namespace/archspec-feature-discovery created
serviceaccount/archspec-feature-discovery created
role.rbac.authorization.k8s.io/archspec-feature-discovery created
rolebinding.rbac.authorization.k8s.io/archspec-feature-discovery created
daemonset.apps/archspec-feature-discovery created
$ kubectl get all -n archspec-feature-discovery
NAME READY STATUS RESTARTS AGE
pod/archspec-feature-discovery-ddhnq 1/1 Running 0 7m21s
pod/archspec-feature-discovery-rr9ts 1/1 Running 0 7m21s
NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE
daemonset.apps/archspec-feature-discovery 2 2 2 2 2 node-role.kubernetes.io/worker= 7m22
$kubectl get no -o json | jq .items[].metadata.labels |grep archspec
"archspec.io/cpu.family": "6",
"archspec.io/cpu.model": "79",
"archspec.io/cpu.target": "haswell",
"archspec.io/cpu.vendor": "GenuineIntel",
"archspec.io/cpu.family": "6",
"archspec.io/cpu.model": "79",
"archspec.io/cpu.target": "haswell",
"archspec.io/cpu.vendor": "GenuineIntel",
|
/help |
@ArangoGutierrez: Please ensure the request meets the requirements listed here. If this request no longer meets these requirements, the label can be removed In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
I am working on this issue. Please assign this issue to me. |
@fabiand Any requirements for kubeVirt I remember that some folks asked about CPU Model for kubeVirt as well? |
@marquiz @ArangoGutierrez Should we think of a plugin system to enable such third party libs or do you think the side-car container is the way to go to extend NFDs detecting functionality? |
One thing I dislike is that we are still using a hostMount which can cause some pain. |
The other thing we thought about was a grpc plugin system a dedicated channel for extensions. |
I would go with side-car for now. And it doesn't need to be a side-car, it can be a separate pod.
Could you elaborate a bit? What is the pain you're experiencing?
I wouldn't go there until there is a clear use case and reasoning to do this. It complicates matters - from the plugin's perspective, too, IMO. The extensions/communication is supposed to be node-local in any case. For this particular issue (CPU detection) we shouldn't need any extra plugins. Just read/parse |
Can someone assign this issue to @AshwiniGhanwat as she is already working on it. |
/assign AshwiniGhanwat |
The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale |
/remove-lifecycle stale |
@digambar15 @AshwiniGhanwat Any updates on this? |
/assign |
/unassign AshwiniGhanwat |
I think lmdaly is not working in this area anymore. |
@marquiz: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
An addition to the CPU source that adds a string label with the CPU Model name (model name as described under /proc/cpuinfo for eg.)
For example: : "Intel(R) Xeon(R) CPU E5-XXXX"
The reason for this, in heterogeneous clusters, a workload that requires a specific platform feature or configuration needs to run on a particular model type, ensuring that the workload does not end up on an undesirable platform.
A latency sensitive workload may want to run on the latest architecture to reap the benefits of this new architecture, whereas infrastructure tasks can run on older generations as performance is not as important.
The text was updated successfully, but these errors were encountered: