Monitoring accuracy and latency of reported information in KCNR #161

Closed
caohe opened this issue Jul 20, 2023 · 1 comment · Fixed by #237

Comments

@caohe
Member

caohe commented Jul 20, 2023

What would you like to be added?

This issue proposes the implementation of a monitoring system to track the accuracy of the reported information in KCNR and to measure the latency in reporting this information. The following features should be added:

  • Accuracy Monitoring: Implement a mechanism to compare the reported data in KCNR with the actual resource allocation on each NUMA node.

  • Latency Monitoring: Measure the time taken from the moment a Pod is scheduled to the moment its information is successfully reported to KCNR.

  • Visualization: Provide a dashboard that displays accuracy and latency metrics.

Why is this needed?

KCNR is a CRD that stores the topology status and resource allocation information of a node. Katalyst gathers this information and reports it to KCNR. However, there are concerns about the accuracy of the reported information and about potential delays in reporting.

@caohe caohe added the enhancement New feature or request label Jul 20, 2023
@caohe caohe moved this to 🏗 In progress in Katalyst Development Dashboard Jul 20, 2023
@caohe caohe added this to the v0.4 milestone Jul 20, 2023
@caohe caohe assigned caohe and unassigned caohe Jul 20, 2023
@zhy76
Contributor

zhy76 commented Jul 20, 2023

/assign

@caohe caohe changed the title from "Enhancing the capability for reporting node resource allocation information" to "Monitoring accuracy and latency of reported information in KCNR" Aug 17, 2023
Projects
Status: 🏗 In progress
3 participants