Common system metrics collection #24416

Merged on Jul 7, 2021 (47 commits)

Conversation

@elevankoff (Contributor) commented May 22, 2021

I hereby agree to the terms of the CLA available at: https://yandex.ru/legal/cla/?lang=en

Changelog category (leave one):

  • New Feature

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Collect common system metrics (in system.asynchronous_metrics and system.asynchronous_metric_log) about CPU usage, disk usage, memory usage, IO, network, files, load average, CPU frequencies, thermal sensors, EDAC counters, and system uptime; also add metrics about scheduling jitter and the time spent collecting the metrics. It works like atop built into ClickHouse and lets you get monitoring data even if you have no additional tools installed. This closes #9430.
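
For readers unfamiliar with where such values come from, here is a minimal sketch of how one of the listed metrics, load average, can be read on Linux. It is an illustration only and not the code from this pull request: it assumes the standard `/proc/loadavg` text format, and the function names and metric keys (`parse_loadavg`, `load_average_1`, and so on) are hypothetical.

```python
def parse_loadavg(text):
    """Parse the contents of Linux /proc/loadavg into a dict of metrics.

    Expected format (five whitespace-separated fields), for example:
        "0.20 0.18 0.12 1/80 11206"
    The metric key names below are hypothetical, chosen for illustration.
    """
    fields = text.split()
    if len(fields) < 4:
        raise ValueError("unexpected /proc/loadavg format: %r" % text)

    # First three fields: load averages over 1, 5 and 15 minutes.
    one, five, fifteen = (float(x) for x in fields[:3])

    # Fourth field: currently runnable / total scheduling entities.
    runnable, total = (int(x) for x in fields[3].split("/"))

    return {
        "load_average_1": one,
        "load_average_5": five,
        "load_average_15": fifteen,
        "runnable_entities": runnable,
        "total_entities": total,
    }


def read_loadavg(path="/proc/loadavg"):
    """Read and parse the load average file (Linux only)."""
    with open(path) as f:
        return parse_loadavg(f.read())
```

A periodic collector would call something like `read_loadavg()` once per update interval and store the resulting values alongside a timestamp.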

@robot-clickhouse added the doc-alert and pr-feature labels May 22, 2021
@alexey-milovidov self-assigned this May 22, 2021
@azat (Collaborator) commented Jun 2, 2021

Lots of environments already have such monitoring. It looks like it is worth adding a configuration directive to control these new collectors, and making them OFF by default?

Also, maybe it is worth looking at some existing tools (the most popular these days, I guess, is node exporter) to make these metrics compatible with them?

@alexey-milovidov (Member) commented

@azat What is the reason for turning these metrics off?
Even if they are unused and unneeded - they are essentially free to collect.
And it's better to have them enabled by default...

The scenario: a server collects these metrics for a year, and at least once they save the day.

@alexey-milovidov (Member) commented

[Screenshot: Screenshot_20210705_031150]

@alexey-milovidov (Member) commented

Perf test OK (the column has been renamed, and the new name does not exist in the old version).

@alexey-milovidov merged commit 80eaf85 into ClickHouse:master Jul 7, 2021
azat added a commit to azat/ClickHouse that referenced this pull request Jul 16, 2021
Since in ClickHouse#24416 asynchronous_metrics_update_period_s had been set to 1
and this became annoying.

Linked issue: Collect global system metrics inside ClickHouse.