-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Loki silently rejects data when chunks dir is too large #362
Comments
@sed-i maybe relevant: grafana/loki#364 |
Some context from grafana/loki#1502 Firstly, we need to be careful not to enable large_dir as a workaround on a /boot device (including where /boot is stored on /)
Secondly they moved away from this single large directory in v2.5, in these issues: However to take advantage of that we need to use Schema v12, and looking at the charm we are possibly still using v11: loki-k8s-operator/src/config_builder.py Line 104 in 6c26adf
|
We need to check if we can still reproduce this once we have compaction. We should likely do This happens on |
To help narrow doing our options on this issue, here are some potential approaches we could explore Approach 1: Configuring ingester configwe might consider playing around with the ingester configs to optimize Approach 2: Using tune2fs large_dirEnable Approach 3: TSDB with v12 schema
|
TSDB v12 seems like the right path forward. |
If you dive into the already referenced grafana/loki#1502, or particularly my comment about using a hashing approach to avoid having insane amounts of files: grafana/loki#1502 (comment) This is what has been used since forever for other services like EMail services which store a larger count of files and it should not be much of a problem to implement for Loki when regular file storage. |
Bug Description
When the chunks dir contains too many files, loki stops ingesting new data. (Setting
tune2fs -O large_dir /dev/vda2
resolved it.)To Reproduce
See canonical/cos-proxy-operator#130.
Environment
See canonical/cos-proxy-operator#130.
Relevant log output
Additional context
Observed by @dnegreira.
The text was updated successfully, but these errors were encountered: