You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Aug 23, 2023. It is now read-only.
We want to be able to reject all data where the metric name or associated tags / values contain non-ASCII chars. Some backends such as Cassandra don't accept all byte values as string chars, which can lead to MT "hanging" because Cassandra keeps rejecting the insert query while MT keeps retrying it.
There should be a flag which makes the input validation stricter and which makes it reject all non-ascii chars.
The text was updated successfully, but these errors were encountered:
Another option would be to only reject data which does not pass the validation by unicode/utf8.Valid(): https://golang.org/pkg/unicode/utf8/#Valid
This would be more permissive and it would likely solve the issue of cassandra rejecting certain byte values, although I'm not 100% sure about that.
@replay i have a question on how this works, more specifically why the archive code block looks like it has a limiter on the amount of retries but the main block does not.
I think its just spinning forever on failures i put a test in just for my sanity with how the code block is today and it goes a long long time on bad data. tehlers320@5b0f949
Yeah, i think this confirms it. As soon as the index is spinning its wheels on a bad chunk of data everything else stalls. scaled the error to highlight when the rest of metrictank stops operating
We think that we can most likely address this by simply always rejecting non-utf8 identifiers.
Maybe make it optional, but seems like that should always be sane.
Ascii is suposed to be a subset of utf-8 anyway.
We want to be able to reject all data where the metric name or associated tags / values contain non-ASCII chars. Some backends such as Cassandra don't accept all byte values as string chars, which can lead to MT "hanging" because Cassandra keeps rejecting the insert query while MT keeps retrying it.
There should be a flag which makes the input validation stricter and which makes it reject all non-ascii chars.
The text was updated successfully, but these errors were encountered: