-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature request: count unique things seen #123
Comments
Neat idea. I've wondered where the line gets drawn between technical metrics and user analytics, but this is something we might use too if it were available. IP addresses, OAuth client IDs, and maybe User-Agent strings would be things we might track like this, along with authenticated user IDs. |
@dpiddy to better predict the worst case memory usage, is there some sane max boundary on key length e.g. 128 or 256 bytes? |
I think maybe even 64 bytes would be a reasonable limit, that would handle a sha256 sum. |
@josephruscio wouldn't value length be more important as you basically need to make a set of values for the key and then count them at the bucket boundary? |
Yeah I think the value is what we're talking about. @dpiddy agree that 64 bytes would be reasonable. Larger values can just be hashed down to that, and would be preferable when/if the values for the logfmt tuples had whitespace. For example,
Probably wouldn't be that useful in a
would be. |
Yeah. I'd say the limit should be 64 bytes and the recommended usage when Another option would be to have l2met always hash but that has tradeoffs For my immediate use case for this feature I only need 10 bytes or less. On Thursday, October 31, 2013, Collin Van Dyck wrote:
|
@collinvandyck @freeformz correct, I accidentally used the wrong term, max length for any value in the set was my concern. |
I would like to have l2met produce a "unique things seen" metric from my log lines. Say I have an app that logs lines like this:
I would like a metric that counts how many unique
user_id
s were logged, in this case 3. A possible convention:these lines would cause a
user-id
metric to be emitted every interval and again in this case the value would be 3 based on these lines.The text was updated successfully, but these errors were encountered: