-
Notifications
You must be signed in to change notification settings - Fork 90
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Have a single way to identify a project in our telemetry #507
Comments
Why
|
This issue still needs some thought. Some options:
|
I agree that using a uniquely generated projectId, not linked to the project name, is better. This ensures distinct IDs for different projects even if they share the same name. Ideally, this ID should be generated at the time of project creation with |
I have two questions:
|
summary:
Not all of the libraries we surveyed on #510 (comment) track projects uniquely. For example, neither of these projects do:
Two of the projects do track unique projects:
|
On today's Tech Design session we discussed this at length, without reaching an obvious conclusion. Long story short: this is hard ™️
Clearly no perfect solution exists, we will need to keep exploring. Footnotes |
Giving this a bit more thought. From the two examples I could find above,
|
I think it's safe to say that
|
After reviewing the discussion, references, and current implementation, I suggest the following improvements:
This approach aims to address the uniqueness issue, enhance security, and integrate seamlessly with our current workflow. @astrojuanlu, @noklam, @DimedS, @ankatiyar, @datajoely, @idanov, @merelcht, @yetudada - curious on your thoughts, concerns, suggestions 🙂 |
I love the idea of using the Project UUID as a salt for the value we transmit and store in our systems. I'm wondering if we could simplify it even more and
That way, if the project name (or any other property we consider) changes, for us it will automatically be a different project. Otherwise, your proposal @ElenaKhaustova as it stands in #507 (comment) sounds good to me! |
Thank you, @astrojuanlu. I like the idea! It provides the same security level and we don't have to save |
If the name changes back to the original, then the stored hash would be the same as the original project - so we'd see a discontinuity, but perhaps it would make even more sense 😄 I'll let others chime in, otherwise if there are no strong objections I'd say we can proceed 💪🏼 |
Thank you, @ElenaKhaustova. This is a strong solution with a high level of security! I support the idea of generating a unique project ID and storing it in the |
Thank you for writing up this detailed solution @ElenaKhaustova! This sounds like a strong solution to me. The name changing thing is interesting, but I don't think we should spend too much time pondering about this. I'd expect people to change their project name mostly at the beginning when they're still setting up and getting started with a project. Switches during a project seem unlikely, but even if that does happen I don't think that would hugely skew our metrics. |
Description
Let me show you a subset of the data that comes into our database from
kedro-telemetry
in our CLI commands table.Context
This task has two components:
project_name
sometimes has blank entriesExpected Result
We should only have a single project ID that we all trust and we should not have a blank entry for that ID.
The text was updated successfully, but these errors were encountered: