-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Types of user ids #5775
Comments
I would suggest NOT including any form of hashed email in the options for |
So far we have:
|
FWIW, in late July there was a discussion on https://openrtb-iabtechlab.slack.com/archives/C3Y6GHUTH/p1595433523254200 (TechLab Programmatic, general Slack). The use case similar to here, signaling a stable, publisher-generated UID:
The following was sketched but I don't recall seeing this discussion thread picked up again.
I pinged the channel to see if there was more discussion on eids enhancements. |
to me, On the ID5 side for example, we provide a field we call |
I agree that when describing IDs, it would be useful to distinguish among the various "dimensions" of IDs: Describes Person or Device/App
Set/Link of IDs
Source type
Actual source
Age of ID
I would keep all the "dimensions" distinct from uses of ID
|
Re @dmdabbs note, I think this is what you might be referring to? This is the spec we're currently using with OpenRTB, https://github.com/Advertising-ID-Consortium/IdentityLink-in-RTB |
@joshuakoran Just working my way through your list. I think the adcom "atype" field handles
-Can you further clarify what you meant by "Set/Link of IDs"? Do you expect this to be an array of other ids?
|
The "set/link of IDs" concept relates to sharing a common ID that maps one ID to other IDs (e.g., x-device link, link across two 1P domains such as required by first-party sets). The source (or perhaps better name is "controller") of ID is due to permissions/permitted uses tied to ID. The dimensions defining the ID (e.g., source/controller, type, time) are orthogonal to information tied to the ID (audience attributes/cohorts, restrictions against use for personalization, event-aggregates such as frequency counter, etc.) |
Hey @joshuakoran , mind adding a few example values for each that you think maybe relevant? I want to be sure clear on what you are suggesting. |
Sorry for the delay, finally coming up for air. Agree that adcom is the right model to improve upon. As we think about reducing discrepancies and adopting cross-publisher common ID schemes, such as being discussed here and in IAB TL, it seems we can improve how we annotate the interoperable IDs being used to improve engagement, measurement and optimization. The original question as I understood it was to provide enhanced standard descriptors (metadata) around user IDs + information associated with them, rather than the attribute data (e.g., interest taxonomies, demographic taxonomies, geo taxonomies) or event data (activity_type, optional value of activity such as a purchase transaction). I think the broad classification of ID metadata can be classed into two buckets of better describing the what-ness of the ID and “provenance” of the ID. WHAT concepts
The second type of ID is one that merely links other IDs, such as a “cluster ID.” This ID is generated server-side to associate various IDs together, either probabilistically OR deterministically. Marketers often use this for “x-device” or even same device “x-app” use cases. When publishers operating different domains link their IDs deterministically they may wish to create a shared ID for their use, which is analogous to the proposed “first-party” sets.
FROM WHERE “provenance” concepts When was it created? When was the last time it was verified as still active? Syndicating “stale” IDs to be activated in a walled garden or across the Open Web is technically feasible, but not adding value to marketers. Yet most marketers do not have visibility on the age or last seen date of the data syndicated on their behalf to improve media buying. Ensuring we know where IDs come from likely requires ensuring compact description and perhaps even signing the data. USE concepts I also recommend we keep the above annotations about IDs distinct from what processing operations are associated with them: Preference management (e.g., opt-in/-out of personalization) Examples (purely for illustration and not in formal spec format or optimized for transport efficiency): |
Thanks @joshuakoran what you're describing is going to be tough to express in JSON in a way that makes sense to everyone, let me take a first stab and we can iterate. wrt providence, I feel like the source and stype values do a good job describing that, so I'm going to leave them out for now. |
how about something like this? Anything else to add?
|
I am assuming all these params and values have to be well defined for any consumer to make sense out of it. Wouldn't it be better if we map combination of |
Hi @abhinavsinha001, I think you've raised a very good point, to be clear, I don't have a strong opinion yet about what this should look like, I'm channeling the Identity PMC. But to your point about well defined values you are exactly right. We need a way to ensure that creators don't declare there ID deterministic when it isn't. Wrt pseudoanonymous, all ids except email address is pseudoanonymous, and even email can be pseudoanonymous. So in my mind pseudoanonymous should go entirely. The consumer in this scenario is a DSP. As far as mapping pseudoanonymous and deterministic to a custom atype, atype isn't well understood or used. In theory that sounds like a good idea to me but in practice I'm not sure it would work. Thanks for your comments, what would be really helpful is a modified example. I don't want to be the only one doing the data modeling. |
Hi Jeff - "Wrt pseudoanonymous, all ids except email address is pseudoanonymous" I think that while many IDs we rely on may begin as "pseudonymous," I believe the regulations require organizations to have appropriate technical and/or operational measures in place to keep people's activity distinct from their offline identity (directly-identifiable ID, fkna PII) to be classed as "pseudonymous." |
sure, no disagreement from me on that pt. |
since the primary consumer here are DSPs, can we get some of them to weigh in on what they'd want to see and whether they want the granularity of separate fields or a single field like |
I agree we should get feedback from DSPs on this. I feel most of the parameters do not have any significance individually and can be represented broadly using Sample request leveraging atype value
Here is how we can maintain metadata for atype and parameters that define a particular atype value. Atype Metadata
ID metadata Params
|
Update: Just realized while on IAB-TL meeting - most of the fields and data are part of Data Transparency Standard 1.0 and there is an active discussion to map these fields to oRTB User object - we can use the same standards for eids type as well. |
ok, so sounds like we have something that describes the type of user id in the atype field and it's just a matter of defining how we want to support the atype designation:
I'd like to pause here, now that we have some firmer requirements and wait for DSPs to weigh in. Any disagreement with that approach? |
@abhinavsinha001 I like your example. For anyone who missed the 11/11 Identity PMC meeting, we agreed to move forward with this feature. The group felt we should proactively provide some real time metadata about the id to buyers in preparation for a future state with diminished 3rd party cookie availability. Each UserId module sub adapter will need to decide to support these fields. The PMC will define the standard. Are there any objections to @abhinavsinha001 data model? I'll cross post on our slack channel as well.
|
Just FYI PRAM is suggesting three types of Identifiers:
We can augment this by creation/last seen as described above + source (e.g., publisher, vendor, marketer), such that vendor=apple provides IDFA, and vendor=sharedid.org provides cookie ID. |
@joshuakoran I don't really understand the difference between 1. and 2. ... could you please explain a bit? |
Even if the output is a pseudonymous ID, the input mechanism has different friction/control for users. The user has binary control of generating / resetting ID in 1), but limited technical control over how the ID can be shared across domains. The user has 100% technical control of providing (different/same) ID to be shared across domains for 2). Once the ID in 2) is generated it has the same limits as 1), but the generation using different IDs (work email, home email as one example) is different than using the same laptop with same browser cookies at home and work. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
@jdwieland8282 did we land on a solution here? |
This hasn't come up lately, my recollection is that we would use the |
Type of issue
Question
Description
Do consumers of user ids set within either a new or existing userid module need to know more about how the UUID was generated? Or is the id itself sufficient. Would more DSPs integrate against a particular user id if they knew more about how it was generated?
We should consider a new attribute called "stype" (source type). Type would be passed along side the UUID to SSPs & DSPs.
Steps to reproduce
NA
Test page
NA
Expected results
The text was updated successfully, but these errors were encountered: