Skip to content

Latest commit

 

History

History
112 lines (85 loc) · 6.28 KB

request.md

File metadata and controls

112 lines (85 loc) · 6.28 KB

Request for data collection review form

  1. What questions will you answer with this data?
  1. Why does Mozilla need to answer these questions? Are there benefits for users? Do we need this information to address product or business requirements? Some example responses:
  • We need this information to evaluate utilizing DAP infrastructure to provide private advertising attribution in Firefox. This is hugely material as we move forward with our Privacy-Preserving Advertising capabilities.
  • To improve user advertising attribution privacy.
  • To improve advertiser attribution reporting capabilities.
  1. What alternative methods did you consider to answer these questions? Why were they not sufficient?
  • Alternative solutions explored include using OHTTP vs DAP and adding a new DOM API to Firefox, though both alternatives still require collecting the same data in Firefox.
  1. Can current instrumentation answer these questions?
  • We currently do not collect any data to provide attribution (ad conversions) in advertising reports due to the privacy implications of collecting this information. Current instrumentation only captures impression data from our native ad placements. However, advertisers need the impression and conversion data to measure the return on ad spend (ROAS). Historically, Mozilla needed to have a way to provide advertisers with the attribution data needed without violating our high standards on data privacy. Our current instrumentation combined with DAP will allow Mozilla to provide attribution measurement to advertisers without violating user privacy.

  • The reason for this request is to evaluate a privacy-preserving attribution method by collecting aggregated client attribution data through the Distributed Aggregated Protocol (DAP).

  1. List all proposed measurements and indicate the category of data collection for each measurement, using the Firefox data collection categories found on the Mozilla wiki.
  • Note: Measurements will only be captured by websites that trigger the custom event. This will initially be from Mozilla Ad Suppliers (i.e. MDN, Fakespot, Pocket) and approved Advertisers.
Measurement Description Data Collection Category Tracking Bug #
Task ID/Index/Size. These will come from the ad server and be used to identify the DAP task and data location for an ad in the task. Category 2 TBD
Conversion Key. This will come from the ad server and be used by advertisers to track conversions. Category 2 TBD
Ad View/Impression Totals Category 2 TBD
Ad Click Totals Category 2 TBD
Ad Conversion Totals. Conversions occur when an advertiser triggers the custom event for a conversion and there has been a recently related view/click. Category 2 TBD
Last Source Timestamp. This will keep track of the last timestamp an ad view/click occurred. Category 2 TBD
  1. Please provide a link to the documentation for this data collection which describes the ultimate data set in a public, complete, and accurate way.
  1. How long will this data be collected? Choose one of the following:
  • The Ads Engineering team wants to collect the aggregated data permanently, but the client data will be retained for 30 days after the last source event.
  • Potentially the data collection could be scoped to a shorter period and extended if the design is proven to be a viable long term solution.
  1. What populations will you measure?
  • All channels, countries, and locales. No filters.
  1. If this data collection is default on, what is the opt-out mechanism for users?
  • The collection is defaulted on, and the opt-out mechanism is considered to be private browsing. Users can change the default configuration settings for their standard browsing mode by changing the policies in about:preferences#privacy within Firefox.
  1. Please provide a general description of how you will analyze this data.
  • This data will be aggregated and sent through DAP to anonymize it. The collected ad impression, click, and conversion aggregations can then be used, along with task id/index, to associate the interacted ad and provide advertisers with privately collected conversion/attribution reporting.
  1. Where do you intend to share the results of your analysis?
  • The raw client data collected in Firefox will not be directly shared. The only data that will be shared are the anonymized aggregated results from DAP.
  • The aggregated results will be shared Internally within Mozilla and with potential Ad Suppliers and Advertisers, likely via a secure API and/or a reporting tool that prevents access to raw data.
  1. Is there a third-party tool (i.e. not Glean or Telemetry) you propose using for this data collection?
  • Yes
  • The aggregated client data will be input into report vectors, which will be split into encrypted shares of data and sent through the existing Mozilla DAP infrastructure, which uses ISRG, a third party, as a partner. ISRG can only decrypt its share of the individual report data associated with individual DAP tasks.
  • After the data has been processed through the DAP infrastructure, the overall aggregated data will be collected on an interval and output to BigQuery using existing Mozilla Data Engineering infrastructure.