Export existing fidesctl resource metadata to csv file #299

iamkelllly · 2022-01-05T04:16:08Z

In an organization, it would be helpful to export the rich metadata that fidesctl already captures in Registry, System, and Dataset resources so that it can provide the foundation for a data map (otherwise known as a data inventory, or data flow map).

This issue will solve for the target state experience:

fidesctl developer user runs fidesctl export command, ie fidesctl export path/to/folder/
.csv file is written to specified folder that contains Organization, System, Registry, and Dataset metadata, which includes:
- organization.name, address, email, phone
- organization.dponame, dpoaddress, dpoemail, dpophone
- organization.representative, repaddress, repemail, repphone
- organization.securitypolicy (url)
- system, dataset: export only what we already capture (resource name, use, subject type, data categories, qualifier)

This will be complete when the CSV file looks similar to the csv here: https://docs.google.com/spreadsheets/d/1AItjzt2DOvCyG9my2wDlUkiTFX8wr9w1gLpNG8VzwTE/edit#gid=0

The text was updated successfully, but these errors were encountered:

* Allow incoming fields to be specified on a saas config as being dependent on each other, not treated as an independent list of values. - Update graph_task.pre_process_input_data to be able to optionally separate independent fields from dependent fields when processing incoming data into a collection. * Add a test for pre_process_input_data when group_dependent_fields is set to True. - Fix bug where nesting of adding data to output is happening in the wrong place. * Add validation that grouped_inputs must all reference fields from the same collection. * Fix bug where empty dict was being added to array. * Fix bad yaml nesting and the fact that some extra endpoints were adding in the saas config test. * Fix potential bug where collection name doesn't exist because it didn't pass validation. * Add a test confirming if no grouped_input fields are specified, "fidesops_grouped_inputs" key just returns an empty list. * Grouped_inputs fields may not exist. * Allow grouped inputs to be reference or identity fields. * Put building the dataset graphs within the try/except because if this fails, this will be swallowed and difficult to debug. * Remove post-processor item that is being handled by separate PR. * Responding to CR - when storing grouped_inputs on internal collections, use set representation. * Set FIDESOPS_GROUPED_INPUTS key regardless. * Add the fidesops_grouped_inputs keys - they are now included in all outputs. - Switch the issubset. * Change grouped_inputs list->set type where we merge collections for saas configs. * Fix test after merge.

iamkelllly added the enhancement label Jan 5, 2022

iamkelllly added this to the Backlog milestone Jan 5, 2022

ThomasLaPiana assigned SteveDMurphy Jan 5, 2022

SteveDMurphy mentioned this issue Jan 14, 2022

Export System & Dataset as csv #317

Merged

5 tasks

SteveDMurphy closed this as completed in #317 Jan 21, 2022

iamkelllly modified the milestones: Backlog, fidesctl 1.3.0 Feb 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Export existing fidesctl resource metadata to csv file #299

Export existing fidesctl resource metadata to csv file #299

iamkelllly commented Jan 5, 2022 •

edited

Loading

Export existing fidesctl resource metadata to csv file #299

Export existing fidesctl resource metadata to csv file #299

Comments

iamkelllly commented Jan 5, 2022 • edited Loading

iamkelllly commented Jan 5, 2022 •

edited

Loading