Add cloud user and system account group by #797

eiffel777 · 2019-02-13T16:10:55Z

Description

This pull request adds group by's for the person and system username for the cloud realm. The person group by will show the first and last name of a person associated with a username. The System Username group by will show the username of the person associated with the data being shown. In order to use these group by's usernames from the cloud event logs files are loaded into the modw.systemaccount and modw.person tables using pipelines added to jobs_cloud_generic.json and jobs_cloud_openstack.json. The new pipelines take actions from existing pipelines used for adding usernames from job files and modifies them for use in the cloud realm. In order for the first and last name to be shown when using the User group by the xdmod-import-csv command with the -t names option should be used, which is the same method for loading full names for Jobs data.

Each VM session will now have a person and system username associated with it. A sessions may have multiple events by multiple people during its run. We have choosen to use the person who started the session as the person who is responsible for the session and the resources used by it.

A new ConfigFilesMigration.php file has made for upgrading from a previous installation of xdmod as the new group by's need to be added to the roles.json file. This file checks to see if the cloud.json file exists in the CONFIG_DIR/roles.d folder and if it does the group by's are added to the public and default roles. If the cloud.json file is not found an exception is thrown. Since the cloud.json file exists in the roles.d folder the writeJsonConfigFile function in /classes/OpenXdmod/Migration/ConfigFilesMigration.php cannot be used so I added a function called writeJsonPartialConfigFile that will write a partial json config file to whatever partial config file subdirectory that is specified. When upgrading all previous events in the database will default to -1 for the person and system username. If a person re-ingests their events those events will be updated to have the correct person and system username.

The user group by for the cloud realm was already created but was not active. To activate it entries were added for in to the roles.json and datawarehouse.json files.

Tests performed

Manually tested in docker and new regressions tests for each group by have been added and the integration tests have been modified to account for the new group by's.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

My code follows the code style of this project as found in the CONTRIBUTING document.
I have added tests to cover my changes.
All new and existing tests passed.

…adding people and usernames. adding updated regression and integration tests

…oud-user-group-by

…onfig files

…m database migration file

…ith generic cloud data to account for event_id being generated on the event table instead of generic_cloud_staging_event

…t in post_ingest_updates

…eline file

jpwhite4 · 2019-02-22T01:58:21Z

classes/OpenXdmod/Migration/ConfigFilesMigration.php

+     */
+    protected function writeJsonPartialConfigFile($directory, $name, array $data)
+    {
+        $json = Json::prettyPrint(json_encode($data));


Suggested change

$json = Json::prettyPrint(json_encode($data));

$json = json_encode($data, JSON_PRETTY_PRINT);

The Json::prettyPrint is terribly inefficient and only exists because the JSON_PRETTY_PRINT setting was not in php 5.3. We no longer support php 5.3 so Json::prettyPrint is deprecated and should not be used for new code.

jpwhite4 · 2019-02-22T01:59:40Z

classes/OpenXdmod/Migration/ConfigFilesMigration.php

+                return $file;
+            }
+        });
+


Need some error checking here. What happens if $partialConfigFile is an empty array?

jpwhite4 · 2019-02-22T02:08:20Z

classes/OpenXdmod/Migration/Version800To810/ConfigFilesMigration.php

+     */
+    public function execute()
+    {
+        $this->setCloudRolesFile();


Please explain why this config migration exists. You should be able to update the roles.d/cloud.json file and the RPM will install it in the config directory.

jpwhite4 · 2019-02-22T02:09:34Z

classes/OpenXdmod/Migration/Version800To810/ConfigFilesMigration.php

+        $rolesConfigFolder = $this->config->getPartialFilePaths('roles');
+
+        if($cloudFile = array_search(CONFIG_DIR."/roles.d/cloud.json", $rolesConfigFolder) === false){
+            throw new Exception("cloud.json file not found in roles.d folder");


What happens to this exception? Will it completely kill the upgrade process? If so how does the user recover?

jpwhite4 · 2019-02-22T02:10:04Z

classes/OpenXdmod/Migration/Version800To810/ConfigFilesMigration.php

+    private function addCloudRolesGroupBy($groupBy, $role)
+    {
+        if(!array_key_exists($role, $this->cloudRolesFile['+roles'])){
+            throw new Exception("Role not found in cloud.json file");


What happens to this exception? Will it completely kill the upgrade process? If so how does the user recover?

jpwhite4 · 2019-02-22T02:10:52Z

configuration/etl/etl.d/jobs_cloud_generic.json

+            "namespace": "ETL\\Ingestor",
+            "options_class": "IngestorOptions",
+            "truncate_destination": false,
+            "enabled": true


enabled: true is the default and does not need to be specified

jpwhite4 · 2019-02-22T02:12:57Z

configuration/etl/etl.d/jobs_cloud_generic.json

+                  "name" : "Cloud DB",
+                  "config" : "datawarehouse",
+                  "schema": "mod_shredder",
+                  "create_schema_if_not_exists": "true"


If the setting is the same as the default then you should not override it. E.g. create_schema_if_not_exists is specifed as true in the defaults at the top of the file.

jpwhite4 · 2019-02-22T02:20:42Z

db/migrations/8.0.0-8.1.0/modw_cloud.sql

@@ -4,7 +4,7 @@ USE modw_cloud;
 -- somme OpenStack events. The events that already exist in the staging and event
 -- need to have their mappings updated. Some of the updates include mapping the
 -- compute.instance.power_on.start event to POWER_ON_START event and
-- compute.instance.resume.start to REQUEST_RESUME and compute.instance.resume.end 
+-- compute.instance.resume.start to REQUEST_RESUME and compute.instance.resume.end


As a general rule I prefer to not have changes that only change whitespace. This is because it makes it more difficult for code reviewers to review correctly. For example, in this case, this file shows in the list of changed files relating to adding the user and system group bys. However there is no need fo this file to change at all for the purpose of this pull request.

…oud-user-group-by

classes/DataWarehouse/Query/Cloud/GroupBys/GroupByPerson.php

classes/OpenXdmod/DataWarehouseInitializer.php

jpwhite4 · 2019-02-28T16:45:23Z

classes/OpenXdmod/Migration/ConfigFilesMigration.php

+     * @param string $name The config file name (without ".json").
+     * @param array $data The data to store in the config file.
+     */
+    protected function writeJsonPartialConfigFile($directory, $name, array $data)


Question the need for another dedicated function that writes a json config file. I note that we already have a JSON::saveFile() that does exactly the same thing. The code that calls this function already knows about how to construct the path to the file. In fact you even use the sister JSON::loadFile() function a few lines before calling this one. See classes/OpenXdmod/Migration/Version800To810/ConfigFilesMigration.php lines 36 and 51

Json::loadFile(CONFIG_DIR."/roles.d/cloud.json");

…ace with using JSON::savefile

…d by Ben in a previous PR

chakrabortyr · 2019-02-28T21:24:51Z

LGTM once you address @jpwhite4's outlying comment.

…7/xdmod into add-cloud-user-group-by

…oud-user-group-by

jpwhite4 · 2019-03-01T20:33:40Z

Don't forget to squash and merge

@jpwhite4

* adding person and username group bys for cloud data and pipeline for adding people and usernames. adding updated regression and integration tests * removing unneeded migration code. adding function write out partial config files * update config migration script and remove unneeded use statements from database migration file * documetation updates and remove erroneous commits * formatting changes * removing unneeded function from user and systemaccount group bys * moving generation of event_id to event table instead of staging table * added comments and changed query for event_asset table when dealing with generic cloud data to account for event_id being generated on the event table instead of generic_cloud_staging_event * adding trucate_destination to etl.d file instead of truncate statement in post_ingest_updates * moving post ingest sql update action back to original location in pipeline file * documetation updates * fixing style issues and updating test artifacts for unit tests * fixes for passing unit and style tests * remove extra spaces to pass unit tests * adding new group bys to test artifact * addressing comments from @jpwhite4 * removing defaults from etl pipeline files * updating tests * updating jobs_cloud_generic to be correct * updating cloud person and username tests to use updated anonymized data * removing unnecessary function to write out partial config files. replace with using JSON::savefile * re-adding hide_sql_warning_codes in jobs_cloud_generic that were added by Ben in a previous PR * changing $value to $unused to pass linter

eiffel777 added 7 commits February 4, 2019 14:06

adding person and username group bys for cloud data and pipeline for …

f44ae71

…adding people and usernames. adding updated regression and integration tests

Merge branch 'xdmod8.1' of https://github.com/ubccr/xdmod into add-cl…

4f4aded

…oud-user-group-by

removing unneeded migration code. adding function write out partial c…

0924440

…onfig files

update config migration script and remove unneeded use statements fro…

2b847fe

…m database migration file

documetation updates and remove erroneous commits

20a15b3

formatting changes

0336cc3

removing unneeded function from user and systemaccount group bys

97bcccc

eiffel777 requested review from smgallo and chakrabortyr February 13, 2019 16:10

Greg Dean and others added 10 commits February 13, 2019 11:11

Merge branch 'xdmod8.1' into add-cloud-user-group-by

dd29c90

moving generation of event_id to event table instead of staging table

4d68f83

added comments and changed query for event_asset table when dealing w…

b15fe5c

…ith generic cloud data to account for event_id being generated on the event table instead of generic_cloud_staging_event

adding trucate_destination to etl.d file instead of truncate statemen…

ad4e4c6

…t in post_ingest_updates

moving post ingest sql update action back to original location in pip…

dcda852

…eline file

documetation updates

41d1b73

fixing style issues and updating test artifacts for unit tests

32bd200

fixes for passing unit and style tests

6ae66bf

remove extra spaces to pass unit tests

340db55

adding new group bys to test artifact

e5696ac

plessbd added enhancement Enhancement of the functionality of an existing feature Category:Cloud Cloud Realm labels Feb 18, 2019

plessbd added this to the 8.1.0 milestone Feb 18, 2019

jpwhite4 requested changes Feb 22, 2019

View reviewed changes

eiffel777 and others added 7 commits February 25, 2019 09:06

addressing comments from @jpwhite4

643246a

merging latest xdmod8.1 from upstream

4a8197f

removing defaults from etl pipeline files

70e2313

updating tests

3069b03

Merge branch 'xdmod8.1' of https://github.com/ubccr/xdmod into add-cl…

73f4d96

…oud-user-group-by

Merge branch 'xdmod8.1' into add-cloud-user-group-by

7a0e016

merging in xdmod8.1 changes

41fd953

eiffel777 added 3 commits February 28, 2019 09:19

updating jobs_cloud_generic to be correct

9c21703

updating cloud person and username tests to use updated anonymized data

dfeacb0

Merge branch 'xdmod8.1' of https://github.com/ubccr/xdmod into add-cl…

24f3155

…oud-user-group-by

jpwhite4 reviewed Feb 28, 2019

View reviewed changes

classes/DataWarehouse/Query/Cloud/GroupBys/GroupByPerson.php Show resolved Hide resolved

jpwhite4 reviewed Feb 28, 2019

View reviewed changes

classes/OpenXdmod/DataWarehouseInitializer.php Show resolved Hide resolved

jpwhite4 reviewed Feb 28, 2019

View reviewed changes

eiffel777 and others added 4 commits February 28, 2019 15:15

removing unnecessary function to write out partial config files. repl…

3b30045

…ace with using JSON::savefile

Merge branch 'xdmod8.1' into add-cloud-user-group-by

a947085

fixing conflict in jobs_cloud_generic

5d06bca

re-adding hide_sql_warning_codes in jobs_cloud_generic that were adde…

523ff88

…d by Ben in a previous PR

chakrabortyr previously approved these changes Feb 28, 2019

View reviewed changes

Merge branch 'xdmod8.1' into add-cloud-user-group-by

203c539

eiffel777 dismissed chakrabortyr’s stale review via 203c539 March 1, 2019 15:16

eiffel777 added 3 commits March 1, 2019 10:23

merging in xdmod8.1

5238cbc

Merge branch 'add-cloud-user-group-by' of https://github.com/eiffel77…

16faff4

…7/xdmod into add-cloud-user-group-by

changing $value to $unused to pass linter

f289ba1

eiffel777 requested review from chakrabortyr and jpwhite4 March 1, 2019 19:02

Merge branch 'xdmod8.1' of https://github.com/ubccr/xdmod into add-cl…

e942b35

…oud-user-group-by

chakrabortyr approved these changes Mar 1, 2019

View reviewed changes

jpwhite4 approved these changes Mar 1, 2019

View reviewed changes

Merge branch 'xdmod8.1' into add-cloud-user-group-by

959f4b9

eiffel777 merged commit 4f28be4 into ubccr:xdmod8.1 Mar 4, 2019

eiffel777 mentioned this pull request Mar 15, 2019

Change cloud person username fields to be not null #860

Merged

6 tasks

eiffel777 added new feature New functionality and removed enhancement Enhancement of the functionality of an existing feature labels Mar 28, 2019

eiffel777 mentioned this pull request Nov 8, 2019

Add truncate_destination directive back to openstack staging event action that was erroneously removed #1159

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add cloud user and system account group by #797

Add cloud user and system account group by #797

eiffel777 commented Feb 13, 2019

jpwhite4 Feb 22, 2019

jpwhite4 Feb 22, 2019

jpwhite4 Feb 22, 2019

jpwhite4 Feb 22, 2019

jpwhite4 Feb 22, 2019

jpwhite4 Feb 22, 2019

jpwhite4 Feb 22, 2019

jpwhite4 Feb 22, 2019

jpwhite4 Feb 28, 2019

chakrabortyr commented Feb 28, 2019

jpwhite4 commented Mar 1, 2019

	$json = Json::prettyPrint(json_encode($data));
	$json = json_encode($data, JSON_PRETTY_PRINT);

Add cloud user and system account group by #797

Add cloud user and system account group by #797

Conversation

eiffel777 commented Feb 13, 2019

Description

Tests performed

Types of changes

Checklist:

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

chakrabortyr commented Feb 28, 2019

jpwhite4 commented Mar 1, 2019