Align model card requirements (nv-morpheus#1388)

Authors: - Devin Robison (https://github.com/drobison00) Approvers: - David Gardner (https://github.com/dagardner-nv) - Michael Demoret (https://github.com/mdemoret-nv) URL: nv-morpheus#1388
dagardner-nv · Nov 27, 2023 · 5f00e78 · 5f00e78
1 parent 119ec42
commit 5f00e78
Show file tree

Hide file tree

Showing 5 changed files with 13 additions and 107 deletions.
diff --git a/models/model-cards/abp-model-card.md b/models/model-cards/abp-model-card.md
@@ -94,10 +94,6 @@ limitations under the License.
 
 * Sample dataset consists of over 1000 nvidia-smi outputs <br>
 
-**Dataset License:** 
-
-* [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0) <br>
-
 ## Evaluation Dataset:
 
 **Link:** 
@@ -108,10 +104,6 @@ limitations under the License.
 
 * Sample dataset consists of over 1000 nvidia-smi outputs <br>
 
-**Dataset License:** 
-
-* [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0)<br>
-
 ## Inference:
 
 **Engine:** 
@@ -213,10 +205,6 @@ limitations under the License.
 
 * N/A
 
-### What training is recommended for developers working with this model?
-
-* Familiarity with the Morpheus SDK is recommended for developers working with this model.
-
 ### Link the relevant end user license agreement 
 
 * [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0)
@@ -256,10 +244,6 @@ limitations under the License.
 
 * N/A
 
-### Technical robustness and model security validated?
-
-* No
-
 ### Is the model and dataset compliant with National Classification Management Society (NCMS)?
 
 * No
@@ -308,10 +292,6 @@ limitations under the License.
 
 * N/A
 
-### Scanned for malware?
-
-* No
-
 ### Are we able to identify and trace source of dataset?
 
 * Yes

diff --git a/models/model-cards/dfp-model-card.md b/models/model-cards/dfp-model-card.md
@@ -83,9 +83,6 @@ The training dataset consists of AWS CloudTrail logs. It contains logs from two
 * [hammah-user123-training-part3.json](https://github.com/nv-morpheus/Morpheus/blob/branch-23.11/models/datasets/training-data/cloudtrail/hammah-user123-training-part3.json): 1000 records <br>
 * [hammah-user123-training-part4.json](https://github.com/nv-morpheus/Morpheus/blob/branch-23.11/models/datasets/training-data/cloudtrail/hammah-user123-training-part4.json): 387 records <br>
 
-**Dataset License:**
-* [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0)<br>
-
 ## Evaluation Dataset:
 **Link:**
 * https://github.com/nv-morpheus/Morpheus/tree/branch-23.11/models/datasets/validation-data/cloudtrail <br>
@@ -98,9 +95,6 @@ The evaluation dataset consists of AWS CloudTrail logs. It contains logs from tw
 * [hammah-user123-validation-part2.json](https://github.com/nv-morpheus/Morpheus/blob/branch-23.11/models/datasets/validation-data/cloudtrail/hammah-user123-validation-part2.json): 300 records
 * [hammah-user123-validation-part3.json](https://github.com/nv-morpheus/Morpheus/blob/branch-23.11/models/datasets/validation-data/cloudtrail/hammah-user123-validation-part3.json): 247 records
 
-**Dataset License:**
-* [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0)<br>
-
 ## Inference:
 **Engine:**
 * PyTorch
@@ -179,9 +173,6 @@ The evaluation dataset consists of AWS CloudTrail logs. It contains logs from tw
 ### What are the potential known risks to users and stakeholders?
 * None
 
-### What training is recommended for developers working with this model? If none, please state "none."
-* Familiarity with the Morpheus SDK is recommended for developers working with this model.
-
 ### Link the relevant end user license agreement
 * [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0)
 
@@ -211,9 +202,6 @@ The evaluation dataset consists of AWS CloudTrail logs. It contains logs from tw
 ### Name target quality Key Performance Indicators (KPIs) for which this has been tested.
 * None
 
-### Technical robustness and model security validated?
-* No
-
 ### Is the model and dataset compliant with National Classification Management Society (NCMS)?
 * No
 
@@ -251,9 +239,6 @@ The evaluation dataset consists of AWS CloudTrail logs. It contains logs from tw
 ### Is data in dataset traceable?
 * No
 
-### Scanned for malware?
-* No
-
 ### Are we able to identify and trace source of dataset?
 * Yes ([fully synthetic dataset](https://github.com/nv-morpheus/Morpheus/tree/branch-23.11/models/datasets/training-data/cloudtrail))
 

diff --git a/models/model-cards/gnn-fsi-model-card.md b/models/model-cards/gnn-fsi-model-card.md
@@ -81,19 +81,13 @@ This model is an example of a fraud detection pipeline using a graph neural netw
 **Properties (Quantity, Dataset Descriptions, Sensor(s)):** 
 * A training data consists of raw 753 synthetic labeled credit card transaction data with data augmentation in a total of 12053 labeled transaction data. <br>
 
-**Dataset License:** 
-* [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0) <br>
-
 ## Evaluation Dataset:
 **Link:** 
 * [fraud-detection-validation-data.csv](models/dataset/fraud-detection-validation-data.csv) <br>
 
 **Properties (Quantity, Dataset Descriptions, Sensor(s)):** 
 * Data consists of raw 265 labeled credit card transaction synthetically created<br>
 
-**Dataset License:**
-* [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0)<br>
-
 ## Inference:
 **Engine:** 
 * Triton <br>
@@ -124,16 +118,21 @@ This model is an example of a fraud detection pipeline using a graph neural netw
 
 ### What is the accent balance of the model validation data?
 * Not Applicable
+
 ### Describe measures taken to mitigate against unwanted bias.
 * Not Applicable
+
 ## Model Card ++ Explainability Subcard
 
 ### Name example applications and use cases for this model. 
 * The model is primarily designed for testing purposes and serves as a small pretrained model specifically used to evaluate and validate the GNN FSI pipeline. Its application is focused on assessing the effectiveness of the pipeline rather than being intended for broader use cases or specific applications beyond testing.
+
 ### Fill in the blank for the model technique.
 * This model is designed for developers seeking to test the GNN fraud detection pipeline with a small pretrained model on a synthetic dataset.
+
 ### Name who is intended to benefit from this model. 
 * The intended beneficiaries of this model are developers who aim to test the performance and functionality of the GNN fraud detection pipeline using synthetic datasets. It may not be suitable or provide significant value for real-world transactions. 
+
 ### Describe the model output.
 * This model outputs fraud probability score b/n (0 & 1). 
 
@@ -152,8 +151,6 @@ This model is an example of a fraud detection pipeline using a graph neural netw
 ### What are the potential known risks to users and stakeholders? 
 * None
 
-### What training is recommended for developers working with this model? If none, please state "none."
-* Familiarity with the Morpheus SDK is recommended for developers working with this model.
 ### Link the relevant end user license agreement 
 * [Apache 2.0](https://github.com/nv-morpheus/Morpheus/blob/branch-23.11/LICENSE)
 
@@ -170,17 +167,19 @@ This model is an example of a fraud detection pipeline using a graph neural netw
 
 ### Was model and dataset assessed for vulnerability for potential form of attack?
 * No
+
 ### Name applications for the model.
 * Used for testing fraud detection application in Morpheus pipeline, under the defined dataset schema description.
+
 ### Name use case restrictions for the model.
 * The model's use case is restricted to testing the Morpheus pipeline and may not be suitable for other applications.
+
 ### Has this been verified to have met prescribed quality standards?
 * No
 
 ### Name target quality Key Performance Indicators (KPIs) for which this has been tested. 
 * Not Applicable
-### Technical robustness and model security validated?
-* Not Applicable
+
 ### Is the model and dataset compliant with National Classification Management Society (NCMS)?
 * Not Applicable
 
@@ -189,38 +188,38 @@ This model is an example of a fraud detection pipeline using a graph neural netw
 
 ### Are there access restrictions to systems, model, and data?
 * No
+
 ### Is there a digital signature?
 * No
 
 ## Model Card ++ Privacy Subcard
 
 ### Generatable or reverse engineerable personally-identifiable information (PII)?
-
 * Neither
 
 ### Was consent obtained for any PII used?
 * Not Applicable (Data is extracted from synthetically created credit card transaction,refer[3] for the source of data creation)
 
 ### Protected classes used to create this model? (The following were used in model the model's training:)
-
 * Not applicable
 
 ### How often is dataset reviewed?
 * The dataset is initially reviewed upon addition, and subsequent reviews are conducted as needed or upon request for any changes.
 
 ### Is a mechanism in place to honor data
 * Yes
+
 ### If PII collected for the development of this AI model, was it minimized to only what was required? 
 * Not applicable
 
 ### Is data in dataset traceable?
 * No
-### Scanned for malware?
-* No
+
 ### Are we able to identify and trace source of dataset?
 * Yes
 
 ### Does data labeling (annotation, metadata) comply with privacy laws?
 * Not applicable
+
 ### Is data compliant with data subject requests for data correction or removal, if such a request was made?
 * Not applicable
diff --git a/models/model-cards/phishing-model-card.md b/models/model-cards/phishing-model-card.md
@@ -96,10 +96,6 @@ limitations under the License.
 
 * Dataset consists of SMSs <br>
 
-**Dataset License:** 
-
-* https://creativecommons.org/licenses/by/4.0/legalcode taken from https://archive.ics.uci.edu/dataset/228/sms+spam+collection <br>
-
 ## Evaluation Dataset:
 
 **Link:** 
@@ -110,10 +106,6 @@ limitations under the License.
 
 * Dataset consists of SMSs <br>
 
-**Dataset License:** 
-
-* https://creativecommons.org/licenses/by/4.0/legalcode taken from https://archive.ics.uci.edu/dataset/228/sms+spam+collection <br>
-
 ## Inference:
 
 **Engine:** 
@@ -207,9 +199,6 @@ limitations under the License.
 ### What are the potential known risks to users and stakeholders?
 * N/A
 
-### What training is recommended for developers working with this model?
-* Familiarity with the Morpheus SDK is recommended for developers working with this model.
-
 ### Link the relevant end user license agreement 
 * [Apache 2.0](http://www.apache.org/licenses/LICENSE-2.0)
 
@@ -240,9 +229,6 @@ limitations under the License.
 ### Name target quality Key Performance Indicators (KPIs) for which this has been tested. 
 * N/A
 
-### Technical robustness and model security validated?
-* No
-
 ### Is the model and dataset compliant with National Classification Management Society (NCMS)?
 * No
 
@@ -281,9 +267,6 @@ limitations under the License.
 ### Is data in dataset traceable?
 * N/A
 
-### Scanned for malware?
-* No
-
 ### Are we able to identify and trace source of dataset?
 * N/A