Limit Response info dataset queries #1665

noah-paige · 2024-10-25T16:06:09Z

Feature or Bugfix

Refactor

Detail

Change Response Types for Dataset, Dashboard, and Shares Modules:
- S3Datasets: getDataset, getDatasetTables, and getDatasetStorageLocation to only return Simplified Env / Org
  return types
- ++ Redshift Datasets
- ++ Dataset Shares

Relates

Security

Please answer the questions below briefly where applicable, or write N/A. Based on
OWASP 10.

Does this PR introduce or modify any input fields or queries - this includes
fetching data from storage outside the application (e.g. a database, an S3 bucket)?
- Is the input sanitized?
- What precautions are you taking before deserializing the data you consume?
- Is injection prevented by parametrizing queries?
- Have you ensured no eval or similar functions are used?
Does this PR introduce any functionality or component that requires authorization?
- How have you ensured it respects the existing AuthN/AuthZ mechanisms?
- Are you logging failed auth attempts?
Are you using or adding any cryptographic features?
- Do you use a standard proven implementations?
- Are the used keys controlled by the customer? Where are they stored?
Are you introducing any new policies/roles/users?
- Have you used the least-privilege principle? How?

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

noah-paige · 2024-10-25T16:07:55Z

Testing Completed:

All Dataset Tabs Load ++ getDataset returns with correct information on FE
All Dataset Tabs Load ++ getDatasetTables returns with correct information on FE
All Dataset Tabs Load ++ getDatasetStorageLocation returns with correct information on FE
All Dataset Tabs Load ++ getRedshiftDataset returns with correct information on FE
All Dataset Tabs Load ++ getRedshiftDatasetTable returns with correct information on FE
All Dashboard Tabs Load ++ getDashboard returns with correct information on FE
Shares List Tabs Load ++ getShareRequestsFromMe and getShareRequestsToMe returns with correct information on FE
ShareObjectView Loads + getShareObject returns with correct information

SofiaSazonova · 2024-10-28T11:42:34Z

Error in query ListDatasets (/console/datasets) "Cannot query field 'organization' on type 'DatasetBase'."

SofiaSazonova · 2024-10-28T12:00:23Z

Is it necessary to change Environment => SimplifiedEnvironment also in queries?

ListDataPipelines
GetDataPipeline
ListOmicsRun
getSagemakerNotebook
ListSagemakerNotebooks
ListSagemakerStudioUsers
getSagemakerStudioUser

SofiaSazonova

An error and a question

SofiaSazonova · 2024-10-28T12:03:25Z

Also, a comment: in dataset e.g. if we change name, label doesn't change (or it was a bug). Will it work the same for environments? If so, we risk to have confusing data displayed if use label instead of name

noah-paige · 2024-10-28T13:12:24Z

Error in query ListDatasets (/console/datasets) "Cannot query field 'organization' on type 'DatasetBase'."

Good catch resolved now

noah-paige · 2024-10-28T13:17:59Z

For the other listed queries:

ListDataPipelines
GetDataPipeline
ListOmicsRun
getSagemakerNotebook
ListSagemakerNotebooks
ListSagemakerStudioUsers
getSagemakerStudioUser

The above could be addressed but I left out of this PR for the following reason:

The risk that this PR mitigates is for resources that other (non-Owner) teams can view (i.e. shareable resources)
- This PR limits the amount of information that a team may be able to extract about the parent env/org of a dataset or dashboard that they have approved share access to
For other resources (pipelines, notebooks, omics runs, mlstudio) the user must be a part of the team which is invited to the env and org already - meaning there is no issue with the user also extracting information about parent env because they should already be able to

noah-paige · 2024-10-28T13:25:17Z

Also, a comment: in dataset e.g. if we change name, label doesn't change (or it was a bug). Will it work the same for environments? If so, we risk to have confusing data displayed if use label instead of name

This is outside of the scope of this PR yes? I actually think we should prevent updates to label for datasets or envs or any resources in dataall unless we are certain they have no impact on provisioned resources via CDK

SofiaSazonova · 2024-10-28T14:06:59Z

This is outside of the scope of this PR yes? I actually think we should prevent updates to label for datasets or envs or any resources in dataall unless we are certain they have no impact on provisioned resources via CDK

I guess so. I think it's quite rare case anyway

- Refactor - Change Response Types for Dataset, Dashboard, and Shares Modules: - S3Datasets: `getDataset`, `getDatasetTables`, and `getDatasetStorageLocation` to only return Simplified Env / Org return types - ++ Redshift Datasets - ++ Dataset Shares - <URL or Ticket> Please answer the questions below briefly where applicable, or write `N/A`. Based on [OWASP 10](https://owasp.org/Top10/en/). - Does this PR introduce or modify any input fields or queries - this includes fetching data from storage outside the application (e.g. a database, an S3 bucket)? - Is the input sanitized? - What precautions are you taking before deserializing the data you consume? - Is injection prevented by parametrizing queries? - Have you ensured no `eval` or similar functions are used? - Does this PR introduce any functionality or component that requires authorization? - How have you ensured it respects the existing AuthN/AuthZ mechanisms? - Are you logging failed auth attempts? - Are you using or adding any cryptographic features? - Do you use a standard proven implementations? - Are the used keys controlled by the customer? Where are they stored? - Are you introducing any new policies/roles/users? - Have you used the least-privilege principle? How? By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

### Feature or Bugfix - Security ### Detail * get-parameter CloudfrontDistributionDomainName from us-east-1 (#1687 ) * Added Token Validations (#1682) * add warning to untrust data.all account when removing an environment (#1685) * add custom domain support for apigw (#1679) * Lambda Event Logs Handling (#1678) * Upgrade Spark version to 3.3 (#1675) - a0c63a4 * ES Search Query Collect All Response (#1631) * Extend Tenant Perms Coverage (#1630) * Limit Response info dataset queries (#1665) * Add Removal Policy Retain to Bucket Policy IaC (#1660) * log API handler response only for LOG_LEVEL DEBUG. Set log level INFO for prod deployments (#1662) * Add permission checks to markNotificationAsRead + deleteNotification (#1654) * Added error view and unified utility to check tenant user (#1657 * Userguide signout flow (#1629) ### Relates - Security release ### Security Please answer the questions below briefly where applicable, or write `N/A`. Based on [OWASP 10](https://owasp.org/Top10/en/). - Does this PR introduce or modify any input fields or queries - this includes fetching data from storage outside the application (e.g. a database, an S3 bucket)? - Is the input sanitized? - What precautions are you taking before deserializing the data you consume? - Is injection prevented by parametrizing queries? - Have you ensured no `eval` or similar functions are used? - Does this PR introduce any functionality or component that requires authorization? - How have you ensured it respects the existing AuthN/AuthZ mechanisms? - Are you logging failed auth attempts? - Are you using or adding any cryptographic features? - Do you use a standard proven implementations? - Are the used keys controlled by the customer? Where are they stored? - Are you introducing any new policies/roles/users? - Have you used the least-privilege principle? How? By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license. --------- Co-authored-by: Noah Paige <69586985+noah-paige@users.noreply.github.com> Co-authored-by: Petros Kalos <kalosp@amazon.com>

Limit Response info dataset queries

d9cac0a

noah-paige added 4 commits October 25, 2024 12:54

fix datasets and add simplified env to dashboards

f6c146b

enforce env simplified share object

2e3c9f0

fix tests

0978498

fix GQL Query integ tests

8e6ab5d

SofiaSazonova requested changes Oct 28, 2024

View reviewed changes

SofiaSazonova assigned SofiaSazonova and unassigned SofiaSazonova Oct 28, 2024

Remove organization from listDataset query response

35b6fb3

SofiaSazonova approved these changes Oct 28, 2024

View reviewed changes

noah-paige self-assigned this Oct 28, 2024

noah-paige merged commit 8e947d9 into main Oct 28, 2024
9 checks passed

dlpzx mentioned this pull request Nov 6, 2024

2.6.1 Security features #1686

Merged

dlpzx deleted the fix/restrict-dataset-gql-types branch November 22, 2024 11:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Limit Response info dataset queries #1665

Limit Response info dataset queries #1665

noah-paige commented Oct 25, 2024 •

edited

Loading

noah-paige commented Oct 25, 2024 •

edited

Loading

SofiaSazonova commented Oct 28, 2024 •

edited

Loading

SofiaSazonova commented Oct 28, 2024

SofiaSazonova left a comment

SofiaSazonova commented Oct 28, 2024

noah-paige commented Oct 28, 2024

noah-paige commented Oct 28, 2024

noah-paige commented Oct 28, 2024

SofiaSazonova commented Oct 28, 2024

Limit Response info dataset queries #1665

Limit Response info dataset queries #1665

Conversation

noah-paige commented Oct 25, 2024 • edited Loading

Feature or Bugfix

Detail

Relates

Security

noah-paige commented Oct 25, 2024 • edited Loading

SofiaSazonova commented Oct 28, 2024 • edited Loading

SofiaSazonova commented Oct 28, 2024

SofiaSazonova left a comment

Choose a reason for hiding this comment

SofiaSazonova commented Oct 28, 2024

noah-paige commented Oct 28, 2024

noah-paige commented Oct 28, 2024

noah-paige commented Oct 28, 2024

SofiaSazonova commented Oct 28, 2024

noah-paige commented Oct 25, 2024 •

edited

Loading

noah-paige commented Oct 25, 2024 •

edited

Loading

SofiaSazonova commented Oct 28, 2024 •

edited

Loading