Introducing: polaris.readiness.ignore-offending-properties #2472

fivetran-kostaszoumpatianos · 2025-08-29T05:10:01Z

This PR introduces a polaris.readiness.ignore-offending-properties config that accepts a map of properties for which readiness checks are suppressed.

It can be used, for example, as follows:

polaris.readiness.ignore-offending-properties=\
  polaris.metrics.user-principal-tag.enable-in-api-metrics,\
  polaris.features."ALLOW_INSECURE_STORAGE_TYPES",\
  polaris.features."SUPPORTED_CATALOG_STORAGE_TYPES"

The check performed is case-insesitive.

fivetran-kostaszoumpatianos · 2025-09-02T08:52:37Z

@dimas-b @adutra @eric-maynard could you maybe take a look at this PR? thanks!

dimas-b · 2025-09-02T14:25:21Z

runtime/service/src/main/java/org/apache/polaris/service/config/ProductionReadinessChecks.java

+            .filter(
+                error ->
+                    config.ignoreOffendingProperties().stream()
+                        .noneMatch(prop -> prop.equalsIgnoreCase(error.offendingProperty())))


This approach LGTM in general. However, WDYT about adding a new "ID" property to Error, e.g. error.getId().

The ID could be a static constant for cases where there could only ever be one Error instance per "check" code (e.g. checkUserPrincipalMetricTag) or it could be a deterministic (hash) function of the "type" plus some parameters (e.g. checkInsecureStorageSettings could produce IDs like storage-17af38 and storage-46fq98).

The idea is that admin users should suppress specific error instances, but not "ranges" of errors. This way, if an admin user suppresses one particular check cases, new checks will still be visible when Polaris adds them. The value of error.offendingProperty() may still be too broad in some cases.

The "hash" part being deterministic will allow admin users to propagate the same configuration to all their deployment environments. At the same time, it is not easy to guess, which will force the admin user to review what exactly needs to be suppressed. Also, if the meaning of the error changes, we can change the ID, and it will force the admin users to reassess the implications (and re-suppress).

WDYT?

The idea is that admin users should suppress specific error instances

What would this flow look like, though? Is this to support cases like I want to allow setting config X, but not Y, and I want to allow setting config Z to A or B but not to C.? I fear we are at risk of overengineering this a bit. As it is, only admins have access to these configs.

from my POV, this is not so much about config A=X or A=Y, but more about "Polaris detected something dangerous about X". Now, if the admin user suppresses this warning, I do not want the suppression to automatically hide future warnings about "dangerous Y".

It may be related to some specific config, but may be not. I can imagine running as the root OS user falls under the same category of auto-detectable issues.

Filtering based on error ids would also fit well with the naming change that you propose @eric-maynard then we can call the ignore method: ignoreSelectedIssues since we will now have a way of filtering by issue. Maybe the parameter hash is a bit too much. For me, I would be ok deactivating a check altogether if I know that I have a dangerous config there. It depends on how cautious we want to be.

I'd be ok without the parameter hash for a start. However, having a concise but non-predictable error ID is important I think. That is to say, a user suppressing a particular error must first observe the error. It should not be easy to suppress something "proactively" :) At the same time the error ID should not be dependent on the runtime env. (i.e. be the same in all k8s pods, for example). WDYT?

eric-maynard · 2025-09-08T20:53:29Z

runtime/service/src/main/java/org/apache/polaris/service/config/ReadinessConfiguration.java

+   * production readiness.
+   */
+  @WithDefault("{}")
+  Set<String> ignoreOffendingProperties();


Why the asymmetry between this and ignoreSevereIssues? IIUC this is basically a subset of severe (?) issues that the admin wants to configure the readiness check to ignore. Maybe ignoreSelectIssues?

Because technically the check is based on the offending property and not the issue type. ignoreIssuesForSelectedOffendingProperties ? maybe too much?

github-actions · 2025-10-13T02:08:36Z

This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

Added parameter and filter

e6bbd03

github-project-automation bot added this to Basic Kanban Board Aug 29, 2025

github-project-automation bot moved this to PRs In Progress in Basic Kanban Board Aug 29, 2025

fivetran-kostaszoumpatianos mentioned this pull request Aug 29, 2025

Add user principal tag in metrics #2445

Merged

dimas-b reviewed Sep 2, 2025

View reviewed changes

eric-maynard reviewed Sep 8, 2025

View reviewed changes

github-actions bot added the Stale label Oct 13, 2025

github-actions bot closed this Oct 19, 2025

github-project-automation bot moved this from PRs In Progress to Done in Basic Kanban Board Oct 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Introducing: polaris.readiness.ignore-offending-properties #2472

Introducing: polaris.readiness.ignore-offending-properties #2472

Uh oh!

fivetran-kostaszoumpatianos commented Aug 29, 2025

Uh oh!

fivetran-kostaszoumpatianos commented Sep 2, 2025

Uh oh!

dimas-b Sep 2, 2025

Uh oh!

eric-maynard Sep 8, 2025

Uh oh!

dimas-b Sep 8, 2025

Uh oh!

fivetran-kostaszoumpatianos Sep 11, 2025 •

edited

Loading

Uh oh!

dimas-b Sep 12, 2025

Uh oh!

eric-maynard Sep 8, 2025

Uh oh!

fivetran-kostaszoumpatianos Sep 11, 2025

Uh oh!

github-actions bot commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Introducing: polaris.readiness.ignore-offending-properties #2472

Introducing: polaris.readiness.ignore-offending-properties #2472

Uh oh!

Conversation

fivetran-kostaszoumpatianos commented Aug 29, 2025

Uh oh!

fivetran-kostaszoumpatianos commented Sep 2, 2025

Uh oh!

dimas-b Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

eric-maynard Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

dimas-b Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

fivetran-kostaszoumpatianos Sep 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dimas-b Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

eric-maynard Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

fivetran-kostaszoumpatianos Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fivetran-kostaszoumpatianos Sep 11, 2025 •

edited

Loading