Per Feature Feature Flags #5632

dibyom · 2022-10-11T21:34:55Z

Feature request

TEP-033 proposed an enable-api-fields flag for features that require an API field. This allows us to group and enable/disable all "alpha"/"beta" features together.

An alternative approach is having a way to enable/disable each feature by via its own flag (such as the approach taken by Kubernetes feature gates)

This issue is for tracking experimentation for per feature feature gates.

Use case

A user would like to enable a specific alpha feature but not any other alpha features

The text was updated successfully, but these errors were encountered:

dibyom · 2022-10-11T22:01:53Z

One though I just had - per TEP-033, fields that are behind an alpha/beta flag can be gradually transitioned from alpha to beta to GA. For features that are not controlled by a field, TEP-033 says they should have their own flag. But I do not think it is possible to move them gradually from alpha to beta to GA and instead they must either be enabled or disabled within a particular API version

Yongxuanzhang · 2022-10-20T15:56:15Z

comment for tracking this issue

Yongxuanzhang · 2022-10-20T16:00:58Z

I encounter this issue when implementing trusted resources in #5581
We have a feature flag ResourceVerificationMode to let users choose from skip, enforce and warn. So if it is set to skip the feature will not be enabled.
This is an alpha feature and it may not be ideal to let users enable 2 flags (ResourceVerificationMode and enable-api-fields).

dibyom · 2022-10-20T19:30:14Z

@chuangw6 also mentioned the same thing for his ConfigSource status field proposal

chuangw6 · 2022-10-20T20:47:17Z

We have a feature flag ResourceVerificationMode to let users choose from skip, enforce and warn. So if it is set to skip the feature will not be enabled.

Thanks @dibyom. This is a good idea.
#5670 is the implementation that adds a dedicated feature flag to control the provenance field in taskrun/pipelinerun status.

afrittoli · 2022-10-20T21:24:00Z

To give extra background, the reason we decided to have a single flag instead of per-feature ones is the amount of testing that that would require, because while features are (often) clearly separated from one another at the API level, they may be more intertwined in the controller code - so enabling one feature out of 10 may behave differently from enabling all 10.

The alternative to feature flags (whether per feature or just one) would be to create v2alphaN APIs and v2betaN APIs.
Multiple features would be included in each alpha and beta version, similar to the way features are grouped under one flag today.

About k8s feature flags, I may be wrong but I think they mainly relate to back-end behaviour, even though there is no clear distinction documented like for Tekton.

If we decide to have feature-specific flags for all features, we shall document the test strategy. I believe each feature should then come with a dedicated set of e2e tests (like today) to be executed in an environment with only that feature enabled.
We could replace the current alpha test with an env where all features in alpha state are enabled.

dibyom · 2022-11-04T01:57:05Z

To give extra background, the reason we decided to have a single flag instead of per-feature ones is the amount of testing that that would require, because while features are (often) clearly separated from one another at the API level, they may be more intertwined in the controller code - so enabling one feature out of 10 may behave differently from enabling all 10.

I agree, the testing story is complicated but I don't think that by itself should be why we should only have a single flag. The most common use case for alpha features is to try out a single alpha feature, not all of them altogether.

Feature flags implementations being intertwined is not great either - wherever possible a feature flag should only control the behavior of that specific feature and nothing else. I agree this can be tricky - K8s checks for this sort of thing in their Production Readiness Reviews - we could do something similar and have some guidelines to specifically check for this when adding a new feature gate.

From a testing standpoint I think a reasonable option is to test will all feature flags on and all of them off (like we do today). We can document this and users who turn on only a subset of the flags can be responsible for their own testing.

The alternative to feature flags (whether per feature or just one) would be to create v2alphaN APIs and v2betaN APIs.
Multiple features would be included in each alpha and beta version, similar to the way features are grouped under one flag today.

In theory yes, but I think the complexity involved in maintaining N active API versions is far greater than just having feature flags (e.g. there can only be one api version that is stored so that version has to be able to represent info from all N API versions)

About k8s feature flags, I may be wrong but I think they mainly relate to back-end behaviour, even though there is no clear distinction documented like for Tekton.

I think many of the feature gates are for backend features but there also user facing ones such as the PodSecurity one that gates the new PodSecurityAdmission feature which replaces PodSecurityPolicies

If we decide to have feature-specific flags for all features, we shall document the test strategy. I believe each feature should then come with a dedicated set of e2e tests (like today) to be executed in an environment with only that feature enabled.

💯 . I'm not sure if it is realistic to test every combination of flags but we can definitely clearly state what we test and users who use a different subset of flags can be responsible for their own testing. That being said, I do think we should do our best to make sure a feature gate is only scoped to that particular feature.

We could replace the current alpha test with an env where all features in alpha state are enabled.

Agreed!

tekton-robot · 2023-02-02T02:11:21Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale with a justification.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle stale

Send feedback to tektoncd/plumbing.

tekton-robot · 2023-03-04T02:24:52Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten with a justification.
Rotten issues close after an additional 30d of inactivity.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle rotten

Send feedback to tektoncd/plumbing.

tekton-robot · 2023-04-03T02:59:20Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen with a justification.
Mark the issue as fresh with /remove-lifecycle rotten with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/close

Send feedback to tektoncd/plumbing.

tekton-robot · 2023-04-03T02:59:22Z

@tekton-robot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen with a justification.
Mark the issue as fresh with /remove-lifecycle rotten with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/close

Send feedback to tektoncd/plumbing.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

dibyom · 2023-05-03T14:24:16Z

Reopening this in light of #6592 - we should consider whether per feature flags can help prevent issues such as #6607

lbernick · 2023-05-08T20:39:39Z

I wanted to share this doc with some thoughts from a k8s maintainer on some of the issues they've run into w/ per-feature flags. A lot of it is related to rollbacks which I think we've been less concerned about than k8s has. To make rollbacks easier, they drop unsupported fields rather than rejecting CRDs containing them, as we do. I still think we should implement per-feature flags, but this doc could help us understand what issues we might run into.

The issues are basically:

adding version validation in implementation in addition to apiserver operations (this has caused problems for us as well with "enable-api-fields", see Versioned validation of referenced Pipelines/Tasks #6616)
concerns about rolling back from a more stable implementation of a feature to a less stable implementation of a feature
different versions of k8s or values of feature flags between apiserver and node (not relevant for us)
minimal feedback on alpha features

JeromeJu · 2023-10-26T18:21:27Z

Looks like we can also close this with the completion of #7090 at v0.53.0 milestone.

dibyom added the kind/feature Categorizes issue or PR as related to a new feature. label Oct 11, 2022

xchapter7x added this to Tekton Community Roadmap Oct 11, 2022

xchapter7x moved this to Todo in Tekton Community Roadmap Oct 11, 2022

dibyom mentioned this issue Oct 11, 2022

Update API compatibility policy for V1/GA #5633

Closed

lbernick mentioned this issue Nov 23, 2022

Array indexing for v1beta1 #5769

Closed

tekton-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 2, 2023

tekton-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 4, 2023

tekton-robot closed this as completed Apr 3, 2023

github-project-automation bot moved this from Todo to Done in Tekton Community Roadmap Apr 3, 2023

dibyom reopened this May 3, 2023

github-project-automation bot moved this from Done to In Progress in Tekton Community Roadmap May 3, 2023

dibyom removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label May 3, 2023

lbernick mentioned this issue May 4, 2023

Decoupling API versioning and Feature versioning for features turned on by default #6592

Closed

2 tasks

lbernick added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label May 4, 2023

chuangw6 mentioned this issue May 9, 2023

Promote the provenance field in status #6495

Merged

5 tasks

lbernick mentioned this issue May 16, 2023

Versioned validation of referenced Pipelines/Tasks #6616

Closed

JeromeJu mentioned this issue Jun 12, 2023

Change the Storage Version to V1 Types #6444

Merged

7 tasks

lbernick mentioned this issue Jun 16, 2023

Better support for downgrades/rollbacks #6841

Open

tekton-robot assigned JeromeJu Jun 20, 2023

JeromeJu mentioned this issue Jul 24, 2023

Epic: Improve feature versioning #6966

Closed

13 tasks

vdemeester mentioned this issue Aug 3, 2023

[TEP-0138]: Decouple API and Feature Versioning Proposal tektoncd/community#1034

Merged

JeromeJu mentioned this issue Feb 12, 2024

TEP0138 Per-feature Flags Improvement to Decouple API and Feature Versioning #7177

Closed

10 tasks

JeromeJu closed this as completed Feb 12, 2024

github-project-automation bot moved this from In Progress to Done in Tekton Community Roadmap Feb 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Per Feature Feature Flags #5632

Per Feature Feature Flags #5632

dibyom commented Oct 11, 2022 •

edited

Loading

dibyom commented Oct 11, 2022 •

edited

Loading

Yongxuanzhang commented Oct 20, 2022

Yongxuanzhang commented Oct 20, 2022

dibyom commented Oct 20, 2022

chuangw6 commented Oct 20, 2022

afrittoli commented Oct 20, 2022

dibyom commented Nov 4, 2022

tekton-robot commented Feb 2, 2023

tekton-robot commented Mar 4, 2023

tekton-robot commented Apr 3, 2023

tekton-robot commented Apr 3, 2023

dibyom commented May 3, 2023

lbernick commented May 8, 2023

JeromeJu commented Oct 26, 2023

Per Feature Feature Flags #5632

Per Feature Feature Flags #5632

Comments

dibyom commented Oct 11, 2022 • edited Loading

Feature request

Use case

dibyom commented Oct 11, 2022 • edited Loading

Yongxuanzhang commented Oct 20, 2022

Yongxuanzhang commented Oct 20, 2022

dibyom commented Oct 20, 2022

chuangw6 commented Oct 20, 2022

afrittoli commented Oct 20, 2022

dibyom commented Nov 4, 2022

tekton-robot commented Feb 2, 2023

tekton-robot commented Mar 4, 2023

tekton-robot commented Apr 3, 2023

tekton-robot commented Apr 3, 2023

dibyom commented May 3, 2023

lbernick commented May 8, 2023

JeromeJu commented Oct 26, 2023

dibyom commented Oct 11, 2022 •

edited

Loading

dibyom commented Oct 11, 2022 •

edited

Loading