You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request. Searching for pre-existing feature requests helps us consolidate datapoints for identical requirements into a single place, thank you!
Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request.
If you are interested in working on this issue or have submitted a pull request, please leave a comment.
Is your feature request related to a problem? Please describe.
During installation of Consul via Helm chart, mesh-gateway pod is crash-looped with error in initContainer mesh-gateway-init
2023-05-18T11:40:53.115Z [ERROR] Unable to read ACL token; retrying: err="Unexpected response code: 403 (ACL not found)"
2023-05-18T11:40:53.115Z [ERROR] Unable to read ACL token from a Consul server; please check that your server cluster is healthy: err="Unexpected response c
ode: 403 (ACL not found)"
2023-05-18T11:40:53.115Z [ERROR] Consul login failed: error="Unexpected response code: 403 (ACL not found)"
For fetching ACL token, there is a workaround of retrying to fetch ACL tokens in mesh-gateway-init container.
However, the retry logic only takes into consideration of local Raft replication timeout and hard-coded to 2 seconds as of now.
When using WAN Federation, via Mesh Gateway, a customer may face a timed-out issue.
In a secondary server log, the blocking query RPC call is made after the 2 seconds timeout in the mesh-gateway-init container. As can be seen from the server logs
We request to include the ACL replication time between DC into consideration of setting this timeout (e.g. have this timeout value configurable or wait for the replication to happen).
Use Case(s)
Consul on Kubernetes WAN Federation via Mesh Gateways
Consul: v1.13.3+ent
Consul-k8s: v0.49.8
The text was updated successfully, but these errors were encountered:
mkentala
changed the title
Feature Request: All the ACL Replication Time to be a Configurable Value or Wait for the Replication to Happen
Feature Request: Allow the ACL Replication Time to be a Configurable Value or Wait for the Replication to Happen
Jul 21, 2023
The timeout has now been bumped from 2 seconds to 60 seconds in #2656 in order to prevent this from happening. The reason we chose not to make it configurable was because surfacing this field, which is nestled in the weeds of how we perform login felt like it would be very hard, from a UX perspective, to cleanly describe via a line item in the values file. Bumping it to 60 seconds though, should allow for ample time to ensure replication indeed has occurred successfully even in environments where the ping times are very high.
Community Note
Is your feature request related to a problem? Please describe.
During installation of Consul via Helm chart,
mesh-gateway
pod is crash-looped with error in initContainermesh-gateway-init
For fetching ACL token, there is a workaround of retrying to fetch ACL tokens in
mesh-gateway-init
container.consul-k8s/control-plane/subcommand/common/common.go
Line 191 in ed8325b
However, the retry logic only takes into consideration of local Raft replication timeout and hard-coded to 2 seconds as of now.
When using WAN Federation, via Mesh Gateway, a customer may face a timed-out issue.
In a secondary server log, the blocking query RPC call is made after the 2 seconds timeout in the
mesh-gateway-init
container. As can be seen from the server logsManual Workaround
consul-k8s-control-plane acl-init
to
consul-k8s/charts/consul/templates/mesh-gateway-deployment.yaml
Feature Description
We request to include the ACL replication time between DC into consideration of setting this timeout (e.g. have this timeout value configurable or wait for the replication to happen).
Use Case(s)
Consul on Kubernetes WAN Federation via Mesh Gateways
Consul: v1.13.3+ent
Consul-k8s: v0.49.8
The text was updated successfully, but these errors were encountered: