-
Notifications
You must be signed in to change notification settings - Fork 764
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Helm Upgrade Failing #3250
Comments
Just to compliment, this is what I get when checking the readiness endpoint of
Not very informative, which tracks back to this issue #696 |
Have confirmed that the version upgrade will work only if the constraints and templates are never deployed while gatekeeper is installed.
|
Another update: I have found that if the following is run before the |
Commenting to keep this alive, any ideas? |
If you create a config resource with example config: apiVersion: config.gatekeeper.sh/v1alpha1
kind: Config
metadata:
name: config
namespace: "gatekeeper-system"
spec:
readiness:
statsEnabled: true |
3.14.0 also contained a readiness fix, maybe upgrading to that version would remediate the issue? https://github.com/open-policy-agent/gatekeeper/releases/tag/v3.14.0 |
Hi @maxsmythe
|
Glad I could help! |
What steps did you take and what happened:
Trying to bump our gatekeeper version from
v3.12.0
tov3.13.3
using helm.$ helm upgrade --install gatekeeper gatekeeper/gatekeeper --namespace gatekeeper-system --version 3.13.3 -f values.yaml --wait --timeout=180s
This just times out when trying to directly update, there was no change in the values file.
In debug mode I everything runs as expected until:
ready.go:281: [debug] Deployment is not ready: gatekeeper-system/gatekeeper-audit. 0 out of 1 expected pods are ready
I can see the audit and controller-manager pods are trying to launch, however the pods never become ready:
Doing a describe on these shows that the Readiness probe is failing, which I think suggests an application failure to launch:
Readiness probe failed: HTTP probe failed with statuscode: 500
I managed to successfully bump the version if I uninstall all constraints/templates/mutations, uninstall gatekeeper, then reinstall the lot again from scratch.
What didn't work is if I leave gatekeeper itself installed, then delete constraints/templates/mutations, then try bumping the version.
So this has lead me to the conclusion that gatekeeper needs to be entirely 'clean' before upgrading which doesn't feel right.
In the past we have successfully bumped the version with no issues.
What did you expect to happen:
The new version was installed, and gatekeeper to continue working as normal.
Anything else you would like to add:
Relevant values for audit and controller manager:
Environment:
kubectl version
): v1.26.12The text was updated successfully, but these errors were encountered: