KEP-4872: Harden Kubelet serving cert validation #4911

Open

g-gaston wants to merge 2 commits into kubernetes:master from g-gaston:harden-kubelet-cert-validation

g-gaston commented Oct 9, 2024

One-line PR description: Add first version of doc

Issue link: Harden Kubelet Serving Certificate Validation in Kube-API server #4872

Other comments: I left a few TODOs with things I think can use a discussion.


          Add KEP-4872 Harden Kubelet serving cert validation

8f10dbc

k8s-ci-robot added the cncf-cla: yes label

k8s-ci-robot requested review from mikedanese and ritazh

October 9, 2024 02:32

k8s-ci-robot added kind/kep sig/auth labels

Contributor

k8s-ci-robot commented Oct 9, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: g-gaston
Once this PR has been reviewed and has the lgtm label, please assign ritazh for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

keps/sig-auth/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the size/L label

g-gaston changed the title ~~KEP-4872: Add Harden Kubelet serving cert validation~~ KEP-4872: Harden Kubelet serving cert validation

sftim reviewed

keps/sig-auth/4872-harden-kubelet-cert-validation/kep.yaml

+              # The following PRR answers are required at alpha release
+              # List the feature gate name and the components for which it must be enabled
+              feature-gates:
+                - name: KubeletCertCNValidation

Contributor

sftim Oct 10, 2024

How about: NodeCertificateNameValidation?

The API server is verifying a certificate presented by a node, whatever software that node is running (which, OK, is very likely to be kubelet).

sftim reviewed

keps/sig-auth/4872-harden-kubelet-cert-validation/kep.yaml (Outdated)


          Update keps/sig-auth/4872-harden-kubelet-cert-validation/kep.yaml

16c59e6

Co-authored-by: Tim Bannister <[email protected]>

aojea reviewed

keps/sig-auth/4872-harden-kubelet-cert-validation/README.md


		Provided an actor with control of a node can impersonate another node, the impact would be:

		* Break confidentiality of the requests sent by the Kube-API server to the kubelet (e.g. kubectl exec/logs). These are usually user-driven requests. That gives the threat actor the possibility of producing incorrect or misleading feedback. In the exec case, it could allow a threat actor to issue prompts for credentials. In addition, the exec commands might contain user secrets.

Member

aojea Nov 5, 2024

If this actor already owns a node in the cluster, then exec and logs do not require impersonating a node: they just target pods on a node and require the kubelet to terminate the connection.

liggitt reviewed

keps/sig-auth/4872-harden-kubelet-cert-validation/README.md

+              #### Metrics
+              In order to help cluster administrators determine if it's safe to enable the feature, we propose to add a new metric `kube_apiserver_validation_kubelet_cert_cn_errors` that will track the number of errors due to the new CN validation.
+              If the feature gate is disabled, we will still add the validation code to the HTTP transport. However, if the validation fails we won't return an error; we will only increment the metric counter.

Member

liggitt Oct 23, 2024

Hmm... normally we want all code for the new feature to be completely inert when the gate is disabled. That way if there are bugs or non-functional regressions (like performance issues, etc), disabling the feature gate gives a way to mitigate that.

I like the idea of surfacing the metric of how many CNs would fail the validation, but I would only do the check and publish the metric if the feature gate is enabled. Someone wanting to dry run could enable the gate to activate the code but pass --disable-kubelet-cert-cn-validation=false to avoid failing requests if the validation fails.

aojea reviewed

keps/sig-auth/4872-harden-kubelet-cert-validation/README.md


		##### e2e tests

		End-to-end tests won't be needed as unit and integration tests will cover all the scenarios.

Member

aojea Nov 5, 2024

I don't know if you will be able to do everything with integration tests, but since a lot of e2e tests use logs, exec and portforward, I assume it will be implicitly covered by those tests once the feature gate is enabled.

aojea reviewed

keps/sig-auth/4872-harden-kubelet-cert-validation/README.md

+              This vulnerability can be exploited through ARP poisoning or other routing attacks, allowing a rogue node to obtain a certificate for an IP it does not own and reroute traffic to itself.
+              When the Kube API server connects to a kubelet, it verifies that the serving certificate is signed by a trusted CA and that the IP or hostname it’s connecting to is included in the certificate's SANs.
+              If a rogue node obtained a certificate for an IP it does not own and rerouted traffic to itself, it would be able to impersonate a Node that reports that IP.

Member

aojea Nov 5, 2024

Can you expand on the "reroute traffic to itself" problem?
I may not have the full details of the scenario. Is it that nodeA with IP1 is removed, but somehow actorB manages to get its certificate, spawns a nodeB (with nodeA's name? with IP2 or IP1?), joins it to the cluster, and sets the address IP1 in node.status.addresses?

aojea reviewed

keps/sig-auth/4872-harden-kubelet-cert-validation/README.md


		We will remove the metric once the feature is GA.

		> TODO: let's discuss this in the review. We could consider adding the node name to the metric or even keeping the metric post GA if it's valuable.

Member

aojea Nov 5, 2024

Node name has the cardinality problem, but leaving the metric sounds like a good thing to me. It can help detect issues with the node certificates caused by bugs, for example.

aojea reviewed

keps/sig-auth/4872-harden-kubelet-cert-validation/README.md

+              #### Alpha
+              * Add feature flag for gating usage, off by default
+              * Add flag to disable extra validation

Member

aojea Nov 5, 2024

Remember that you need to add the flag only if the feature gate is enabled. Otherwise, if the feature does not progress, it may not be possible to remove the flag without breaking users who are already setting it.

Member

aojea commented Nov 5, 2024

+1, sounds like a great addition, simple and very effective.

Reviewers

sftim left review comments

aojea left review comments

liggitt left review comments

Awaiting requested review from mikedanese

Awaiting requested review from ritazh

Labels

cncf-cla: yes kind/kep sig/auth size/L