📖 Update autoscaling from zero enhancement proposal with support for platform-aware autoscale from zero #11962

aleskandro · 2025-03-12T12:27:20Z

What this PR does / why we need it:

This PR updates the contract between the cluster-autoscaler Cluster API provider and the infrastructure provider's controllers that reconcile the Infrastructure Machine Template to support platform-aware autoscale from 0 in clusters consisting of nodes heterogeneous in CPU architecture and OS.

With this commit, the infrastructure providers implementing controllers to reconcile the status of their Infrastructure Machine Templates for supporting autoscale from 0 will be able to fill the status.nodeInfo stanza with additional information about the nodes.

The status.nodeInfo stanza has type corev1.NodeSystemInfo to reflect the same content, the rendered nodes' objects would store in their status field.

The cluster-autoscaler can use that information to build the node template labels kubernetes.io/arch and kubernetes.io/os if that information is present.

Suppose the pending pods that trigger the cluster autoscaler have a node selector or a requiredDuringSchedulingIgnoredDuringExecution node affinity concerning the architecture or operating system of the node where they can execute. In that case, the autoscaler will be able to filter the nodes groups options according to the architecture or operating system requested by the pod.

The users could already provide this information to the cluster autoscaler through the labels capacity annotation. However, there is no similar capability to support future labels/taints through information set by the reconcilers of the status of Infrastructure Machine Templates.

Which issue(s) this PR fixes

Related to #11961

/area provider/core

k8s-ci-robot · 2025-03-12T12:27:27Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign enxebre for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot · 2025-03-12T12:27:29Z

Welcome @aleskandro!

It looks like this is your first PR to kubernetes-sigs/cluster-api 🎉. Please refer to our pull request process documentation to help your PR have a smooth ride to approval.

You will be prompted by a bot to use commands during the review process. Do not be afraid to follow the prompts! It is okay to experiment. Here is the bot commands documentation.

You can also check if kubernetes-sigs/cluster-api has its own contribution guidelines.

You may want to refer to our testing guide if you run into trouble with your tests not passing.

If you are having difficulty getting your pull request seen, please follow the recommended escalation practices. Also, for tips and tricks in the contribution process you may want to read the Kubernetes contributor cheat sheet. We want to make sure your contribution gets all the attention it needs!

Thank you, and welcome to Kubernetes. 😃

k8s-ci-robot · 2025-03-12T12:27:29Z

Hi @aleskandro. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

JoelSpeed

Seems very straightforward, no objection from me

elmiko

this generally makes sense to me, and i don't have any objections.

i would like to get some attention on the API field change though, just to make sure folks agree about the name and placement.

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

aleskandro · 2025-03-13T19:55:56Z

Thanks @elmiko. To give more context about the choice of naming and structure for everyone to discuss alternatives.

I couldn't consider the architecture as part of the capacity field because the values of keys in that stanza are expected to be a Quantity. I tried to trace back the reasoning the community did about the capacity field and I noticed that the Node objects' status field has the same field - capacity - with the same content we set in the Infrastructure Machine Template objects.

The Node objects' status field also has a nodeInfo field with the same name and type that I'm now proposing for the Infrastructure Machine Template objects.

Among the alternatives I was evaluating, this seemed the most reasonable to me.

elmiko · 2025-03-14T18:24:23Z

makes sense to me @aleskandro , no objection from me.

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

chrischdi · 2025-03-18T12:58:06Z

/hold

Especially for getting consensus on and resolve #11962 (comment)

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

sbueringer · 2025-03-28T15:10:54Z

@JoelSpeed If you find some time, PTAL :)

sbueringer · 2025-04-07T08:55:18Z

/ok-to-test

sbueringer · 2025-04-09T03:06:20Z

lgtm from my side. Pending resolution of #11962 (comment)

(I'll be on PTO for the next few weeks, but fine for me if there is consensus with other maintainers)

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

aleskandro · 2025-04-10T10:18:41Z

/label tide/merge-method-squash

elmiko · 2025-04-30T13:16:20Z

i am happy with the state this is in.

/lgtm

k8s-ci-robot · 2025-04-30T13:16:27Z

LGTM label has been added.

Git tree hash: 1e88b9ce31402b9352a67b7a6637e488da7436ac

aleskandro · 2025-05-01T13:50:50Z

/unhold

fabriziopandini

Just a few nits from my side.
Considering the type of changes we are introducing, I think we should consider addressing #12033 as integral part of the implementation.

Last nit, could you kindly also add a new line in the Implementation History down below

fabriziopandini · 2025-05-05T12:07:03Z

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

+)
+
+// NodeInfo contains information about the node's architecture and operating system.
+type NodeInfo struct {


I would suggest to use NodeSystemInfo, so there is a more direct link with the corresponding struct in corev1 types.

Hi @fabriziopandini, the previous version was using NodeSystemInfo. I changed it due to #11962 (comment). cc @sbueringer

I'm ok with either way as far as we have consensus :).

I would favor NodeInfo because I think the field and struct name should be the same if there is no good reason to diverge. And I think the field name should be the same as in upstream

I'm ok with both options, but the consistency with corev1.Node in this case IMO could help

I agree with both your points, but leaning a bit more into @fabriziopandini's option to be consistent with corev1.Node.

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

fabriziopandini · 2025-05-05T12:15:16Z

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

@@ -246,6 +297,8 @@ metadata:
    capacity.cluster-autoscaler.kubernetes.io/taints: "key1=value1:NoSchedule,key2=value2:NoExecute"
 ```

+If the `capacity.cluster-autoscaler.kubernetes.io/labels` annotation specifies a label that would otherwise be generated from the `capacity` or `node-info` annotations, the autoscaler will use the label defined in `capacity.cluster-autoscaler.kubernetes.io/labels`, overriding any labels produced by processing the other annotations.


TBH i struggle to follow, might be because this sentence enters into autoscaler implementation details.

What about simplifying by writing what takes precedence between capacity or node-info on the template vs annotations on MachineSet or MachineDeployment?

Changed to

The capacity.cluster-autoscaler.kubernetes.io/labels annotation takes precedence over other sources when conflicts occur.
For instance, if the kubernetes.io/arch label is specified in capacity.cluster-autoscaler.kubernetes.io/labels, its value supersedes that specified by node-info.cluster-autoscaler.kubernetes.io/architecture.

k8s-ci-robot · 2025-05-08T15:27:20Z

New changes are detected. LGTM label has been removed.

aleskandro · 2025-05-08T15:40:25Z

Just a few nits from my side. Considering the type of changes we are introducing, I think we should consider addressing #12033 as integral part of the implementation.

Hi @fabriziopandini , thanks for your review.

I could take #12033 and resolve it in a separate PR, if this sounds good to you.

Last nit, could you kindly also add a new line in the Implementation History down below

Done

…atform-aware autoscale from zero This commit updates the contract between the cluster-autoscaler Cluster API provider and the infrastructure provider's controllers that reconcile the Infrastructure Machine Template to support platform-aware autoscale from 0 in clusters consisting of nodes heterogeneous in CPU architecture and OS. With this commit, the infrastructure providers implementing controllers to reconcile the status of their Infrastructure Machine Templates for supporting autoscale from 0 will be able to fill the status.nodeInfo stanza with additional information about the nodes. The status.nodeInfo stanza has type corev1.NodeSystemInfo to reflect the same content, the rendered nodes' objects would store in their status field. The cluster-autoscaler can use that information to build the node template labels `kubernetes.io/arch` and `kubernetes.io/os` if that information is present. Suppose the pending pods that trigger the cluster autoscaler have a node selector or a requiredDuringSchedulingIgnoredDuringExecution node affinity concerning the architecture or operating system of the node where they can execute. In that case, the autoscaler will be able to filter the nodes groups options according to the architecture or operating system requested by the pod. The users could already provide this information to the cluster autoscaler through the labels capacity annotation. However, there is no similar capability to support future labels/taints through information set by the reconcilers of the status of Infrastructure Machine Templates.

fabriziopandini · 2025-05-13T08:27:00Z

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

+Its definition would look like this:
+
+```go
+package platform


Just to double check, are we sure we want to expose those types in a non-versioned package?
Also, given this is going to be part of the contract, shouldn't we use a package name that enforce this link (and that we can eventually expand with other contract related stuff)?

We were considering not to have these types in the cluster api repo, but delegating implementations to define their based on the contract defined here

fabriziopandini · 2025-05-13T08:36:05Z

docs/proposals/20210310-opt-in-autoscaling-from-zero.md

+The `capacity.cluster-autoscaler.kubernetes.io/labels` annotation takes precedence over other sources when conflicts occur.
+For instance, if the `kubernetes.io/arch` label is specified in `capacity.cluster-autoscaler.kubernetes.io/labels`, its value supersedes that specified by `node-info.cluster-autoscaler.kubernetes.io/architecture`.


I'm getting back to this because I'm still a little bit confused.

I was assuming that capacity.cluster-autoscaler.kubernetes.io and node-info.cluster-autoscaler.kubernetes.io serve a different pourpuse, so value cannot overlap (the example with both reporting arch is confusing).

Instead what can happen IMO is that each annotation overlaps with the corresponding struct in templates (capacity.cluster-autoscaler.kubernetes.io with status.capacity and node-info.cluster-autoscaler.kubernetes.io with status.nodeIndfo), and I was expecting this paragraph to define which of the two takes precedence

The issue here is that node-info and status.node(System)?Info will render as labels for the scheduler simulator in the autoscaler side. So either we deny capacity labels to specify labels like kubernetes.io/arch or we have to define which one takes precedence over the other.

The autoscaler and scheduler do not use status.nodeSystemInfo directly, but obey labels for pods that specify predicates for them

k8s-ci-robot added area/provider/core Issues or PRs related to the core provider cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Mar 12, 2025

k8s-ci-robot requested review from elmiko and JoelSpeed March 12, 2025 12:27

k8s-ci-robot added needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Mar 12, 2025

aleskandro force-pushed the update-kep-scale-from-0-multiarch branch from 4fbe927 to 666c952 Compare March 12, 2025 12:31

JoelSpeed reviewed Mar 12, 2025

View reviewed changes

elmiko reviewed Mar 13, 2025

View reviewed changes

docs/proposals/20210310-opt-in-autoscaling-from-zero.md Show resolved Hide resolved

chrischdi reviewed Mar 17, 2025

View reviewed changes

docs/proposals/20210310-opt-in-autoscaling-from-zero.md Outdated Show resolved Hide resolved

aleskandro force-pushed the update-kep-scale-from-0-multiarch branch from 666c952 to 106bcbd Compare March 17, 2025 15:12

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Mar 18, 2025

aleskandro force-pushed the update-kep-scale-from-0-multiarch branch from 106bcbd to 20f9c76 Compare March 27, 2025 15:47

k8s-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Mar 27, 2025

aleskandro force-pushed the update-kep-scale-from-0-multiarch branch 2 times, most recently from d6e69f8 to 85da18e Compare March 27, 2025 15:50

aleskandro requested review from elmiko, JoelSpeed and chrischdi March 27, 2025 15:54

sbueringer reviewed Mar 28, 2025

View reviewed changes

aleskandro force-pushed the update-kep-scale-from-0-multiarch branch 2 times, most recently from 8349163 to 6d2fb96 Compare March 31, 2025 12:03

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Apr 7, 2025

JoelSpeed reviewed Apr 9, 2025

View reviewed changes

k8s-ci-robot added the tide/merge-method-squash Denotes a PR that should be squashed by tide when it merges. label Apr 10, 2025

aleskandro requested review from sbueringer and JoelSpeed April 17, 2025 07:03

k8s-ci-robot assigned elmiko Apr 30, 2025

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Apr 30, 2025

k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label May 1, 2025

fabriziopandini reviewed May 5, 2025

View reviewed changes

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 8, 2025

aleskandro force-pushed the update-kep-scale-from-0-multiarch branch from 6758a68 to 31e4d30 Compare May 8, 2025 15:39

aleskandro force-pushed the update-kep-scale-from-0-multiarch branch from 31e4d30 to d0cb4af Compare May 8, 2025 15:42

fabriziopandini reviewed May 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

📖 Update autoscaling from zero enhancement proposal with support for platform-aware autoscale from zero #11962

📖 Update autoscaling from zero enhancement proposal with support for platform-aware autoscale from zero #11962

aleskandro commented Mar 12, 2025

k8s-ci-robot commented Mar 12, 2025

k8s-ci-robot commented Mar 12, 2025

k8s-ci-robot commented Mar 12, 2025

JoelSpeed left a comment

elmiko left a comment

aleskandro commented Mar 13, 2025

elmiko commented Mar 14, 2025

chrischdi commented Mar 18, 2025

sbueringer commented Mar 28, 2025

sbueringer commented Apr 7, 2025

sbueringer commented Apr 9, 2025

aleskandro commented Apr 10, 2025

elmiko commented Apr 30, 2025

k8s-ci-robot commented Apr 30, 2025

aleskandro commented May 1, 2025

fabriziopandini left a comment

fabriziopandini May 5, 2025

aleskandro May 8, 2025

sbueringer May 13, 2025 •

edited

Loading

fabriziopandini May 13, 2025

aleskandro May 13, 2025

fabriziopandini May 5, 2025

aleskandro May 8, 2025

k8s-ci-robot commented May 8, 2025

aleskandro commented May 8, 2025 •

edited

Loading

fabriziopandini May 13, 2025

aleskandro May 13, 2025

fabriziopandini May 13, 2025

aleskandro May 13, 2025 •

edited

Loading

		The `capacity.cluster-autoscaler.kubernetes.io/labels` annotation takes precedence over other sources when conflicts occur.
		For instance, if the `kubernetes.io/arch` label is specified in `capacity.cluster-autoscaler.kubernetes.io/labels`, its value supersedes that specified by `node-info.cluster-autoscaler.kubernetes.io/architecture`.

📖 Update autoscaling from zero enhancement proposal with support for platform-aware autoscale from zero #11962

Are you sure you want to change the base?

📖 Update autoscaling from zero enhancement proposal with support for platform-aware autoscale from zero #11962

Conversation

aleskandro commented Mar 12, 2025

k8s-ci-robot commented Mar 12, 2025

k8s-ci-robot commented Mar 12, 2025

k8s-ci-robot commented Mar 12, 2025

JoelSpeed left a comment

Choose a reason for hiding this comment

elmiko left a comment

Choose a reason for hiding this comment

aleskandro commented Mar 13, 2025

elmiko commented Mar 14, 2025

chrischdi commented Mar 18, 2025

sbueringer commented Mar 28, 2025

sbueringer commented Apr 7, 2025

sbueringer commented Apr 9, 2025

aleskandro commented Apr 10, 2025

elmiko commented Apr 30, 2025

k8s-ci-robot commented Apr 30, 2025

aleskandro commented May 1, 2025

fabriziopandini left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sbueringer May 13, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

k8s-ci-robot commented May 8, 2025

aleskandro commented May 8, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aleskandro May 13, 2025 • edited Loading

Choose a reason for hiding this comment

sbueringer May 13, 2025 •

edited

Loading

aleskandro commented May 8, 2025 •

edited

Loading

aleskandro May 13, 2025 •

edited

Loading