Commit graph

136988 commits

Author SHA1 Message Date
Rodrigo Campos
a138a4825e tests: Wait for pod to be removed on kubelet restart with idsPerPod
The test starts the kubelet with a non-default setting for idsPerPod,
runs a pod, deletes it, and then restarts the kubelet.

The issue is that the kubelet guarantees that no two pods userns
mappings overlap (for security reasons). But we are not waiting for the
pod to be removed, the deleteSync() call only waits for the API server
to remove the pod.

So, the pod is on disk (and maybe even running!) when we restart the
kubelet. As the previous configuration is incompatible with the new one
after restart if pods are running, the kubelet failing is the right
thing. We should just wait for the pod to be deleted from the kubelet
too, before restarting it with an incompatible configuration.

So, this commit just changes the pod deleteion (before done in
e2eoutput.TestContainerOutput() just waiting for the API server) to wait
for the kubelet to delete the pod.

Signed-off-by: Rodrigo Campos <rodrigo@amutable.com>
2026-04-09 11:45:22 +02:00
Kubernetes Release Robot
bc15d50fd2 CHANGELOG: Update directory for v1.36.0-rc.0 release 2026-04-08 18:00:32 +00:00
Kubernetes Prow Robot
6e753bd2b4
Merge pull request #138261 from dims/bump-golang-version-from-1.26.1-to-1.26.2
Bump golang version from 1.26.1 to 1.26.2
2026-04-08 02:53:31 +05:30
Davanum Srinivas
cd3d2cf43f
Bump golang version from 1.26.1 to 1.26.2 2026-04-07 16:07:35 -04:00
Omer Aplatony
cfe5b54d0a
Adds polling for HPA reconciliation_duration unit test (#138059)
* Adds polling for HPA reconciliation_duration unit test

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

* using struct name

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

---------

Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2026-04-07 21:45:33 +05:30
Kubernetes Prow Robot
093c91d226
Merge pull request #138199 from neolit123/1.36-fix-pause-image-drift
pause: fix version drift and enforce full SemVer validation
2026-04-06 20:17:30 +05:30
Lubomir I. Ivanov
1c87f399bf build/dependencies.yaml: enforce PATCH level validation for pause
As it can be seen in build/pause/CHANGELOG.md the PATCH
level version for pause was introduced due to requirements
from the pause image for Windows. This however invalidated
our build/depedencies.yaml validation as it only accounted for
the MAJOR.MINOR version of pause (e.g. 3.10, not 3.10.1).

Enforce full SemVer validation for the pause image dependents.
2026-04-06 10:55:31 +02:00
Lubomir I. Ivanov
bc2179d882 pause: fix version drift in various files
The latest pause version is 3.10.2 but due to the introduction
of the PATCH level version to the pause image (previously was
only MAJOR.MINOR), various files have remained on an older
version. Either 3.10 or 3.10.1. Our validation with
build/dependencies.yaml ./hack/verify-external-dependencies.sh
did not account for that.
2026-04-03 16:11:09 +02:00
Kubernetes Prow Robot
d3b9c54bd9
Merge pull request #138178 from liggitt/deflake-wait
Make cleanup aware of uid differences
2026-04-02 23:09:09 +05:30
Jordan Liggitt
1140c34121
Make cleanup aware of uid differences 2026-04-02 11:13:44 -04:00
Kubernetes Prow Robot
9a6c7a68f3
Merge pull request #137991 from yashsingh74/cni-bumps
Update CNI plugins to v1.9.1
2026-04-02 17:55:04 +05:30
yashsingh74
afdb5e5d1f
Update CNI plugins to v1.9.1
Signed-off-by: yashsingh74 <yashsingh1774@gmail.com>
2026-04-01 14:06:34 +05:30
Kubernetes Prow Robot
eb8032bb83
Merge pull request #138030 from askervin/5gH-kubelet-fix-nodeAllocatableResourceClaimStatuses
kubelet: do not destroy nodeAllocatableResourceClaimStatuses
2026-04-01 07:12:12 +05:30
Kubernetes Prow Robot
6fbf7cc85e
Merge pull request #138035 from alaypatel07/kep-5304/bump-cdi-spec
kep-5304: bump cdi spec version to 0.5.0
2026-04-01 04:08:13 +05:30
Alay Patel
3906ec754c kep-5304: bump cdi spec version to 0.5.0
Signed-off-by: Alay Patel <alayp@nvidia.com>
2026-03-31 16:55:57 -04:00
Kubernetes Prow Robot
f2772372fb
Merge pull request #138131 from liggitt/deflake-pod-subresource-test
Deflake TestPodSubresourceAuth by waiting for effective permissions before testing
2026-03-31 22:02:19 +05:30
Jordan Liggitt
84be835316
Deflake TestPodSubresourceAuth by waiting for effective permissions before testing 2026-03-31 10:27:08 -04:00
Kubernetes Prow Robot
7e705d7851
Merge pull request #137978 from ania-borowiec/flake
Fix flakiness in integration test for TopologyAwareScheduling with Basic Policy
2026-03-31 16:00:14 +05:30
Kubernetes Prow Robot
610490d1e1
Merge pull request #138087 from dims/fix-memory-manager-metrics-cleanup
e2e_node: wait for pod drain before asserting zero pods in Memory Manager Metrics
2026-03-30 18:00:15 +05:30
Kubernetes Prow Robot
1ecdebcb76
Merge pull request #138049 from ndixita/pod-events-fix
Pod events fix
2026-03-30 07:36:11 +05:30
Davanum Srinivas
10efa46fbb
e2e_node: wait for pod drain before asserting zero pods in Memory Manager Metrics
The Memory Manager Metrics BeforeEach asserts that zero pods are
running on the node after a kubelet config update. This hard assertion
flakes when a preceding serial test's namespace deletion hasn't
completed yet — framework namespace cleanup is async and the kubelet
restart in updateKubeletConfig can delay in-flight pod termination.

CI logs show leftover pods from MemoryQoS tests (memqos-burstable,
memqos-no-limit, etc.), Probe Stress tests (50-container pods), and
Summary API PSI tests (memory-pressure-pod), all still Running when
the assertion fires 4-7ms after the previous test finishes.

Replace the immediate Expect(count).To(BeZero()) with an Eventually
poll (2 minute timeout, 5 second interval) that gives pods time to
drain after the kubelet restart. The existing printAllPodsOnNode
diagnostic output is preserved inside the poll for debugging.

Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2026-03-28 15:27:25 -04:00
Kubernetes Prow Robot
c6a95ffd4c
Merge pull request #137996 from pacoxu/inplace-disable
set InPlacePodLevelResourcesVerticalScaling to false if needed
2026-03-28 08:42:11 +05:30
ndixita
3e7c6e3c83
Simplify isPodLevelResourcesResizeInProgress to check for absence of actuated resources
Signed-off-by: ndixita <ndixita@google.com>
2026-03-27 23:03:20 +00:00
Ania Borowiec
e622b86069
remove unnecessary fixes 2026-03-27 09:32:12 +00:00
Kubernetes Prow Robot
16c5a6be07
Merge pull request #138024 from tallclair/push-vkpxrwntwkxk
Fix race condition in updating the PodStatus cache
2026-03-27 04:50:17 +05:30
Natasha Sarkar
ba1ac16d2b fix erroneously reporting a PLR resize in progress on pod creation 2026-03-26 20:51:18 +00:00
Tim Allclair
65cc1c23a5 Fix race condition in updating the PodStatus cache 2026-03-26 17:47:17 +00:00
Kubernetes Prow Robot
3fea8a2fef
Merge pull request #134947 from aojea/dra_status_check
Fine-grained Authorization for ResourceClaim Status Updates
2026-03-26 22:34:18 +05:30
Antonio Ojea
adbf3b5aa5
Add granular authorization for DRA ResourceClaim status updates
This commit introduces the DRAResourceClaimGranularStatusAuthorization
feature gate (Beta in 1.36) to enforce fine-grained authorization checks
on ResourceClaim status updates.

Previously, 'update' permission on 'resourceclaims/status' allowed modifying
the entire status. To enforce the principle of least privilege for DRA
drivers and the scheduler, this change introduces synthetic subresources and
verb prefixes:

- 'resourceclaims/binding': Required to update 'status.allocation' and
  'status.reservedFor'.
- 'resourceclaims/driver': Required to update 'status.devices'. Evaluated
  on a per-driver basis using 'associated-node:<verb>' (for node-local
  ServiceAccounts) or 'arbitrary-node:<verb>' (for cluster-wide controllers).
2026-03-26 13:22:09 +00:00
Kubernetes Prow Robot
b30567c744
Merge pull request #135828 from HirazawaUi/5607-alpha-2-stage
Kubelet: Add alpha-2 stage implementation for UserNamespacesHostNetworkSupport feature gate
2026-03-26 15:08:18 +05:30
Antti Kervinen
9fd706ccd9 kubelet: do not destroy nodeAllocatableResourceClaimStatuses
The kubelet status manager was not preserving the
pod.status.nodeAllocatableResourceClaimStatuses field set by the
scheduler during pod status merges. This caused the information to the
to be destroyed by the kubelet's next status sync, making the field
always appear empty.

Add the same preservation pattern already used for
ResourceClaimStatuses and ExtendedResourceClaimStatus to both
mergePodStatus() and isPodStatusByKubeletEqual().

Signed-off-by: Antti Kervinen <antti.kervinen@intel.com>
2026-03-25 19:40:59 +02:00
Kubernetes Prow Robot
1ed5965382
Merge pull request #137946 from liggitt/pleg-reenable
Revert "Switch PLEGOnDemandRelist default to `false` for 1.36"
2026-03-25 15:28:21 +05:30
Kubernetes Prow Robot
473b7635de
Merge pull request #138006 from tallclair/push-kooxxktxovkr
Flaky test fix for 'should restart failing container when pod restartPolicy is Always'
2026-03-25 02:18:16 +05:30
Kubernetes Prow Robot
da97d71f14
Merge pull request #137897 from nojnhuh/dra-gang
scheduler: fix race in DRA pending allocation sharing
2026-03-24 23:40:18 +05:30
Jon Huhn
61cf993c6b scheduler: fix race in DRA pending allocation sharing 2026-03-24 12:07:31 -05:00
Tim Allclair
72ed617db1 Flaky test fix for 'should restart failing container when pod restartPolicy is Always' 2026-03-24 16:38:46 +00:00
Kubernetes Prow Robot
36925dc0c7
Merge pull request #137482 from aman4433/fix-apf-timeout-header-test-race
apiserver: tolerate APF header race with timeout handler in priority-and-fairness tests
2026-03-24 20:16:23 +05:30
Kubernetes Prow Robot
051af779af
Merge pull request #137989 from hoteye/fix-preferredimports-owners-typo
Fix malformed OWNERS entries used by maintainers
2026-03-24 16:12:19 +05:30
Kubernetes Prow Robot
c1ebc2c3e2
Merge pull request #136589 from tosi3k/preemption-mode
Add Workload-Aware Preemption fields to Workload and PodGroup APIs
2026-03-24 15:14:16 +05:30
Ania Borowiec
c49126e031
Fix flakiness in integration test for TopologyAwareScheduling with Basic Policy 2026-03-24 09:42:00 +00:00
Paco Xu
7c65919285 set InPlacePodLevelResourcesVerticalScaling to false if PodLevelResources is set to false 2026-03-24 16:57:46 +08:00
Antoni Zawodny
ccaaf9d3a5 Marry WAP logic with the new API fields 2026-03-24 09:36:00 +01:00
Antoni Zawodny
59c9f75133 Add Workload-Aware Preemption fields to Workload and PodGroup APIs
Co-authored-by: Omar Sayed <omarsayed@google.com>
2026-03-24 09:03:50 +01:00
Jordan Liggitt
0519b8e784
Revert "Switch PLEGOnDemandRelist default to false for 1.36"
This reverts commit 4a69899d8b.
2026-03-24 01:48:19 -04:00
Kubernetes Prow Robot
eca347edbf
Merge pull request #137957 from tallclair/push-vxyyxtvrypxt
Fix user namespace test cleanup race
2026-03-24 10:24:21 +05:30
Kubernetes Prow Robot
e93471350d
Merge pull request #137954 from tallclair/push-qqlqpuxopyzs
Fix restartable init container startup race
2026-03-24 10:24:14 +05:30
hoteye
93f0ba1add Fix malformed OWNERS entries used by maintainers 2026-03-24 10:34:00 +08:00
Kubernetes Prow Robot
32d6fb1759
Merge pull request #137949 from bitoku/use-atomic
cri-client: use atomic.Bool for useStreaming to fix data race
2026-03-24 07:28:19 +05:30
Tim Allclair
2e60e1407b Fix user namespace test cleanup race 2026-03-23 23:22:44 +00:00
Kubernetes Prow Robot
61d4e7ae86
Merge pull request #137684 from ndixita/ippr-plr-beta-enable
[PodLevelResources] Graduate InPlacePodLevelResourcesVerticalScaling feature to beta
2026-03-24 03:44:16 +05:30