Commit graph

137020 commits

Author SHA1 Message Date
Kubernetes Prow Robot
b31119d205
Merge pull request #138485 from QiWang19/disable-memqos-rollback-tests
skip MemoryQoS rollback test until implementation is resolved
2026-04-21 07:41:52 +05:30
Qi Wang
2aaa5b654b skip MemoryQoS rollback test until implementation is resolved
skip MemoryQoS rollback test until we figure out the mechanism to rollback.

Signed-off-by: Qi Wang <qiwan@redhat.com>
2026-04-20 12:41:45 -04:00
Kubernetes Prow Robot
03779bbd00
Merge pull request #138430 from michaelasp/gateMemoryQoS
Gate setting memory QoS behind feature gate
2026-04-17 09:05:03 +05:30
Michael Aspinwall
bb2bb3cd74 Gate setting memory QoS behind feature gate 2026-04-16 23:34:26 +00:00
Kubernetes Release Robot
b2f73c0d6b CHANGELOG: Update directory for v1.33.11 release 2026-04-15 21:55:21 +00:00
Kubernetes Release Robot
1845002159 CHANGELOG: Update directory for v1.34.7 release 2026-04-15 21:54:36 +00:00
Kubernetes Release Robot
f8627a6859 CHANGELOG: Update directory for v1.35.4 release 2026-04-15 18:12:53 +00:00
Kubernetes Prow Robot
b75d1f6dca
Merge pull request #138324 from MaybeSam05/fix-scheduler-inflight-uid-race-138316
scheduler: fix inFlightPods leak when pod is recreated during scheduling failure
2026-04-15 18:45:01 +05:30
Kubernetes Release Robot
cec8f06d2c CHANGELOG: Update directory for v1.36.0-rc.1 release 2026-04-14 20:19:03 +00:00
Samarth Verma
064f0956ad scheduler: address recreated pod review feedback 2026-04-14 10:14:36 -04:00
Kubernetes Prow Robot
eb51fbf7c6
Merge pull request #138346 from dashpole/update_otel_prop
Update go.opentelemetry.io/otel to v1.41.0
2026-04-14 02:34:37 +05:30
Kubernetes Prow Robot
02b9792136
Merge pull request #138354 from dims/update-moby-spdystream-v0.5.1
Update github.com/moby/spdystream from v0.5.0 to v0.5.1
2026-04-14 01:04:12 +05:30
Davanum Srinivas
f6209104d2
Update github.com/moby/spdystream from v0.5.0 to v0.5.1
Kubernetes-commit: 7e9c2c8eef26f99aa2f94d8e09d6d32de86c7769
2026-04-13 13:57:52 -04:00
Samarth Verma
48c8f0deb1 scheduler: skip requeueing recreated pods on scheduling failure 2026-04-13 11:53:50 -04:00
David Ashpole
feda787db4
update go.opentelemetry.io/otel to v1.41.0 2026-04-13 14:23:49 +00:00
Kubernetes Prow Robot
462e759d19
Merge pull request #138318 from jsafrane/fix-dra-mirroring
DRA: use full image name in test manifests
2026-04-13 19:22:11 +05:30
Samarth Verma
dc47d8840d scheduler: fix inFlightPods leak when pod is recreated during scheduling failure
handleSchedulingFailure can refresh podInfo from the informer before AddUnschedulableIfNotPresent. A delete and recreate with the same name may change the Pod UID while inFlightPods still tracks the UID from Pop, so Done and queueing-hint lookups must use that in-flight UID.

Add an explicit in-flight UID parameter, thread it through queueing-hint lookups, cover the same-name recreation case with a regression test, and check the returned error in updated test call sites.
2026-04-12 12:28:52 -04:00
Kubernetes Prow Robot
a389506254
Merge pull request #138303 from richabanker/mvp-error-log
Downgrade log level from Error to Info for unserved GVRs in peer proxy
2026-04-11 21:20:08 +05:30
Jan Safranek
a2b55fc7af DRA: use full image name in test manifests
In air-gaped environment, the image must have `name:tag` to be parsed and
updated with a private registry.
2026-04-10 17:10:40 +02:00
Kubernetes Prow Robot
0275382260
Merge pull request #138244 from fanzhangio/device-manager-numa-fix
devicemanager: constrain topology hints to device NUMA nodes
2026-04-10 18:48:20 +05:30
Kubernetes Prow Robot
550504bb25
Merge pull request #138300 from michaelasp/svmOverload
Reduce number of chaos workers to prevent timeouts
2026-04-10 15:58:19 +05:30
Richa Banker
45ab866615 Downgrade log level from Error to Info for unserved GVRs in peer proxy
Signed-off-by: Richa Banker <richabanker@google.com>
2026-04-09 12:46:58 -07:00
Michael Aspinwall
28b6b1e274 Reduce number of chaos workers to prevent timeouts 2026-04-09 19:33:19 +00:00
Kubernetes Prow Robot
f5c7b42274
Merge pull request #138299 from xmudrii/update-go
Bump images and versions to go 1.26.2 and distroless iptables
2026-04-10 00:44:19 +05:30
Marko Mudrinić
bf7ccb893e
Bump images and versions to go 1.26.2 and distroless iptables
Signed-off-by: Marko Mudrinić <mudrinic.mare@gmail.com>
2026-04-09 20:22:09 +02:00
Kubernetes Prow Robot
40007b6452
Merge pull request #138210 from Mujib-Ahasan/featuregate-WorkloadWithJob
Rename feature gate `EnableWorkloadWithJob` to `WorkloadWithJob`
2026-04-09 23:12:21 +05:30
Fan Zhang
3b5f150153 devicemanager: constrain topology hints to device NUMA nodes
On platforms with many OS-visible NUMA nodes that carry no devices
(e.g. NVIDIA GB200 with 36 NUMA nodes, only 1–2 hosting GPUs),
IterateBitMasks enumerates O(2^n) combinations and stalls the
kubelet for minutes.

Introduce deviceNUMANodes(), which collects the NUMA node IDs from
all registered devices for a resource regardless of allocation state.
generateDeviceTopologyHints() now iterates only over those nodes,
reducing n from 34 to 1–2 on affected hardware.

This fix uses allDevices ensures minAffinitySize and Preferred flags
are computed identically for behavior-preserving, making safe for
backport.

deviceNUMANodes() has a explicit runtime subset guard to guarantee to
return a subset of cadvisor-reported NUMA topology, regardless what
device-plugins report.

Kubernetes-bug: https://github.com/kubernetes/kubernetes/issues/135541

Signed-off-by: Fan Zhang <fanzhang@nvidia.com>
2026-04-09 10:32:44 -07:00
Kubernetes Prow Robot
0f7926ebf6
Merge pull request #138277 from BenTheElder/publishing-1.36
Add release-1.36 publishing bot rules
2026-04-09 21:50:20 +05:30
Kubernetes Prow Robot
f330d3b75f
Merge pull request #138281 from Jefftree/fix-map-sv
Pin MutatingAdmissionPolicy storage version to v1beta1
2026-04-09 19:14:23 +05:30
Jefftree
8254743864 Pin MutatingAdmissionPolicy storage version to v1beta1 2026-04-08 23:44:34 -04:00
Benjamin Elder
7ef0d61c4d Add release-1.36 publishing bot rules
go run k8s.io/publishing-bot/cmd/update-rules@latest -branch=release-1.36 --rules=./staging/publishing/rules.yaml -o ./staging/publishing/rules.yaml

then manually re-add the comments (TODO: make this tool preserve comments)
2026-04-08 16:21:39 -07:00
Kubernetes Release Robot
bc15d50fd2 CHANGELOG: Update directory for v1.36.0-rc.0 release 2026-04-08 18:00:32 +00:00
Kubernetes Prow Robot
6e753bd2b4
Merge pull request #138261 from dims/bump-golang-version-from-1.26.1-to-1.26.2
Bump golang version from 1.26.1 to 1.26.2
2026-04-08 02:53:31 +05:30
Davanum Srinivas
cd3d2cf43f
Bump golang version from 1.26.1 to 1.26.2 2026-04-07 16:07:35 -04:00
Omer Aplatony
cfe5b54d0a
Adds polling for HPA reconciliation_duration unit test (#138059)
* Adds polling for HPA reconciliation_duration unit test

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

* using struct name

Signed-off-by: Omer Aplatony <omerap12@gmail.com>

---------

Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2026-04-07 21:45:33 +05:30
Kubernetes Prow Robot
093c91d226
Merge pull request #138199 from neolit123/1.36-fix-pause-image-drift
pause: fix version drift and enforce full SemVer validation
2026-04-06 20:17:30 +05:30
Lubomir I. Ivanov
1c87f399bf build/dependencies.yaml: enforce PATCH level validation for pause
As it can be seen in build/pause/CHANGELOG.md the PATCH
level version for pause was introduced due to requirements
from the pause image for Windows. This however invalidated
our build/depedencies.yaml validation as it only accounted for
the MAJOR.MINOR version of pause (e.g. 3.10, not 3.10.1).

Enforce full SemVer validation for the pause image dependents.
2026-04-06 10:55:31 +02:00
Mujib Ahasan
bfe8f6b729 lint small fix
Signed-off-by: Mujib Ahasan <ahasanmujib8@gmail.com>
2026-04-04 20:19:57 +05:30
Mujib Ahasan
b9b0ff440d remove accidently commited file
Signed-off-by: Mujib Ahasan <ahasanmujib8@gmail.com>
2026-04-04 12:53:30 +05:30
Lubomir I. Ivanov
bc2179d882 pause: fix version drift in various files
The latest pause version is 3.10.2 but due to the introduction
of the PATCH level version to the pause image (previously was
only MAJOR.MINOR), various files have remained on an older
version. Either 3.10 or 3.10.1. Our validation with
build/dependencies.yaml ./hack/verify-external-dependencies.sh
did not account for that.
2026-04-03 16:11:09 +02:00
Kubernetes Prow Robot
d3b9c54bd9
Merge pull request #138178 from liggitt/deflake-wait
Make cleanup aware of uid differences
2026-04-02 23:09:09 +05:30
Jordan Liggitt
1140c34121
Make cleanup aware of uid differences 2026-04-02 11:13:44 -04:00
Kubernetes Prow Robot
9a6c7a68f3
Merge pull request #137991 from yashsingh74/cni-bumps
Update CNI plugins to v1.9.1
2026-04-02 17:55:04 +05:30
yashsingh74
afdb5e5d1f
Update CNI plugins to v1.9.1
Signed-off-by: yashsingh74 <yashsingh1774@gmail.com>
2026-04-01 14:06:34 +05:30
Kubernetes Prow Robot
eb8032bb83
Merge pull request #138030 from askervin/5gH-kubelet-fix-nodeAllocatableResourceClaimStatuses
kubelet: do not destroy nodeAllocatableResourceClaimStatuses
2026-04-01 07:12:12 +05:30
Kubernetes Prow Robot
6fbf7cc85e
Merge pull request #138035 from alaypatel07/kep-5304/bump-cdi-spec
kep-5304: bump cdi spec version to 0.5.0
2026-04-01 04:08:13 +05:30
Alay Patel
3906ec754c kep-5304: bump cdi spec version to 0.5.0
Signed-off-by: Alay Patel <alayp@nvidia.com>
2026-03-31 16:55:57 -04:00
Kubernetes Prow Robot
f2772372fb
Merge pull request #138131 from liggitt/deflake-pod-subresource-test
Deflake TestPodSubresourceAuth by waiting for effective permissions before testing
2026-03-31 22:02:19 +05:30
Jordan Liggitt
84be835316
Deflake TestPodSubresourceAuth by waiting for effective permissions before testing 2026-03-31 10:27:08 -04:00
Kubernetes Prow Robot
7e705d7851
Merge pull request #137978 from ania-borowiec/flake
Fix flakiness in integration test for TopologyAwareScheduling with Basic Policy
2026-03-31 16:00:14 +05:30