Commit graph

16836 commits

Author SHA1 Message Date
Kubernetes Prow Robot
b4e4d2cfc1
Merge pull request #139237 from aojea/webhook_idle_
webhook use resolved endpoint IP instead of cached
2026-06-11 01:49:53 +05:30
Kubernetes Prow Robot
f7a260bfc5
Merge pull request #139609 from amritansh1502/promote-agnhost-2.64.0
Promote agnhost e2e test image to 2.64.0
2026-06-10 23:09:06 +05:30
Amritansh Amritansh
64915c6459 Promote agnhost e2e test image to 2.64.0 2026-06-10 14:18:35 +05:30
Kubernetes Prow Robot
5d4ee2a71b
Merge pull request #136695 from chrishenzie/resource-quota-flake
Fix race condition in ResourceQuota e2e test
2026-06-10 02:27:45 +05:30
Humble Devassy Chirammal
79df48076c csi: update CSI sidecar images in test manifests
Update CSI sidecar container images to their latest stable releases
across the e2e test manifests:

- csi-provisioner:                          v6.2.0 -> v6.3.0
- csi-attacher:                             v4.11.0 -> v4.12.0
- csi-resizer:                              v2.1.0  -> v2.2.0
- csi-snapshotter:                          v8.5.0  -> v8.6.0
- csi-node-driver-registrar:                v2.16.0 -> v2.17.0
- livenessprobe:                            v2.18.0 -> v2.19.0
- csi-external-health-monitor-controller:   v0.17.0 -> v0.18.0

No change to hostpathplugin (v1.17.1 is current) or
csi-snapshot-metadata (v1.0.0 is current).

Signed-off-by: Humble Devassy Chirammal <humble.devassy@gmail.com>
2026-06-09 12:55:31 +05:30
Kubernetes Prow Robot
546c46f0d8
Merge pull request #139006 from rzlink/fix/e2e-common-race-conditions
test/e2e/common/node: poll for eventually-consistent state to reduce flakiness
2026-06-06 03:36:32 +05:30
Kubernetes Prow Robot
a5098cf9a1
Merge pull request #139439 from pohly/dra-kind-runtime
DRA E2E: simplify kind.yaml
2026-06-02 20:13:47 +05:30
Patrick Ohly
2a66149148 DRA E2E: simplify kind.yaml
Runtime config can be set via the kind config, which is simpler than setting
the apiserver parameter.

DynamicResourceAllocation is enabled by default nowadays, but still needs to be
set for the current n-3 skew testing which picks 1.33 (1.37 still in alpha).
Similar for NodeLogQuery (GA in 1.36).
2026-06-02 15:43:43 +02:00
Kubernetes Prow Robot
e4606044ad
Merge pull request #139427 from Jefftree/bump-etcd-3-7-0-rc-0
etcd: update etcd image to v3.7.0-rc.0
2026-06-02 16:39:54 +05:30
Kubernetes Prow Robot
35bed76715
Merge pull request #139416 from bart0sh/PR238-fix-apiserver-runtime-config
DRA e2e: fix apiServer runtime-config
2026-06-02 09:06:48 +05:30
Jefftree
2544f4dda2 etcd: update etcd image to v3.7.0-rc.0 2026-06-01 20:29:57 -04:00
Kubernetes Prow Robot
89243359c2
Merge pull request #138235 from zylxjtu/dev
test/e2e_node_windows: Add Windows node-level e2e tests
2026-06-02 01:56:57 +05:30
Ed Bartosh
bdae66a87b DRA e2e: fix apiServer runtime-config
The kubeadm v1beta4 ClusterConfiguration changed ExtraArgs from
map[string]string to []Arg (list of {name, value} pairs). The
scheduler and controllerManager sections already used the new list
format, but apiServer.extraArgs still used the old map format, causing
the --runtime-config flag to be silently dropped when kind uses v1beta4.

Without runtime-config, resource.k8s.io/v1beta1, v1beta2 and v1alpha3
default to disabled, so the API server skips them with "has no
resources" and tests using those API versions get 404.
2026-06-01 18:30:44 +03:00
Kubernetes Prow Robot
38592b8d39
Merge pull request #139322 from vshkrabkov/e2e-test/gang-scheduling
E2E test for gang scheduling
2026-05-29 05:38:46 +05:30
vshkrabkov
4a4319f666 e2e test for gang scheduling 2026-05-28 14:28:38 +00:00
Kubernetes Prow Robot
e1ed8dd068
Merge pull request #139203 from gnufied/add-e2e-for-deleted-pvc-offline-expansion
Add an e2e for offline volume expansion
2026-05-28 06:56:51 +05:30
Antonio Ojea
ede2fddcfd
webhook use resolved endpoint IP instead of cached
Previously, the webhook transport was switched from HTTP/2 to HTTP/1.1 to work around HTTP/2's single-connection multiplexing, which prevented concurrent requests from load-balancing across multiple backend pods. However, under HTTP/1.1, connections are kept alive and cached as idle in the transport's pool.

Because Go's http.Transport keys its connection cache by the request's URL Host (in this case the service name) and we overrode the DialContext to perform dynamic endpoint resolution, when a new request is sent, if there is an idle connection in the pool matching the service hostname, the connection is reused and the dialer is skipped.
2026-05-27 20:26:34 +00:00
Kubernetes Prow Robot
c2fb6b8dec
Merge pull request #139199 from marosset/windows-skip-memory-backed-emptydir-conromance
Skip memory backed emptyDir e2e tests on Windows
2026-05-27 23:46:46 +05:30
Hemant Kumar
3884c64fbe Fix g.Expect with framework.ExpectNoError 2026-05-27 13:09:02 -04:00
Hemant Kumar
01a01e2ac4 Add additional check for restore pvc.Capacity from pv annotation 2026-05-26 17:57:16 -04:00
Hemant Kumar
46a4487a62 Add an e2e for offline volume expansion 2026-05-26 17:25:03 -04:00
zylxjtu
312d00189c Add Windows node-level e2e tests to a dedicated directory
Introduce test/e2e_node_windows/ as a self-contained Windows node e2e
test suite, fully isolated from the Linux-focused test/e2e_node/ package.
All new files are gated with //go:build windows, and the tree includes
a scoped OWNERS file so it lands under an agreed governance model.

In hack/lib/golang.sh, skip building test/e2e_node/e2e_node.test when
KUBE_BUILD_PLATFORMS targets Windows. Windows has a separate e2e_node
test binary which does not currently need to be bundled in an archive.

Document the Windows feature label in test/e2e/feature/feature.go.
2026-05-26 18:04:09 +00:00
Kubernetes Prow Robot
e09c6f3997
Merge pull request #138241 from SergeyKanzhelev/remove-unused-e2e-features
Remove unused e2e features
2026-05-23 21:56:43 +05:30
dom4ha
88948acc38 Remove v1alpha2 API definitions
Update client-go lister and informer imports to v1alpha3
2026-05-22 12:50:19 +00:00
dom4ha
8a52fb2ea9 Migrate references to v1alpha3 in tests, controllers, and remaining files 2026-05-22 12:50:19 +00:00
Kubernetes Prow Robot
e61737540c
Merge pull request #139098 from Jefftree/skip-hostcleanup-no-external-ip
e2e: skip HostCleanup test when worker has no NodeExternalIP
2026-05-21 10:16:43 +05:30
Mark Rossetti
4684fc6c67
Skip memory backed emptyDir e2e tests on Windows
Signed-off-by: Mark Rossetti <marosset@microsoft.com>
2026-05-20 19:25:08 +00:00
Kubernetes Prow Robot
661b086ee0
Merge pull request #139141 from bart0sh/PR236-DRAExtended-e2e
DRA E2E: improve DRAExtendedResource tests go GA graduation
2026-05-20 15:47:54 +05:30
Kubernetes Prow Robot
d0ab3fc757
Merge pull request #139160 from danwinship/named-port-non-implementation
Add an NP e2e test of correct fallback when failing to implement named ports
2026-05-20 05:05:45 +05:30
Dan Winship
7c824ea051 Remove stale OWNERS file
Just inherit the parent OWNERS
2026-05-19 16:19:17 -04:00
Kubernetes Prow Robot
04976d6f60
Merge pull request #138844 from skitt/bump-tools
Bump all tool dependencies
2026-05-20 01:31:45 +05:30
Dan Winship
700d621fd6 Add an NP e2e test of correct fallback when failing to implement named ports 2026-05-19 09:56:42 -04:00
Sai Ramesh Vanka
3f3a340240 Fix DRA extended resource quota test race condition
Signed-off-by: Sai Ramesh Vanka <svanka@redhat.com>
2026-05-19 08:57:23 +05:30
Arda Güçlü
4593dbe07e
Move kubectl apply e2e tests to dedicated file (#139086)
* Move kubectl apply e2e tests to dedicated file

* Move prune test under apply.go
2026-05-18 21:43:53 +05:30
Ed Bartosh
68c8c393d3 DRA E2E: improve DRAExtendedResource tests
Added a test verifying that when both a device plugin and a DRA
driver advertise the same resource on one node, the device plugin
wins (filterExtendedResources takes the DRA path only when
allocatable == 0).

Asserted ExtendedResourceClaimStatus in the existing "process
extended resources after device plugin uninstall" test to confirm
the DRA path is taken after DP removal.
2026-05-18 19:04:28 +03:00
Kubernetes Prow Robot
e789abc1ee
Merge pull request #138367 from timmy-wright/timmy/fix-cleanup
fix: accumulate subpath cleanup actions in makeMounts to prevent handle leak
2026-05-18 19:55:54 +05:30
Sergey Kanzhelev
39e5b92c39 test: assign owner SIGs to features in test/e2e/feature/feature.go 2026-05-16 18:51:00 +00:00
Sergey Kanzhelev
efec7c444e test: remove unused features from test/e2e/feature/feature.go 2026-05-16 18:51:00 +00:00
Jefftree
7fe9bbb5c5 e2e: skip HostCleanup test when worker has no NodeExternalIP 2026-05-15 15:45:52 -04:00
Kubernetes Prow Robot
6b0e464c7a
Merge pull request #138799 from rzlink/fix/windows-e2e-pod-overhead
e2e/windows: respect pod.Spec.Overhead in Memory Limits and Kubelet-Stats tests
2026-05-14 03:07:48 +05:30
Kubernetes Prow Robot
eaa04458b2
Merge pull request #138961 from soltysh/simplify_exec
Replace remote executor code with the DefaultExecutor from kubectl
2026-05-13 16:00:28 +05:30
Dawei Wei
b763eaf594 test/e2e/common/node: poll for eventually-consistent state to reduce flakiness
These tests have race conditions where they assume immediate state
visibility after a pod transitions to Running. The current code works
on fast runtimes but is fundamentally racy: kubelet log streaming,
log file flushing, and container status updates are eventually
consistent, not synchronous.

Switching to gomega.Eventually polling makes the tests deterministic.
The success path on fast runtimes is unchanged (polling succeeds on
first attempt), but the tests now correctly handle scenarios where
state takes a moment to propagate. This benefits any environment
where containers may take longer to start (VM-isolated runtimes such
as Kata, gVisor, and Windows Hyper-V; overloaded CI VMs; shared
multi-tenant clusters).

- ephemeral_containers.go (both 'should be added' and 'should update'
  tests): the 'polo' log-content check is polled via gomega.Eventually
  with f.Timeouts.PodStartShort. The container may report Running
  before its first stdout has been flushed.

- lifecycle_hook.go ('ignore terminated container'): use
  f.Timeouts.PodDelete instead of gracePeriod*time.Second for the
  termination wait. The actual correctness check (container's intrinsic
  StartedAt/FinishedAt < sleepSeconds) is unchanged and unaffected by
  how long we waited.

- pods.go ('retrieving logs from the container over websockets'):
  poll the websocket open and read via gomega.Eventually. The container
  can be reported Running before its first stdout line has been flushed,
  so opening the websocket immediately may return an empty or partial
  buffer.
2026-05-12 11:38:18 -07:00
Kubernetes Prow Robot
8e161c8126
Merge pull request #138963 from dims/remove-flaky-notation
Remove [Flaky] for green tests
2026-05-12 22:04:40 +05:30
Kubernetes Prow Robot
300eda2f2f
Merge pull request #138746 from humblec/update-etcd-image-3.6.11
etcd: update etcd image to v3.6.11
2026-05-12 17:39:32 +05:30
Stephen Kitt
e223a05ec7
Bump github.com/vektra/mockery/v3
mockery ignores auto-generated code by default, so this requires a
couple of configuration changes.

Signed-off-by: Stephen Kitt <skitt@redhat.com>
2026-05-12 11:55:38 +02:00
Kubernetes Prow Robot
874a7b40b0
Merge pull request #138617 from esotsal/kubeletHealthCheckRefactor
Move kubeletHealthCheck from e2enode to node as HealthCheck
2026-05-12 02:26:10 +05:30
Maciej Szulik
61e5542da9
Replace remote executor code with the DefaultExecutor from kubectl
Signed-off-by: Maciej Szulik <soltysh@gmail.com>
2026-05-11 16:59:31 +02:00
Maciej Szulik
8ecec5b156
Eliminate too many ExecWithOptions helpers
Signed-off-by: Maciej Szulik <soltysh@gmail.com>
2026-05-11 16:59:07 +02:00
Davanum Srinivas
1f59ea104a
Remove [Flaky] for green tests
Drops f.WithFlaky() from two test blocks where the tag has become stale:

- [sig-node] kubelet host cleanup with volume mounts [HostCleanup]
  (covers both NFS sub-tests: active and sleeping client pods)
- [sig-storage] PersistentVolumes-local "should set different fsGroup
  for second pod if first pod is deleted" (covers all 8 volume-type
  variants from the parameterized parent)

Testgrid evidence -- both dashboards show consistent passes across all
30 recent runs:
  https://testgrid.k8s.io/google-gce#gci-gce-flaky&include-filter-by-regex=Flaky
  https://testgrid.k8s.io/sig-testing-misc#gce-cos-master-flaky-repro&include-filter-by-regex=Flaky

History:
- HostCleanup was tagged [Flaky] in PR 41659 (merged 2017-04-13) as a
  quick workaround for parallel-execution interference with disruptive
  tests; the follow-up "remove [Flaky]" PR mentioned in that body never
  landed. Root-cause issue 31272 ("Hung volumes can wedge the kubelet")
  remains open.
- fsGroup test was tagged [Flaky] in PR 75015 (merged 2019-03-06) to
  skip a race in DesiredStateOfWorld re-adding terminating-pod volumes.
  Root-cause issue 73168 ("Do not remount volume again after it is
  detached") remains open. The obsolete TODO comment referencing that
  issue is also removed.

If either test regresses, the safe rollback is to restore f.WithFlaky()
and reopen the conversation on issue 31272 / 73168.
2026-05-11 08:26:29 -04:00
Tim Wright
7f8d1138ca
fix: accumulate subpath cleanup actions in makeMounts to prevent handle leak in Windows 2026-05-11 20:36:29 +12:00