Closed
Description
Which jobs are flaking?
pull-kubernetes-node-kubelet-containerd-flaky
Which tests are flaking?
E2eNode Suite: [It] [sig-node] Device Plugin [NodeFeature:DevicePlugin] [Serial] DevicePlugin [Serial] [Disruptive] Keeps device plugin assignments across node reboots (no pod restart, no device plugin re-registration) [Flaky]
Since when has it been flaking?
I've noticed it only once, today (Oct 30, 2024), but I can see similar failures in other jobs, e.g. #128114.
Testgrid link
Reason for failure (if possible)
At first glance, the failure could be caused by a bug in the device plugin registration process that prevents the plugin from re-registering after a Kubelet reboot. It doesn't happen consistently, though.
Anything else we need to know?
No response
Relevant SIG(s)
/sig node