
Nodes unreachable under high memory; kubelet not evicting pods #11312

Open
asemarian opened this issue Nov 13, 2024 · 3 comments

@asemarian
Hello,

I’m running a Kubernetes cluster with k3s, and I’ve been experiencing intermittent issues with some of the nodes. Occasionally a node becomes unreachable, flipping from “Ready” to “NotReady,” which makes all workloads on it inaccessible. I often need to reboot the node to resolve the issue. The problem is very similar to the one described here.

In the most recent incident, I noticed that the Prometheus pod was consuming unusually high memory, which seemed to trigger the issue (no memory limits are set on the pod). My question is: why isn’t memory pressure kicking in and prompting the kubelet to evict pods? I’m using the kube-hetzner project, mostly with default settings. The node in question has 3 vCPUs and 4 GB RAM. These are the kubelet args taken from /etc/rancher/k3s/config.yaml:

"kubelet-arg":
- "cloud-provider=external"
- "volume-plugin-dir=/var/lib/kubelet/volumeplugins"
- "kube-reserved=cpu=50m,memory=300Mi,ephemeral-storage=1Gi"
- "system-reserved=cpu=250m,memory=300Mi"

It's worth mentioning that I haven't been able to reproduce the issue: when I stress-tested the node, pods were always killed before they could make the node unstable.

Is there a configuration I can adjust to ensure the kubelet has enough headroom to start evicting pods before the node becomes completely unreachable? I’ve observed that, in another managed cluster we use, nodes never go down under similar conditions. Instead, the node is marked with MemoryPressure and the eviction process starts to prevent node instability.

Any insights on how to achieve similar resilience would be greatly appreciated! Thank you.

@brandond
Member

Does this node have swap enabled? Why aren't you setting memory limits that are at least lower than what's available on the node? Prometheus is pretty intense; I'm not sure I'd try to run it with less than 4 GB allocated just to it, let alone run it on a node with only 4 GB total.
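
For instance, something along these lines in the Prometheus container spec (the values here are illustrative only and assume roughly 2-3Gi of expected usage, so adjust them to what Prometheus actually needs on your cluster):

```yaml
# illustrative resource settings for the Prometheus container
resources:
  requests:
    memory: "2Gi"
    cpu: "500m"
  limits:
    memory: "3Gi"
```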

@asemarian
Author

asemarian commented Nov 13, 2024

Hi @brandond. Thanks for the prompt response.

I don't want to give the impression that the problem is with Prometheus itself; the same problem has happened in the past on nodes that were not running Prometheus. To be clear, Prometheus does use a lot of memory, around 2.5Gi to be more precise. That said, setting a memory limit did not resolve the issue: the pod was never OOM-killed, for whatever reason. What's more, the same issue has occurred on other nodes with 8Gi of memory. And no, swap is not enabled on any of our nodes.
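
In case it helps, this is roughly how I've been checking whether the limit was actually applied and whether anything was OOM-killed (the namespace, pod, and node names below are placeholders for my setup):

```sh
# confirm the limit is actually set on the running pod
kubectl -n monitoring get pod prometheus-0 -o jsonpath='{.spec.containers[*].resources}'

# look for OOMKilled in the container's last terminated state
kubectl -n monitoring describe pod prometheus-0 | grep -A5 'Last State'

# check whether the node ever reports MemoryPressure
kubectl describe node <node-name> | grep -A8 'Conditions:'
```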

@brandond
Member

I'd probably try to figure out why the nodes are unreachable. All "NotReady" means is that the kubelet has stopped updating the Node heartbeat timestamp. As to why that is happening, you'd have to get into the logs on the node. Is K3s crashing? Is the kernel crashing? Is the node just thrashing in OOM because you haven't set any limits?
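
Roughly, that means getting onto the node itself and looking at something like the following (these commands assume systemd and the default k3s service name):

```sh
# k3s service logs -- look for crashes, restarts, or apiserver connection errors
journalctl -u k3s --since "1 hour ago" --no-pager

# kernel log -- look for OOM killer activity or panics
journalctl -k --since "1 hour ago" --no-pager | grep -iE 'out of memory|oom|panic'

# quick view of what's eating memory right now
ps aux --sort=-%mem | head -n 15
```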

@caroline-suse-rancher caroline-suse-rancher moved this from New to In Triage in K3s Development Nov 14, 2024