Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support chart, node-exporter: tolerate 2i2c.org/community tainted nodes #3209

Merged

Conversation

consideRatio
Copy link
Contributor

@consideRatio consideRatio commented Oct 2, 2023

The prometheus-node-exporter chart declares node taint tolerations by default in a way that it allows node-exporter to schedule on nodes with a "NoSchedule" taint, no matter what kind of key/value that goes with the effect of "NoSchedule".

This makes more sense for us than to declare tolerations for all individual taints we may declare, such as 2i2c.org/community.

Without this fix, we end up without node-exporter running on 2i2c.org/community tainted nodes, which in turn makes us unable to get statistics about user pods cpu and memory usage etc for pods on such nodes.

@consideRatio consideRatio requested a review from a team as a code owner October 2, 2023 09:38
@consideRatio consideRatio changed the title support, node-exporter: tolerate 2i2c.org/community tainted nodes support chart, node-exporter: tolerate 2i2c.org/community tainted nodes Oct 2, 2023
@github-actions

This comment was marked as resolved.

Comment on lines 87 to -102
prometheus-node-exporter:
tolerations:
# Tolerate tainted jupyterhub user nodes
- key: hub.jupyter.org_dedicated
value: user
effect: NoSchedule
- key: hub.jupyter.org/dedicated
value: user
effect: NoSchedule
# Tolerate tainted dask worker nodes
- key: k8s.dask.org_dedicated
value: worker
effect: NoSchedule
- key: k8s.dask.org/dedicated
value: worker
effect: NoSchedule
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These are the chart's defaults:

tolerations:
  - effect: NoSchedule
    operator: Exists

Copy link
Member

@yuvipanda yuvipanda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good catch!

@consideRatio
Copy link
Contributor Author

Thank you @yuvipanda for reviewing!!

@consideRatio consideRatio merged commit 6223891 into 2i2c-org:master Oct 2, 2023
32 checks passed
@github-actions
Copy link

github-actions bot commented Oct 2, 2023

🎉🎉🎉🎉

Monitor the deployment of the hubs here 👉 https://github.com/2i2c-org/infrastructure/actions/runs/6384535727

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Status: Done 🎉
Development

Successfully merging this pull request may close these issues.

2 participants