You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
While troubleshooting vanilla Falco and previous memory related issues it
proved to be useful to have an understanding how the thread cache is growing.
To make it more visible, expose the current thread table size as a new
prometheus metric, e.g. rox_collector_events{type="threadCacheSize"}.
The numbers we're interested in could be obtained via libsinsp inspector
function get_thread_count(). Since this metric is not directly dependent on
event stream, we need to decide when exactly to take the counters current value
-- to not do unnecessary work if it's changing slowly, but to be fine-grained
enough to notice relevant spikes.
When troubleshooting vanilla Falco, the hacky solution I used was to log thread
count with throttled logging, and it was providing enough information. Thus, the
proposal is to update thread counter metric based on the number of processes
received, but with some throttling, e.g. when we receive every n'th process we
capture current thread table size.
While troubleshooting vanilla Falco and previous memory related issues it
proved to be useful to have an understanding how the thread cache is growing.
To make it more visible, expose the current thread table size as a new
prometheus metric, e.g.
rox_collector_events{type="threadCacheSize"}
.The numbers we're interested in could be obtained via libsinsp inspector
function
get_thread_count()
. Since this metric is not directly dependent onevent stream, we need to decide when exactly to take the counters current value
-- to not do unnecessary work if it's changing slowly, but to be fine-grained
enough to notice relevant spikes.
When troubleshooting vanilla Falco, the hacky solution I used was to log thread
count with throttled logging, and it was providing enough information. Thus, the
proposal is to update thread counter metric based on the number of processes
received, but with some throttling, e.g. when we receive every n'th process we
capture current thread table size.
Part of #1320
The text was updated successfully, but these errors were encountered: