Plugin: ebpf.plugin Module: oomkill
Monitor applications that run out of memory.
Attaches a tracepoint to internal kernel functions.
This collector is only supported on the following platforms:
This collector supports collecting metrics from multiple instances of this integration, including remote instances.
The plugin needs setuid because it loads data into the kernel. Netdata sets the necessary permissions during installation.
The plugin checks kernel compilation flags (CONFIG_KPROBES, CONFIG_BPF, CONFIG_BPF_SYSCALL, CONFIG_BPF_JIT), files inside debugfs, and presence of BTF files to decide which eBPF program will be attached.
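One of the inputs to that decision can be inspected by hand. The following is a minimal sketch, not the plugin's own detection logic: it assumes the kernel exposes BTF data at the common location /sys/kernel/btf/vmlinux, and the helper name has_kernel_btf is hypothetical. The plugin may look at other paths as well.

```shell
# Hypothetical helper: succeed when a readable, non-empty BTF file exists.
# Assumption: kernels built with BTF support expose it at /sys/kernel/btf/vmlinux.
has_kernel_btf() {
    btf_file="${1:-/sys/kernel/btf/vmlinux}"
    [ -r "$btf_file" ] && [ -s "$btf_file" ]
}

# Example usage:
# if has_kernel_btf; then echo "BTF available"; else echo "BTF not found"; fi
```

Passing a path as the first argument lets you point the check at a different BTF file if your distribution ships it elsewhere.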
The default configuration for this integration does not impose any limits on data collection.
This thread adds overhead every time an internal kernel function that it monitors is called.
Check whether your kernel was compiled with the necessary options (CONFIG_KPROBES, CONFIG_BPF, CONFIG_BPF_SYSCALL, CONFIG_BPF_JIT) in /proc/config.gz or in the /boot/config file. Some of the cited names can differ according to the preferences of Linux distributions.
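The check described above can be scripted. This is a rough sketch under stated assumptions: the function name check_ebpf_kernel_options is hypothetical, a file ending in .gz is assumed to be gzip-compressed, and the /boot/config-$(uname -r) path in the usage comment is a common distribution convention rather than something guaranteed on your system.

```shell
# Hypothetical helper: report the state of each option in a kernel config file.
# Usage: check_ebpf_kernel_options <config-file>
# Returns 0 when every option is set to y or m, 1 otherwise.
check_ebpf_kernel_options() {
    config="$1"
    # Decompress files ending in .gz (such as /proc/config.gz); read others as-is.
    case "$config" in
        *.gz) reader="gzip -dc" ;;
        *)    reader="cat" ;;
    esac
    missing=0
    for opt in CONFIG_KPROBES CONFIG_BPF CONFIG_BPF_SYSCALL CONFIG_BPF_JIT; do
        # The trailing "=" keeps CONFIG_BPF from matching CONFIG_BPF_SYSCALL.
        if $reader "$config" 2>/dev/null | grep "^${opt}=[ym]" >/dev/null; then
            echo "$opt: set"
        else
            echo "$opt: missing"
            missing=1
        fi
    done
    return "$missing"
}

# Example usage (the /boot file name is an assumption and varies by distribution):
# check_ebpf_kernel_options /proc/config.gz || check_ebpf_kernel_options "/boot/config-$(uname -r)"
```

An unreadable or absent file is reported as all options missing, so trying both locations in sequence is harmless.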
When these options are not set, you need to get the kernel source code from https://kernel.org or a kernel package from your distribution; the latter is preferred. Kernel compilation follows a well-defined pattern, but distributions can deliver their configuration files with different names.
Now follow steps:
initrd) if it is necessary.
This thread needs to attach a tracepoint to monitor when a process schedules an exit event. To enable this feature, debugfs must be mounted (mount -t debugfs none /sys/kernel/debug/).
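Before mounting, it can help to check whether debugfs is already mounted. The sketch below is one way to do that, assuming the mount table is readable at /proc/mounts; the helper name debugfs_is_mounted is hypothetical.

```shell
# Hypothetical helper: succeed when debugfs is mounted at the given mount point.
# Usage: debugfs_is_mounted [mount-point] [mounts-file]
debugfs_is_mounted() {
    mount_point="${1:-/sys/kernel/debug}"
    mounts_file="${2:-/proc/mounts}"
    # The mount table lists paths without a trailing slash.
    mount_point="${mount_point%/}"
    # Fields in /proc/mounts: device mount-point fs-type options dump pass
    awk -v mp="$mount_point" \
        '$2 == mp && $3 == "debugfs" { found = 1 } END { exit !found }' \
        "$mounts_file"
}

# Example usage: mount only when it is not already mounted (requires root).
# debugfs_is_mounted || sudo mount -t debugfs none /sys/kernel/debug/
```

The second argument exists only so the check can be run against a saved copy of the mount table.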
The configuration file name for this integration is ebpf.d/oomkill.conf.
You can edit the configuration file using the edit-config script from the Netdata config directory.
cd /etc/netdata 2>/dev/null || cd /opt/netdata/etc/netdata
sudo ./edit-config ebpf.d/oomkill.conf
Overwrite default configuration reducing number of I/O events
There are no configuration examples.
Metrics grouped by scope.
The scope defines the instance that the metric belongs to. An instance is uniquely identified by a set of labels.
These metrics show the cgroups/services that reached OOM.
This scope has no labels.
Metrics:
Metric | Dimensions | Unit |
---|---|---|
cgroup.oomkills | cgroup name | kills |
services.oomkills | a dimension per systemd service | kills |
These metrics show the application groups that reached OOM.
Labels:
Label | Description |
---|---|
app_group | The name of the group defined in the configuration. |
Metrics:
Metric | Dimensions | Unit |
---|---|---|
app.oomkill | kills | kills |
There are no alerts configured by default for this integration.