Infrastructure Integration
Configuration:
- Ensure that Docker is installed and running on your server.
- Add the `epoch-dd-agent` user to the `docker` group by executing the following command:

      usermod -aG docker epoch-dd-agent

  To verify that the `epoch-dd-agent` user was added to the `docker` group, execute:

      id epoch-dd-agent

  You should see `docker` in the `groups=` section of the output.

  Restart the collector service by executing the following command:

      /etc/init.d/epoch-collectors restart

Note: This integration may trigger `WARNING` messages in the collector logs that report `Permission denied` errors during the disk metrics check. To fix these warnings, edit `/etc/nutanix/epoch-dd-agent/conf.d/disk.yaml.default` and specify the file systems and directories that you want excluded from the check. Excluded file systems are listed in the `excluded_filesystems` section, and excluded directories are specified as a regular expression in the `excluded_mountpoint_re` section of the configuration file. The example below excludes the file systems `devtmpfs`, `proc`, and `tmpfs`, and the directories matching `/var/lib/docker/(devicemapper|overlay).*`.

The following is a sample `disk.yaml.default` file:

    init_config:

    instances:
      - use_mount: no
        excluded_filesystems:
          - devtmpfs
          - proc
          - tmpfs
        excluded_mountpoint_re: '/var/lib/docker/(devicemapper|overlay).*'

- Edit `docker_daemon.yaml` to configure the agent. Refer to `docker_daemon.yaml` for all available configuration options.

      init_config:

      instances:
        - url: "unix://var/run/docker.sock"

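To see which mount points the sample `excluded_mountpoint_re` pattern from `disk.yaml.default` would skip, the following minimal Python sketch applies it to a few mount points. It assumes the collector matches the expression against the start of each mount point path, as `re.match` does; the helper name `is_excluded` and the example paths are hypothetical and not part of the collector.

```python
import re

# Pattern copied from the sample disk.yaml.default in this section.
EXCLUDED_MOUNTPOINT_RE = re.compile(r"/var/lib/docker/(devicemapper|overlay).*")


def is_excluded(mountpoint):
    """Return True if the mount point matches the exclusion pattern.

    Assumption: the pattern is anchored at the start of the path (re.match).
    """
    return EXCLUDED_MOUNTPOINT_RE.match(mountpoint) is not None


if __name__ == "__main__":
    # Hypothetical mount points, for illustration only.
    candidates = [
        "/",
        "/var/lib/docker/overlay/1a2b/merged",
        "/var/lib/docker/devicemapper/mnt/3c4d",
        "/var/lib/docker/volumes",
    ]
    for path in candidates:
        print(path, "excluded" if is_excluded(path) else "checked")
```

Because the pattern ends in `.*`, paths under `/var/lib/docker/overlay2` also match it. Testing a pattern this way before editing the configuration file can help avoid excluding more directories than intended.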
Infrastructure Datasources
Datasource | Available Aggregations | Unit | Description |
---|---|---|---|
docker.cpu.system | avg max min sum | percent | The percent of time the CPU is executing system calls on behalf of processes of this container |
docker.cpu.system.95percentile | avg max min sum | percent | 95th percentile of docker.cpu.system |
docker.cpu.system.avg | avg max min sum | percent | Average value of docker.cpu.system |
docker.cpu.system.count | avg max min sum | sample/second | The rate that the value of docker.cpu.system was sampled |
docker.cpu.system.max | avg max min sum | percent | Max value of docker.cpu.system |
docker.cpu.system.median | avg max min sum | percent | Median value of docker.cpu.system |
docker.cpu.user | avg max min sum | percent | The percent of time the CPU is under direct control of processes of this container |
docker.cpu.user.95percentile | avg max min sum | percent | 95th percentile of docker.cpu.user |
docker.cpu.user.avg | avg max min sum | percent | Average value of docker.cpu.user |
docker.cpu.user.count | avg max min sum | sample/second | The rate that the value of docker.cpu.user was sampled |
docker.cpu.user.max | avg max min sum | percent | Max value of docker.cpu.user |
docker.cpu.user.median | avg max min sum | percent | Median value of docker.cpu.user |
docker.cpu.usage | avg max min sum | percent | The percent of CPU time obtained by this container |
docker.cpu.throttled | avg max min sum | | Number of times the cgroup has been throttled |
docker.mem.cache | avg max min sum | byte | The amount of memory that is being used to cache data from disk (e.g. memory contents that can be associated precisely with a block on a block device) |
docker.mem.cache.95percentile | avg max min sum | byte | 95th percentile value of docker.mem.cache |
docker.mem.cache.avg | avg max min sum | byte | Average value of docker.mem.cache |
docker.mem.cache.count | avg max min sum | sample/second | The rate that the value of docker.mem.cache was sampled |
docker.mem.cache.max | avg max min sum | byte | Max value of docker.mem.cache |
docker.mem.cache.median | avg max min sum | byte | Median value of docker.mem.cache |
docker.mem.rss | avg max min sum | byte | The amount of non-cache memory that belongs to the container's processes. Used for stacks, heaps, etc. |
docker.mem.rss.95percentile | avg max min sum | byte | 95th percentile value of docker.mem.rss |
docker.mem.rss.avg | avg max min sum | byte | Average value of docker.mem.rss |
docker.mem.rss.count | avg max min sum | sample/second | The rate that the value of docker.mem.rss was sampled |
docker.mem.rss.max | avg max min sum | byte | Max value of docker.mem.rss |
docker.mem.rss.median | avg max min sum | byte | Median value of docker.mem.rss |
docker.mem.swap | avg max min sum | byte | The amount of swap currently used by the container |
docker.mem.swap.95percentile | avg max min sum | byte | 95th percentile value of docker.mem.swap |
docker.mem.swap.avg | avg max min sum | byte | Average value of docker.mem.swap |
docker.mem.swap.count | avg max min sum | sample/second | The rate that the value of docker.mem.swap was sampled |
docker.mem.swap.max | avg max min sum | byte | Max value of docker.mem.swap |
docker.mem.swap.median | avg max min sum | byte | Median value of docker.mem.swap |
docker.container.size_rw | avg max min sum | byte | Total size of all the files in the container which have been created or changed by processes running in the container |
docker.container.size_rw.95percentile | avg max min sum | byte | 95th percentile of docker.container.size_rw |
docker.container.size_rw.avg | avg max min sum | byte | Average value of docker.container.size_rw |
docker.container.size_rw.count | avg max min sum | sample/second | The rate that the value of docker.container.size_rw was sampled |
docker.container.size_rw.max | avg max min sum | byte | Max value of docker.container.size_rw |
docker.container.size_rw.median | avg max min sum | byte | Median value of docker.container.size_rw |
docker.container.size_rootfs | avg max min sum | byte | Total size of all the files in the container |
docker.container.size_rootfs.95percentile | avg max min sum | byte | 95th percentile of docker.container.size_rootfs |
docker.container.size_rootfs.avg | avg max min sum | byte | Average value of docker.container.size_rootfs |
docker.container.size_rootfs.count | avg max min sum | sample/second | The rate that the value of docker.container.size_rootfs was sampled |
docker.container.size_rootfs.max | avg max min sum | byte | Max value of docker.container.size_rootfs |
docker.container.size_rootfs.median | avg max min sum | byte | Median value of docker.container.size_rootfs |
docker.containers.running | avg max min sum | | The number of containers running on this host |
docker.containers.stopped | avg max min sum | | The number of containers stopped on this host |
docker.images.available | avg max min sum | | The number of top-level images |
docker.images.intermediate | avg max min sum | | The number of intermediate images, which are intermediate layers that make up other images |
docker.mem.limit | avg max min sum | byte | The memory limit for the container, if set |
docker.mem.limit.95percentile | avg max min sum | byte | 95th percentile of docker.mem.limit. Ordinarily this value will not change |
docker.mem.limit.avg | avg max min sum | byte | Average value of docker.mem.limit. Ordinarily this value will not change |
docker.mem.limit.count | avg max min sum | sample/second | The rate that the value of docker.mem.limit was sampled |
docker.mem.limit.max | avg max min sum | byte | Max value of docker.mem.limit. Ordinarily this value will not change |
docker.mem.limit.median | avg max min sum | byte | Median value of docker.mem.limit. Ordinarily this value will not change |
docker.mem.sw_limit | avg max min sum | byte | The swap + memory limit for the container, if set |
docker.mem.sw_limit.95percentile | avg max min sum | byte | 95th percentile of docker.mem.sw_limit. Ordinarily this value will not change |
docker.mem.sw_limit.avg | avg max min sum | byte | Average value of docker.mem.sw_limit. Ordinarily this value will not change |
docker.mem.sw_limit.count | avg max min sum | sample/second | The rate that the value of docker.mem.sw_limit was sampled |
docker.mem.sw_limit.max | avg max min sum | byte | Max value of docker.mem.sw_limit. Ordinarily this value will not change |
docker.mem.sw_limit.median | avg max min sum | byte | Median value of docker.mem.sw_limit. Ordinarily this value will not change |
docker.mem.in_use | avg max min sum | fraction | The fraction of used memory to available memory, if the limit is set |
docker.mem.in_use.95percentile | avg max min sum | fraction | 95th percentile of docker.mem.in_use |
docker.mem.in_use.avg | avg max min sum | fraction | Average value of docker.mem.in_use |
docker.mem.in_use.count | avg max min sum | sample/second | The rate that the value of docker.mem.in_use was sampled |
docker.mem.in_use.max | avg max min sum | fraction | Max value of docker.mem.in_use |
docker.mem.in_use.median | avg max min sum | fraction | Median value of docker.mem.in_use |
docker.mem.sw_in_use | avg max min sum | fraction | The fraction of used swap + memory to available swap + memory, if the limit is set |
docker.mem.sw_in_use.95percentile | avg max min sum | fraction | 95th percentile of docker.mem.sw_in_use |
docker.mem.sw_in_use.avg | avg max min sum | fraction | Average value of docker.mem.sw_in_use |
docker.mem.sw_in_use.count | avg max min sum | sample/second | The rate that the value of docker.mem.sw_in_use was sampled |
docker.mem.sw_in_use.max | avg max min sum | fraction | Max value of docker.mem.sw_in_use |
docker.mem.sw_in_use.median | avg max min sum | fraction | Median value of docker.mem.sw_in_use |
docker.io.read_bytes | avg max min sum | byte/second | Bytes read per second from disk by the processes of the container |
docker.io.read_bytes.95percentile | avg max min sum | byte/second | 95th percentile of docker.io.read_bytes |
docker.io.read_bytes.avg | avg max min sum | byte/second | Average value of docker.io.read_bytes |
docker.io.read_bytes.count | avg max min sum | sample/second | The rate that the value of docker.io.read_bytes was sampled |
docker.io.read_bytes.max | avg max min sum | byte/second | Max value of docker.io.read_bytes |
docker.io.read_bytes.median | avg max min sum | byte/second | Median value of docker.io.read_bytes |
docker.io.write_bytes | avg max min sum | byte/second | Bytes written per second to disk by the processes of the container |
docker.io.write_bytes.95percentile | avg max min sum | byte/second | 95th percentile of docker.io.write_bytes |
docker.io.write_bytes.avg | avg max min sum | byte/second | Average value of docker.io.write_bytes |
docker.io.write_bytes.count | avg max min sum | sample/second | The rate that the value of docker.io.write_bytes was sampled |
docker.io.write_bytes.max | avg max min sum | byte/second | Max value of docker.io.write_bytes |
docker.io.write_bytes.median | avg max min sum | byte/second | Median value of docker.io.write_bytes |
docker.image.virtual_size | avg max min sum | byte | Size of all layers of the image on disk |
docker.image.size | avg max min sum | byte | Size of all layers of the image on disk |
docker.net.bytes_rcvd | avg max min sum | byte/second | Bytes received per second from the network |
docker.net.bytes_rcvd.95percentile | avg max min sum | byte/second | 95th percentile of docker.net.bytes_rcvd |
docker.net.bytes_rcvd.avg | avg max min sum | byte/second | Average value of docker.net.bytes_rcvd |
docker.net.bytes_rcvd.count | avg max min sum | sample/second | The rate that the value of docker.net.bytes_rcvd was sampled |
docker.net.bytes_rcvd.max | avg max min sum | byte/second | Max value of docker.net.bytes_rcvd |
docker.net.bytes_rcvd.median | avg max min sum | byte/second | Median value of docker.net.bytes_rcvd |
docker.net.bytes_sent | avg max min sum | byte/second | Bytes sent per second to the network |
docker.net.bytes_sent_bytes.95percentile | avg max min sum | byte/second | 95th percentile of docker.net.bytes_sent_bytes |
docker.net.bytes_sent_bytes.avg | avg max min sum | byte/second | Average value of docker.net.bytes_sent_bytes |
docker.net.bytes_sent_bytes.count | avg max min sum | sample/second | The rate that the value of docker.net.bytes_sent_bytes was sampled |
docker.net.bytes_sent_bytes.max | avg max min sum | byte/second | Max value of docker.net.bytes_sent_bytes |
docker.net.bytes_sent_bytes.median | avg max min sum | byte/second | Median value of docker.net.bytes_sent_bytes |
docker.data.used | avg max min sum | byte | Storage pool disk space used |
docker.data.free | avg max min sum | byte | Storage pool disk space free |
docker.data.total | avg max min sum | byte | Storage pool disk space total |
docker.data.percent | avg max min sum | percent | The percent of storage pool used |
docker.metadata.used | avg max min sum | byte | Storage pool metadata space used |
docker.metadata.free | avg max min sum | byte | Storage pool metadata space free |
docker.metadata.total | avg max min sum | byte | Storage pool metadata space total |
docker.metadata.percent | avg max min sum | percent | The percent of storage pool metadata used |
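
Many datasources in the table come in families: a base metric plus `.avg`, `.max`, `.median`, `.95percentile`, and `.count` variants. The following Python sketch shows one way such summary values could be derived from the raw samples gathered during a single reporting interval. It is an illustration only, not the collector's implementation: the function name `summarize`, the nearest-rank percentile method, and the sample values are all assumptions.

```python
import math
import statistics


def summarize(samples, interval_seconds):
    """Summarize the raw samples of one metric over one reporting interval.

    Assumptions (not confirmed by the collector): the 95th percentile uses
    the nearest-rank method, and "count" is the number of samples divided by
    the interval length, which yields a sample/second rate.
    """
    if not samples:
        raise ValueError("at least one sample is required")
    ordered = sorted(samples)
    n = len(ordered)
    # Nearest-rank percentile: smallest value with at least 95% of samples
    # at or below it.
    rank = max(0, math.ceil(0.95 * n) - 1)
    return {
        "avg": sum(ordered) / n,
        "max": ordered[-1],
        "median": statistics.median(ordered),
        "95percentile": ordered[rank],
        "count": n / interval_seconds,
    }


if __name__ == "__main__":
    # Hypothetical CPU percentages sampled during a 20-second interval.
    print(summarize([10, 20, 30, 40], interval_seconds=20))
```

For metrics that rarely change, such as `docker.mem.limit`, every summary value except the `.count` rate would equal the base value, which is consistent with the "Ordinarily this value will not change" remarks in the table.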
Events
Docker events are enabled by default and can be disabled using the `collect_events` flag in `docker_daemon.yaml`. The following events are collected by default:
- Delete Image
- Die
- Error
- Fail
- Kill
- Out of memory (oom)
- Pause
- Restart container
- Restart Daemon
- Update