Skip to content

Infrastructure Integration

Configuration:

  1. Ensure that Docker is installed and running on your server.

  2. Add epoch-dd-agent user to docker group. Execute the following command

usermod -aG docker epoch-dd-agent

Execute the following command to make sure epoch-dd-agent user is added to docker group

id epoch-dd-agent

You should be able to see docker in the groups= section.

Restart the collector service by executing the following command.

/etc/init.d/epoch-collectors restart

Note : This integration may trigger WARNING's in the collector logs complaining about Permission denied errors while performing a disk metrics check.

To fix these WARNING's please edit /etc/nutanix/epoch-dd-agent/conf.d/disk.yaml.default and specify the files-systems and the directories that you want excluded from the check. The excluded file systems are specified under "excluded_filesystems" section and the excluded directories are specified in "excluded_mountpoint_re" section of the configuration file. An example below illustrates how to exclude file systems devtmpfs, proc or tmpfs and directories /var/lib/docker/(devicemapper|overlay).*.

Following is a sample disk.yaml.default file.

    init_config:

    instances:
      - use_mount: no
        excluded_filesystems:
          - devtmpfs
          - proc
          - tmpfs
        excluded_mountpoint_re: '/var/lib/docker/(devicemapper|overlay).*'
  1. Edit docker_daemon.yaml to configure the agent. Please refer docker_daemon.yaml for all available configuration options.
    init_config:
    
    instances:
        url: "unix://var/run/docker.sock"
    

Infrastructure Datasources

Datasource Available Aggregations Unit Description
docker.cpu.system avg max min sum percent The percent of time the CPU is executing system calls on behalf of processes of this container
docker.cpu.system.95percentile avg max min sum percent 95th percentile of docker.cpu.system
docker.cpu.system.avg avg max min sum percent Average value of docker.cpu.system
docker.cpu.system.count avg max min sum sample/second The rate that the value of docker.cpu.system was sampled
docker.cpu.system.max avg max min sum percent Max value of docker.cpu.system
docker.cpu.system.median avg max min sum percent Median value of docker.cpu.system
docker.cpu.user avg max min sum percent The percent of time the CPU is under direct control of processes of this container
docker.cpu.user.95percentile avg max min sum percent 95th percentile of docker.cpu.user
docker.cpu.user.avg avg max min sum percent Average value of docker.cpu.user
docker.cpu.user.count avg max min sum sample/second The rate that the value of docker.cpu.user was sampled
docker.cpu.user.max avg max min sum percent Max value of docker.cpu.user
docker.cpu.user.median avg max min sum percent Median value of docker.cpu.user
docker.cpu.usage avg max min sum percent The percent of CPU time obtained by this container
docker.cpu.throttled avg max min sum Number of times the cgroup has been throttled
docker.mem.cache avg max min sum byte The amount of memory that is being used to cache data from disk (e.g. memory contents that can be associated precisely with a block on a block device)
docker.mem.cache.95percentile avg max min sum byte 95th percentile value of docker.mem.cache
docker.mem.cache.avg avg max min sum byte Average value of docker.mem.cache
docker.mem.cache.count avg max min sum sample/second The rate that the value of docker.mem.cache was sampled
docker.mem.cache.max avg max min sum byte Max value of docker.mem.cache
docker.mem.cache.median avg max min sum byte Median value of docker.mem.cache
docker.mem.rss avg max min sum byte The amount of non-cache memory that belongs to the container's processes. Used for stacks, heaps, etc.
docker.mem.rss.95percentile avg max min sum byte 95th percentile value of docker.mem.rss
docker.mem.rss.avg avg max min sum byte Average value of docker.mem.rss
docker.mem.rss.count avg max min sum sample/second The rate that the value of docker.mem.rss was sampled
docker.mem.rss.max avg max min sum byte Max value of docker.mem.rss
docker.mem.rss.median avg max min sum byte Median value of docker.mem.rss
docker.mem.swap avg max min sum byte The amount of swap currently used by the container
docker.mem.swap.95percentile avg max min sum byte 95th percentile value of docker.mem.swap
docker.mem.swap.avg avg max min sum byte Average value of docker.mem.swap
docker.mem.swap.count avg max min sum sample/second The rate that the value of docker.mem.swap was sampled
docker.mem.swap.max avg max min sum byte Max value of docker.mem.swap
docker.mem.swap.median avg max min sum byte Median value of docker.mem.swap
docker.container.size_rw avg max min sum byte Total size of all the files in the container which have been created or changed by processes running in the container
docker.container.size_rw.95percentile avg max min sum byte 95th percentile of docker.container.size_rw
docker.container.size_rw.avg avg max min sum byte Average value of docker.container.size_rw
docker.container.size_rw.count avg max min sum sample/second The rate that the value of docker.container.size_rw was sampled
docker.container.size_rw.max avg max min sum byte Max value of docker.container.size_rw
docker.container.size_rw.median avg max min sum byte Median value of docker.container.size_rw
docker.container.size_rootfs avg max min sum byte Total size of all the files in the container
docker.container.size_rootfs.95percentile avg max min sum byte 95th percentile of docker.container.size_rootfs
docker.container.size_rootfs.avg avg max min sum byte Average value of docker.container.size_rootfs
docker.container.size_rootfs.count avg max min sum sample/second The rate that the value of docker.container.size_rw was sampled
docker.container.size_rootfs.max avg max min sum byte Max value of docker.container.size_rootfs
docker.container.size_rootfs.median avg max min sum byte Median value of docker.container.size_rootfs
docker.containers.running avg max min sum The number of containers running on this host
docker.containers.stopped avg max min sum The number of containers stopped on this host
docker.images.available avg max min sum The number of top-level images
docker.images.intermediate avg max min sum The number of intermediate images, which are intermediate layers that make up other images
docker.mem.limit avg max min sum byte The memory limit for the container, if set
docker.mem.limit.95percentile avg max min sum byte 95th percentile of docker.mem.limit. Ordinarily this value will not change
docker.mem.limit.avg avg max min sum byte Average value of docker.mem.limit. Ordinarily this value will not change
docker.mem.limit.count avg max min sum sample/second The rate that the value of docker.mem.limit was sampled
docker.mem.limit.max avg max min sum byte Max value of docker.mem.limit. Ordinarily this value will not change
docker.mem.limit.median avg max min sum byte Median value of docker.mem.limit. Ordinarily this value will not change
docker.mem.sw_limit avg max min sum byte The swap + memory limit for the container, if set
docker.mem.sw_limit.95percentile avg max min sum byte 95th percentile of docker.mem.sw_limit. Ordinarily this value will not change
docker.mem.sw_limit.avg avg max min sum byte Average value of docker.mem.sw_limit. Ordinarily this value will not change
docker.mem.sw_limit.count avg max min sum sample/second The rate that the value of docker.mem.sw_limit was sampled
docker.mem.sw_limit.max avg max min sum byte Max value of docker.mem.sw_limit. Ordinarily this value will not change
docker.mem.sw_limit.median avg max min sum byte Median value of docker.mem.sw_limit. Ordinarily this value will not change
docker.mem.in_use avg max min sum fraction The fraction of used memory to available memory, if the limit is set
docker.mem.in_use.95percentile avg max min sum fraction 95th percentile of docker.mem.in_use
docker.mem.in_use.avg avg max min sum fraction Average value of docker.mem.in_use
docker.mem.in_use.count avg max min sum sample/second The rate that the value of docker.mem.in_use was sampled
docker.mem.in_use.max avg max min sum fraction Max value of docker.container.mem.in_use
docker.mem.in_use.median avg max min sum fraction Median value of docker.container.mem.in_use
docker.mem.sw_in_use avg max min sum fraction The fraction of used swap + memory to available swap + memory, if the limit is set
docker.mem.sw_in_use.95percentile avg max min sum fraction 95th percentile of docker.mem.sw_in_use
docker.mem.sw_in_use.avg avg max min sum fraction Average value of docker.mem.sw_in_use
docker.mem.sw_in_use.count avg max min sum sample/second The rate that the value of docker.mem.sw_in_use was sampled
docker.mem.sw_in_use.max avg max min sum fraction Max value of docker.container.mem.sw_in_use
docker.mem.sw_in_use.median avg max min sum fraction Median value of docker.container.mem.sw_in_use
docker.io.read_bytes avg max min sum byte/second Bytes read per second from disk by the processes of the container
docker.io.read_bytes.95percentile avg max min sum byte/second 95th percentile of docker.io.read_bytes
docker.io.read_bytes.avg avg max min sum byte/second Average value of docker.io.read_bytes
docker.io.read_bytes.count avg max min sum sample/second The rate that the value of docker.io.read_bytes was sampled
docker.io.read_bytes.max avg max min sum byte/second Max value of docker.container.io.read_bytes
docker.io.read_bytes.median avg max min sum byte/second Median value of docker.container.io.read_bytes
docker.io.write_bytes avg max min sum byte/second Bytes written per second to disk by the processes of the container
docker.io.write_bytes.95percentile avg max min sum byte/second 95th percentile of docker.io.write_bytes
docker.io.write_bytes.avg avg max min sum byte/second Average value of docker.io.write_bytes
docker.io.write_bytes.count avg max min sum sample/second The rate that the value of docker.io.write_bytes was sampled
docker.io.write_bytes.max avg max min sum byte/second Max value of docker.container.io.write_bytes
docker.io.write_bytes.median avg max min sum byte/second Median value of docker.container.io.write_bytes
docker.image.virtual_size avg max min sum byte Size of all layers of the image on disk
docker.image.size avg max min sum byte Size of all layers of the image on disk
docker.net.bytes_rcvd avg max min sum byte/second Bytes received per second from the network
docker.net.bytes_rcvd.95percentile avg max min sum byte/second 95th percentile of docker.net.bytes_rcvd
docker.net.bytes_rcvd.avg avg max min sum byte/second Average value of docker.net.bytes_rcvd
docker.net.bytes_rcvd.count avg max min sum sample/second The rate that the value of docker.net.bytes_rcvd was sampled
docker.net.bytes_rcvd.max avg max min sum byte/second Max value of docker.container.net.bytes_rcvd
docker.net.bytes_rcvd.median avg max min sum byte/second Median value of docker.container.net.bytes_rcvd
docker.net.bytes_sent avg max min sum byte/second Bytes sent per second to the network
docker.net.bytes_sent_bytes.95percentile avg max min sum byte/second 95th percentile of docker.net.bytes_sent_bytes
docker.net.bytes_sent_bytes.avg avg max min sum byte/second Average value of docker.net.bytes_sent_bytes
docker.net.bytes_sent_bytes.count avg max min sum sample/second The rate that the value of docker.net.bytes_sent_bytes was sampled
docker.net.bytes_sent_bytes.max avg max min sum byte/second Max value of docker.container.net.bytes_sent_bytes
docker.net.bytes_sent_bytes.median avg max min sum byte/second Median value of docker.container.net.bytes_sent_bytes
docker.data.used avg max min sum byte Storage pool disk space used
docker.data.free avg max min sum byte Storage pool disk space free
docker.data.total avg max min sum byte Storage pool disk space total
docker.data.percent avg max min sum percent The percent of storage pool used
docker.metadata.used avg max min sum byte Storage pool metadata space used
docker.metadata.free avg max min sum byte Storage pool metadata space free
docker.metadata.total avg max min sum byte Storage pool metadata space total
docker.metadata.percent avg max min sum percent The percent of storage pool metadata used

Events

Docker events are enabled by default and can be disabled using the collect_events flag in docker_daemon.yaml. The following events are collected by default:

- Delete Image
- Die
- Error
- Fail
- Kill
- Out of memory (oom)
- Pause
- Restart container
- Restart Daemon
- Update