For the production-grade cluster, we usually want to set up some kind of monitoring. At the date, there is not a specific way to monitor Docker service and tasks in Swarm mode. We did this for Swarm2k with Telegraf, InfluxDB, and Grafana.
InfluxDB is a time-series database, which is easy to install because of no dependency. InfluxDB is useful to store metrics, information about events, and use them for later analysis. For Swarm2k, we used InfluxDB to store information of cluster, nodes, events, and for tasks with Telegraf.
Telegraf is pluggable and has a certain number of input plugins useful to observe the system environment.
We developed a new plugin for Telegraf to store stats into InfluxDB. This plugin can be found at http://github.com/chanwit/telegraf. Data may contain values, tags, and timestamp. Values will be computed or aggregated based on timestamp. Additionally, tags will allow you to group these values together...