ns8-metrics

This module implements the metrics engine for NethServer 8. The module is rootless and runs as a non-privileged user.

It is composed of the following services:

  • Prometheus: metrics collection and alert rule evaluation
  • Alertmanager: alert routing and notification delivery
  • alert-proxy: forwards alerts to the Nethesis portals
  • Grafana: metrics visualization, disabled by default

Behavior:

  • there is only one instance of the module in the whole cluster, and it runs only on the leader node
  • it automatically monitors all cluster nodes
  • if the leader node becomes a worker, the module is automatically removed from that node
  • Prometheus listens on the well-known port 9091 (the standard port is 9090, but it has been changed to avoid conflicts with Cockpit)
  • Alertmanager listens on the well-known port 9093
  • alert-proxy listens on the well-known port 9095
  • Grafana is disabled by default; if a Traefik route is configured, Grafana runs on the well-known port 3000
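
You can quickly check that the services answer on these ports from the leader node. This is a minimal sketch using the standard Prometheus and Alertmanager health endpoints and the ports listed above:

# Prometheus health endpoint (port 9091 on this module)
curl -s http://127.0.0.1:9091/-/healthy
# Alertmanager health endpoint
curl -s http://127.0.0.1:9093/-/healthy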

The configuration for Prometheus and Alertmanager is created when the Prometheus service is restarted. The module is restarted when a node is added to or removed from the cluster. The alert-proxy service is restarted during a subscription-change event: if there is a valid subscription, the service starts sending alerts to my.nethesis.it or my.nethserver.com.
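
If you need to inspect or restart the services manually, you can use their systemd user units through runagent. A sketch assuming the module instance is named metrics1, as in the examples below; the alert-proxy unit name is an assumption based on the service name:

# check the state of the module's services (alert-proxy unit name assumed)
runagent -m metrics1 systemctl --user status prometheus alertmanager alert-proxy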

Available alerts:

  • no swap is configured
  • swap is getting full
  • one or more backups have failed
  • partitions are getting full

By default, the system sends alerts only to the Nethesis portals. Mail notifications can be enabled by setting the mail_to parameter; see the Configure section.
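
To see which alerts are currently firing, you can query Alertmanager with amtool inside its container; a sketch assuming the container is named alertmanager, as in the template test commands later in this README:

runagent -m metrics1
# list the alerts currently known to Alertmanager
podman exec -ti alertmanager amtool --alertmanager.url=http://127.0.0.1:9093 alert query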

Install

The module is automatically installed by the cluster initialization script.

Configure

Launch configure-module with the following parameters:

  • prometheus_path: path to access the Prometheus web UI; if left blank, Prometheus will not be exposed
  • grafana_path: path to access the Grafana web UI; if left blank, Grafana will be stopped; if enabled, the default credentials are admin/admin
  • mail_to: list of email addresses that receive alerts; this requires mail notifications to be enabled at the cluster level
  • mail_from: email address used to send alerts; if left blank, the default value is alertmanager@<node_fqdn>
  • mail_template: name of the template used to send alerts; if left blank, the default template is used

Example:

api-cli run module/metrics1/configure-module --data '{"prometheus_path": "prometheus", "grafana_path": "grafana", "mail_to": ["[email protected]"], "mail_from": "[email protected]", "mail_template": ""}'

You can send a test alert to verify the mail configuration:

runagent -m metrics1 test-alert

Configuration files are saved inside the state directory. The most important files and directories are:

  • prometheus.yml: Prometheus configuration
    • prometheus.d: directory containing node configuration files
    • rules.d: directory containing custom alert rules
  • alertmanager.yml: Alertmanager configuration
    • templates.d: directory containing custom alert templates
  • local.yml: Grafana configuration, if enabled
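
You can inspect these files by entering the module environment; a minimal sketch, assuming runagent opens a shell inside the module state directory:

runagent -m metrics1
# list the configuration files described above
ls -l prometheus.yml prometheus.d rules.d alertmanager.yml templates.d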

Customize alert rules

All alert rules are defined in the rules.d directory. These files must not be modified directly, as any change will be overwritten on module update.

You can create a custom rule by adding its definition to Redis. A curated list of rules can be found at Awesome Prometheus alerts.

To add a rule, enter the module environment, store the alert definition in Redis, then reload the module:

echo '<alert_definition>' | redis-cli -x hset module/metrics1/custom_alerts <alert_id>

Example:

echo 'alert: HostMemoryUnderMemoryPressure
expr: (rate(node_vmstat_pgmajfault[5m]) > 1000)
for: 0m
labels:
  severity: warning
annotations:
  summary: Host memory under memory pressure (instance {{ $labels.instance }})
  description: "The node is under heavy memory pressure. High rate of loading memory pages from disk.\n  VALUE = {{ $value }}\n  LABELS = {{ $labels }}"' | redis-cli -x hset module/metrics1/custom_alerts mempressure

Apply the changes:

runagent -m metrics1 systemctl --user restart prometheus alertmanager
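
You can then verify that the new rule has been loaded by querying the Prometheus rules API; a minimal sketch, assuming Prometheus answers on port 9091 as noted above:

# look up the custom alert defined in the previous example
curl -s http://127.0.0.1:9091/api/v1/rules | grep -o 'HostMemoryUnderMemoryPressure'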

Customize alert mail template

You can change the mail template used to send alerts by creating a custom template in the templates.d directory.

First, create a template. Make sure not to change the custom_mail_subject and custom_mail_html names, as they are used by the module to render the mail. Execute the following command:

echo '<template>' | redis-cli -x hset module/metrics1/custom_templates <template_name>

Example:

echo '{{ define "custom_mail_subject" }}Alert on {{ range .Alerts.Firing }}{{ .Labels.instance }} {{ end }}{{ end }}
{{ define "custom_mail_html" }}
<html>
<head>
<title>Alert!</title>
</head>
<body>
{{ range .Alerts.Firing }}
<p>{{ .Labels.alertname }} on {{ .Labels.instance }}<br/>
{{ if ne .Annotations.summary "" }}{{ .Annotations.summary }}{{ end }}</p>
<p>Details:</p>
<p>
{{ range .Annotations.SortedPairs }}
  {{ .Name }} = {{ .Value }}<br/>
{{ end }}
</p>
<p>
{{ range .Labels.SortedPairs }}
  {{ .Name }} = {{ .Value }}<br/>
{{ end }}
</p>
{{ end }}
</body></html>
{{ end }}' | redis-cli -x hset module/metrics1/custom_templates mail

Apply the changes:

runagent -m metrics1 systemctl --user restart prometheus alertmanager

Then, configure the module to use the new template:

api-cli run module/metrics1/configure-module --data '{"prometheus_path": "prometheus", "grafana_path": "grafana", "mail_from": "[email protected]", "mail_to": ["[email protected]"], "mail_template": "custom_mail_html"}'

You can test the template rendering using the following commands:

runagent -m metrics1
podman exec -ti alertmanager amtool template render --template.glob='/etc/alertmanager/templates/*.tmpl' --template.text='{{ template "custom_mail_html" . }}'
podman exec -ti alertmanager amtool template render --template.glob='/etc/alertmanager/templates/*.tmpl' --template.text='{{ template "custom_mail_subject" . }}'

Testing

Test the module using the test-module.sh script:

./test-module.sh <NODE_ADDR> ghcr.io/nethserver/metrics:latest

The tests are implemented using Robot Framework.