Skip to main content

Creating Alerts with Netdata Alerts Configuration Manager

The Netdata Alerts Configuration Manager enables users with Business subscriptions to create alerts from the Netdata Dashboard with an intuitive user interface.

Using Alerts Configuration Manager

  1. Go to the Metrics tab and navigate to the chart you want to configure an alert for.

  2. Click the Alert icon on the top right corner of the chart.

  3. The Alert Configuration Manager will open up with the default thresholds. You can modify the configuration as required and the alert definition on the right will be updated dynamically.

  4. If you want more fine-grained control or access to more advanced settings, enable Show advanced

  5. Copy the alert definition that is generated in the code box and add it to an existing health configuration file or a new custom file under <path to netdata install>/etc/netdata/health.d/ on a Parent Agent or a Standalone Child Agent.

  6. Reload Netdata Alert Health checks <path to netdata install>/usr/sbin/netdatacli reload-health and the new alert is now configured.

Alerts Configuration Manager Sections

  • Alert Detection MethodAn alert is triggered whenever a metric crosses a threshold:
    • Based on a standard threshold
    • Based on metric variance
    • Based on the anomaly rate of the metric
  • Metrics Lookup, Filtering and Formula Section

    • Metrics Lookup: This is the Evaluate line of fields in the modal and it defines the parameters for the database lookup that is needed to get the value that will be compared against the alert definition. It corresponds to the lookup line of the Alert configuration file. The Alerts Configuration Manager provides a default selection for the lookup and can be modified to suit your requirements. The parameters that can be modified are:

      • METHOD (avg, sum, min, max, cv, stddev)
      • COMPUTATION (sum of all dimensions or individually for each dimension)
      • DIMENSIONS (All dimensions, or a selection of dimensions)
      • DURATION (the period in time to run the lookup)
      • OPTIONS (absolute, unaligned, percentage, min2max)
    • Alert Filtering: This functionality can be accessed through the Show advanced checkbox and it allows for filtering the alert health checks to be run only for specific components of the infrastructure. It helps in achieving a fine-grained configuration for any given alert.

      • HOSTS / NODES - By default all hosts are selected. You can pick nodes from the dropdown list, or enter a wildcard matching a list of hosts that you want the alert health check to run on. This field corresponds to the hosts line of the Alert configuration file.
      • INSTANCES - All instances are selected by default. You can pick instances from the dropdown list, or enter a wildcard matching a list of instances that you want the alert health check to run on. This field corresponds to the charts line of the Alert configuration file.
      • CHART LABELS - All chart labels are selected by default. You can pick a chart label from the dropdown list or enter a wildcard matching a list of chart labels that you want the alert health check to run on. This field corresponds to the chart labels line of the Alert configuration file.
      • OS - All Operating Systems are selected by default. You can choose which OS(s) an alert health check should run on. This field corresponds to the os line of the Alert configuration file.
    • Formula / Calculation: This field is available through the Show advanced checkbox and it is used to define a formula to be run on top of the lookup value. The result of the lookup is available in the $this variable, and after the formula is run, the result is also stored in $this and can be accessed while setting the alert thresholds. This field corresponds to the calc line of the Alert configuration file.

  • Alerting conditions

    • Warning and Critical Thresholds: These fields are used to set the thresholds for the Warning and Critical alert states, while also having the option to set the condition for the alert to be raised if it is above or below the given threshold. If the advanced settings are selected, a formula option can also be used, to define a custom formula instead of a threshold. These fields correspond to the warn and crit lines of the Alert configuration file.

      • Recovery Thresholds: This field is available through the Show advanced checkbox, and it is used to set the threshold that the metric value needs to meet to de-escalate from a given severity status, like Critical to Warning and from Warning to Clear. The logic is appended to the warn and crit lines of the Alert configuration file and resembles a one-line IF-THEN-ELSE clause.
    • Check Interval: This field is used to define the frequency of the health check for the alert and corresponds to the every line of the Alert configuration file.

    • Delay Notifications: This field is available through the Show advanced checkbox and it is used to set delay parameters on notifications for an alert severity escalation or de-escalation. It corresponds to the delay line of the Alert configuration file.

    • Agent Specific Options: These options are only available on the Netdata Agent and not honored on Netdata Cloud. They can be accessed through the Show advanced checkbox.

      • Repeat Notifications: This field defines the repeat frequency for the alert notification when the alert is in either warning or critical status and corresponds to the repeat line of the Alert configuration file.
      • Send to: This field is used to define a user role to which the alert notifications will be sent. If set to silent, then the alert won't be sent to any role. It corresponds to the to line of the Alert configuration file.
      • Custom Exec Script: This field is used to define a custom script that will be executed when the alert is triggered (but needs to be carefully designed as it needs to call the health_alarm_notify.sh module) and corresponds to the exec line of the Alert configuration file.
  • Alert Name, Description and Summary Section

    • Alert Template Name: This field uniquely identifies an alert and corresponds to the template field of the Alert configuration file. The Alerts Configuration Manager provides a default name for an Alert template but we recommend you modify this to have a meaningful name for your configured alert.
    • Alert Template Description: This field provides a brief explanation of the alert and corresponds to the info line of the Alert configuration file.
    • Alert Summary: This field enables the users to create a custom title for the alert notification (via Notification integrations) and corresponds to the summary line of the Alert configuration file.

Do you have any feedback for this page? If so, you can open a new issue on our netdata/learn repository.