Skip to content

Alerts

The create_alert method can be used to associate an alert with the current run. If the alert definition does not exist it will be created. The arguments are:

  • name: name of the alert
  • source: source for the alert, either metrics, events or user.
  • frequency: how often (in minutes) to check. Must be an integer, any floats provided to this argument will be rounded up
  • notification: type of notification, either none (default) or email

When source is set to metrics, the following are also required:

  • rule: type of alert, which needs to be one of:

    • is above,
    • is below,
    • is outside range,
    • is inside range.
  • metric: name of the metric to use

  • window: what time period (in minutes) over which to calculate the average of the metric. Must be an integer, any floats provided to this argument will be rounded up And, for the case of is above and is below:

  • threshold: threshold

and for is outside range and is inside range:

  • range_low: lower limit of range
  • range_high: upper limit of range

When source is set to events, the following is also required:

  • pattern: search for this string in each message

Finally when the source is set to user, this defines an alert which is manually triggered within the code itself using log_alert.

Examples

Here are some examples, illustrating each type of alert.

Threshold alert

For example, to create a threshold alert:

run.create_alert(
   name='quality too low',
   source='metrics',
   rule='is below',
   metric='quality',
   frequency=1,
   window=1,
   threshold=0.4
)
In this case if the 1-minute average of the quality metric, calculated at 1-minute intervals, goes below 0.4 the alert will be triggered.

Range-based alert

Similarly, here is an example of a range-based alert:

run.create_alert(
   name='density invalid',
   source='metrics',
   rule='is outside range',
   metric='density',
   frequency=2,
   window=2,
   range_low=1.0,
   range_high=5.0,
   notification='email'
)
In this case, if the 2-minute average of the density, calculated at 2-minute intervals, goes below 1.0 or above 5.0 the alert will be triggered. Here we set notification to email, so that an email will be sent to the user when the alert first goes critical.

Alert based on events

In this example we trigger an alert if the string error appears in an event message and send an email when this first happens:

run.create_alert(
   name='error detector',
   source='events',
   frequency=1,
   pattern='error',
   notification='email'
)

Manual alert

Finally, for this example an alert is triggered by the code itself. In order to set the alert status at a later point we need to store the identifier:

alert_id = run.create_alert(
   name='manual alert',
   source='user'
)
...
run.log_alert(alert_id, 'ok' if success else 'critical')

Aborting on alert

By default alerts are raised but no action is taken on the simulation. In cases where a simulation should be aborted if an alert is raised the argument trigger_abort should be set:

run.create_alert(
  name='abortable_alert',
  source='user',
  trigger_abort=True
)

this argument can be used for any of the alert types above.

Further Documentation