Upgrade prom and alert manager to 2.7.1 and 0.16.1

**What**
- Removes the `alert` label in the scale-up alert
- Updates the annotaitons to use the `function_name` label instead of
the `instance` label that was removed.
- Per prometheus/prometheus#4836 and the related mailing list discussion
https://groups.google.com/d/msg/prometheus-users/7Ul6ngc7Ogs/j_YDszV5BwAJ
the alert value should not be included in the alert labels otherwise
each calculation of the alert is treated like a new alert and then the
use of `for 5s` will not behave as expected.
- Ports the resoltuion openfaas/faas-netes#372

Signed-off-by: Lucas Roesler <roesler.lucas@gmail.com>
This commit is contained in:
Lucas Roesler
2019-02-21 10:01:40 +01:00
committed by Alex Ellis
parent f2ac7b906c
commit 3bcc10a07e
3 changed files with 40 additions and 43 deletions

View File

@ -9,7 +9,6 @@ groups:
labels:
service: gateway
severity: major
value: '{{$value}}'
annotations:
description: High invocation total on {{ $labels.instance }}
summary: High invocation total on {{ $labels.instance }}
description: High invocation total on {{ $labels.function_name }}
summary: High invocation total on {{ $labels.function_name }}

View File

@ -2,15 +2,14 @@ ALERT service_down
IF up == 0
ALERT APIHighInvocationRate
IF sum ( rate(gateway_function_invocation_total{code="200"}[10s]) ) by (function_name) > 5
IF sum ( rate(gateway_function_invocation_total{code="200"}[10s]) ) by (function_name) > 5
FOR 5s
LABELS {
service = "gateway",
severity = "major",
value = "{{$value}}"
}
ANNOTATIONS {
summary = "High invocation total on {{ $labels.instance }}",
description = "High invocation total on {{ $labels.instance }}"
}
summary = "High invocation total on {{ $labels.function_name }}",
description = "High invocation total on {{ $labels.function_name }}"
}