Hi,
My question is on monitoring.
Currently, we have an alert which is triggered when certain percentage of call failure happen and says the service outage.
Requirement is to send a new alert when outage is terminated(service working fine)
Any thoughts?
i thought of creating 2 alerts
Alert 1- that run every 30 min if the percentage(25%) of calls fails.
Alert 2 - to check if alert 1 is triggered in last 30 min.
When ever alert 1 is triggered, it says "Outage is happening" and it also enables alert 2. (ENABLE only if its not enabled)
Then, If Alert 1 not triggered in last 30 min, alert 2 is triggered saying "Outage is over, Service is back!".and disable the Alert 2.
Is this even possible?
I think the better solution here is to structure your search to examine the past hour, evaluate each 30 minute bin independently to determine if it should alert, and alert if the most recent 30 minute bin is in the alert state and the prior 30 minutes was not.
@chprvn, what is the query for your alert?
Alert 1 Query is to check the percentage of call failures
Alert 2 query is to check if alert one is triggered